Awesome multilingual OCR toolkits based on PaddlePaddle
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Inference script for Oasis 500M
Provides convenient access to the Anthropic REST API from any Python 3
Easy Docker setup for Stable Diffusion with user-friendly UI
A Systematic Framework for Interactive World Modeling
A Unified Framework for Text-to-3D and Image-to-3D Generation
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
A SOTA open-source image editing model
The Clay Foundation Model - An open source AI model and interface
High-Fidelity and Controllable Generation of Textured 3D Assets
RGBD video generation model conditioned on camera input
ChatGPT interface with better UI
Stable Diffusion with Core ML on Apple Silicon
The ChatGPT Retrieval Plugin lets you easily find personal documents
AI Suite for upscaling, interpolating & restoring images/videos
StudioOllamaUI is a local, portable interface for Ollama
Open Multilingual Multimodal Chat LMs
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Example Discord bot written in Python that uses the completions API
Let us control diffusion models
Dia-1.6B generates lifelike English dialogue and vocal expressions