Industrial-level controllable zero-shot text-to-speech system
DeepSeek Coder: Let the Code Write Itself
Easy Docker setup for Stable Diffusion with user-friendly UI
Visual Causal Flow
A Systematic Framework for Interactive World Modeling
ChatGPT interface with better UI
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
The official repo of Qwen chat & pretrained large language model
Open-source deep-learning framework
Official repository for LTX-Video
Generating Immersive, Explorable, and Interactive 3D Worlds
The Clay Foundation Model - An open source AI model and interface
Phi-3.5 for Mac: Locally-run Vision and Language Models
Hackable and optimized Transformers building blocks
Tongyi Deep Research, the Leading Open-source Deep Research Agent
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Programmatic access to the AlphaGenome model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Video Object and Interaction Deletion
Recovering the Visual Space from Any Views
LTX-Video Support for ComfyUI
Towards Real-World Vision-Language Understanding
HY-Motion model for 3D character animation generation
tiktoken is a fast BPE tokeniser for use with OpenAI's models