A trainable PyTorch reproduction of AlphaFold 3
Large Multimodal Models for Video Understanding and Editing
Qwen3-omni is a natively end-to-end, omni-modal LLM
A series of math-specific large language models of our Qwen2 series
Programmatic access to the AlphaGenome model
26m function call model that runs on incredibly small devices
Video understanding codebase from FAIR for reproducing video models
Bidirectional token-classification model for identifiable info
Genome modeling and design across all domains of life
Project Lyra: Open Generative 3D World Models
Achieving 3+ generation speedup on reasoning tasks
Ultra-Efficient LLMs on End Device
Easy Docker setup for Stable Diffusion with user-friendly UI
PyTorch code and models for the DINOv2 self-supervised learning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Global weather forecasting model using graph neural networks and JAX
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
CodeGeeX2: A More Powerful Multilingual Code Generation Model
The Clay Foundation Model - An open source AI model and interface
Open-source large language model family from Tencent Hunyuan
Open Source Speech Language Model
Open-source industrial-grade ASR models
Foundation model for image generation