FAIR Sequence Modeling Toolkit 2
OCR expert VLM powered by Hunyuan's native multimodal architecture
Inference script for Oasis 500M
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Open-source large language model family from Tencent Hunyuan
Collection of Gemma 3 variants that are trained for performance
The official PyTorch implementation of Google's Gemma models
A 0.1B Omni model trained from scratch
Block Diffusion for Ultra-Fast Speculative Decoding
Long-form streaming TTS system for multi-speaker dialogue generation
A SOTA open-source image editing model
Pretrained time-series foundation model developed by Google Research
Official implementation of DreamCraft3D
VMZ: Model Zoo for Video Modeling
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
LLM-based reinforcement learning audio editing model
Hackable and optimized Transformers building blocks
ChatGPT interface with better UI
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open-source, high-performance Mixture-of-Experts large language model
The ChatGPT Retrieval Plugin lets you easily find personal documents
Powerful open-source image generation model
Open Multilingual Multimodal Chat LMs
Release for Improved Denoising Diffusion Probabilistic Models
Real-time behaviour synthesis with MuJoCo, using Predictive Control