Official inference repo for FLUX.2 models
Python inference and LoRA trainer package for the LTX-2 audio–video
Project Lyra: Open Generative 3D World Models
Long-form streaming TTS system for multi-speaker dialogue generation
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Visual Causal Flow
Official Python inference and LoRA trainer package
Flux 2 image generation model pure C inference
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
From Images to High-Fidelity 3D Assets
Qwen2.5-VL is the multimodal large language model series
An experimental version of DeepSeek model
A Multi-Modal World Model for Reconstructing, Generating, Simulation
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Tooling for the Common Objects In 3D dataset
DeepSeek Coder: Let the Code Write Itself
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Code for running inference and finetuning with SAM 3 model
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Advancing Formal Mathematical Reasoning via Reinforcement Learning
State-of-the-art TTS model under 25MB
Z80-μLM is a 2-bit quantized language model
Video understanding codebase from FAIR for reproducing video models
Large-language-model & vision-language-model based on Linear Attention
Controllable & emotion-expressive zero-shot TTS