Python inference and LoRA trainer package for the LTX-2 audio–video
Official inference repo for FLUX.2 models
Project Lyra: Open Generative 3D World Models
Long-form streaming TTS system for multi-speaker dialogue generation
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Visual Causal Flow
Official Python inference and LoRA trainer package
Multi-modal large language model designed for audio understanding
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
An experimental version of DeepSeek model
From Images to High-Fidelity 3D Assets
Qwen2.5-VL is the multimodal large language model series
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Code for running inference and finetuning with SAM 3 model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
FAIR Sequence Modeling Toolkit 2
State-of-the-art TTS model under 25MB
Controllable & emotion-expressive zero-shot TTS
DeepSeek Coder: Let the Code Write Itself
High-Fidelity and Controllable Generation of Textured 3D Assets
Z80-μLM is a 2-bit quantized language model
Video understanding codebase from FAIR for reproducing video models
Tooling for the Common Objects In 3D dataset
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1