Official inference repo for FLUX.2 models
Reference implementations of MLPerf™ training benchmarks
A Family of Open-Source Music Foundation Models
A Customizable Image-to-Video Model based on HunyuanVideo
Towards Human-Level Text-to-Speech through Style Diffusion
MARS5 speech model (TTS) from CAMB.AI
A high-quality rapid TTS voice cloning model
A sound cloning tool with a web interface, using your voice
Interface for OuteTTS models
A lightweight text-to-speech model with zero-shot voice cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Implementation of Vision Transformer, a simple way to achieve SOTA
Instant voice cloning by MIT and MyShell. Audio foundation model
Z80-μLM is a 2-bit quantized language model
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Specification and documentation for Agent Skills
Personalize Any Characters with a Scalable Diffusion Transformer
Taming Stable Diffusion for Lip Sync
Framework for building neural networks
The official Meta Llama 3 GitHub site
Utilities intended for use with Llama models
Set of tools to assess and improve LLM security
An implementation of a deep learning recommendation model (DLRM)
Official DeiT repository