A PyTorch library for implementing flow matching algorithms
text and image to video generation: CogVideoX (2024) and CogVideo
Official inference repo for FLUX.1 models
A Powerful Native Multimodal Model for Image Generation
State-of-the-art Parameter-Efficient Fine-Tuning
Virtual AI anchor that combines state-of-the-art technology
Global weather forecasting model using graph neural networks and JAX
SOTA Open Source TTS
Speech-AI-Forge is a project developed around TTS generation model
A fast TTS architecture with conditional flow matching
An Open Source text-to-speech system built by inverting Whisper
MII makes low-latency and high-throughput inference possible
Replace OpenAI GPT with another LLM in your app
Advanced language and coding AI model
Qwen3-omni is a natively end-to-end, omni-modal LLM
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Implementation of Recurrent Interface Network (RIN)
Consistency Distilled Diff VAE
A Universal Customization Method for Single and Multi Conditioning
Flexible Photo Recrafting While Preserving Your Identity
Official code for Style Aligned Image Generation via Shared Attention
Plug-n-play module turning text-to-image models into animation
A simple, high-quality voice conversion tool focused on ease of use
AI discovers 520000 stable inorganic crystal structures for research
Reference PyTorch implementation and models for DINOv3