Python inference and LoRA trainer package for the LTX-2 audio–video
Official inference repo for FLUX.2 models
Towards Human-Level Text-to-Speech through Style Diffusion
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Long-form streaming TTS system for multi-speaker dialogue generation
PyTorch code and models for VJEPA2 self-supervised learning from video
The repository provides code for running inference with SAM 2
Visual Causal Flow
High-Resolution Image Synthesis with Latent Diffusion Models
Multi-modal large language model designed for audio understanding
Pure C inference for the Flux 2 image generation model
An experimental version of the DeepSeek model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Code for the paper "Language Models are Unsupervised Multitask Learners"
From Images to High-Fidelity 3D Assets
Qwen2.5-VL is the multimodal large language model series
Get up and running with Llama 2 and other large language models
Inference Llama 2 in one file of pure C
LLM Frontend for Power Users
3D reconstruction software
Subtitle Creation Assistant
Code for running inference and finetuning with SAM 3 model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
Gives you a whole dev team of AI agents in your code editor