Open-source multi-speaker long-form text-to-speech model
Open-source, high-performance AI model with advanced reasoning
GPT4V-level open-source multi-modal model based on Llama3-8B
Official inference repo for FLUX.2 models
Inference framework for 1-bit LLMs
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Agentic, Reasoning, and Coding (ARC) foundation models
Build portable, production-ready MLOps pipelines
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Qwen3 is the large language model series developed by Qwen team
A simple, high-quality voice conversion tool focused on ease of use
A modular graph-based Retrieval-Augmented Generation (RAG) system
Image inpainting tool powered by SOTA AI Model
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Large-language-model & vision-language-model based on Linear Attention
A collaboration friendly studio for NeRFs
Qwen2.5-VL is the multimodal large language model series
Stable Diffusion built-in to Blender
Anthropic's original performance take-home, now open for you to try
The official repo of Qwen chat & pretrained large language model
FAIR Sequence Modeling Toolkit 2
Speech-AI-Forge is a project developed around TTS generation model
Best practices on recommendation systems