Open-source AI agent framework
AI Fully Automated Short Video Engine
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Generate short videos with one click using an LLM
A lightweight audio-to-MIDI converter with pitch bend detection
Open-source multi-speaker long-form text-to-speech model
A Multi-Modal World Model for Reconstruction, Generation, and Simulation
Tokenizer-Free TTS for Multilingual Speech Generation
Open-source AI VTuber platform with voice chat and Live2D avatars
Universal LLM Deployment Engine with ML Compilation
Official inference repo for FLUX.1 models
Advancing Open-source World Models
A high-throughput and memory-efficient inference and serving engine
Uses Qwen3-ASR, a local LLM, Whisper, and TEN-VAD
From Images to High-Fidelity 3D Assets
Powerful tool that lets you create and run intelligent agents
Code to accompany "A Method for Animating Children's Drawings"
Qwen3-Coder is the code version of Qwen3
Image polygonal annotation with Python
The highest-scoring AI memory system ever benchmarked
Reference PyTorch implementation and models for DINOv3
An open-source implementation of NotebookLM with more flexibility
Use Microsoft Edge's online text-to-speech service from Python
Build resilient language agents as graphs
21 Lessons, Get Started Building with Generative AI