Deep Research framework, combining language models with tools
Create Customized Software using Natural Language Idea
Powerful AI language model (MoE) optimized for efficiency/performance
SOTA Open Source TTS
Plan-first AI workflow plugin for Claude Code, OpenAI Codex
AI video generator optimized for low VRAM and older GPUs use
A modular graph-based Retrieval-Augmented Generation (RAG) system
Speech-AI-Forge is a project developed around TTS generation model
21 Lessons, Get Started Building with Generative AI
Automate the process of making money online
A set of ready to use Agent Skills for research, science, engineering
PPTAgent: Generating and Evaluating Presentations
Instruction-tuning LLM with Chinese Medical Knowledge
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A collaboration friendly studio for NeRFs
Pre & Post-training & Dataset & Evaluation & Depoly & RAG
A course of learning LLM inference serving on Apple Silicon
Instant voice cloning by MIT and MyShell. Audio foundation model
The repository provides code for running inference with SAM 2
Open-source infrastructure for Computer-Use Agents. Sandboxes
Official Python inference and LoRA trainer package
Robust Speech Recognition via Large-Scale Weak Supervision
3D reconstruction software
Open-source, high-performance AI model with advanced reasoning
Implementation of TurboQuant (ICLR 2026)