Open source AI VTuber platform with voice chat and Live2D avatars
Dealing with all unstructured data, for tasks such as reverse image search
Automate native Android apps with AI using accessibility APIs
Open source libraries and APIs to build custom preprocessing pipelines
Official Implementation for "Recursive Multi-Agent Systems"
Repository containing notebooks of my posts on Medium
Using AI models to automatically provide commentary and edit videos
Qwen3-ASR is an open-source series of ASR models
Making RAG Simpler with Small and Open-Sourced Language Models
A New Axis of Sparsity for Large Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Flexible Photo Recrafting While Preserving Your Identity
Bailing is a voice dialogue robot similar to GPT-4o
Implementation of "MobileCLIP" CVPR 2024
Tensor search for humans
Context database designed specifically for AI Agents
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Qwen3 is the large language model series developed by Qwen team
A python tool that uses GPT-4, FFmpeg, and OpenCV
Implementation of Video Diffusion Models
SOTA discrete acoustic codec models with 40/75 tokens per second
Build a large language model from scratch with only a Python foundation
PPTAgent: Generating and Evaluating Presentations
Deterministic LLM Outputs for AI Applications and AI Agents
An open phone agent model & framework