TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A framework to enable multimodal models to operate a computer
Python framework for AI workflows and pipelines with chain of thought
Automated translation solution for visual novels
Low-code framework for building custom LLMs, neural networks
Deep learning optimization library: makes distributed training easy
AI-Powered tool for automated pull request analysis
A state-of-the-art open visual language model
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Collection of Gemma 3 variants that are trained for performance
A unified, comprehensive and efficient recommendation library
A simple screen parsing tool towards pure vision based GUI agent
Tooling for the Common Objects In 3D dataset
Real-time voice interactive digital human
Hummingbird compiles trained ML models into tensor computation
Mentat - The AI Coding Assistant
Ultimate meta-skill for generating best-in-class Claude Code skills
Open Multilingual Multimodal Chat LMs
Deep learning library
Large-language-model & vision-language-model based on Linear Attention
Capable of understanding text, audio, vision, video
An easy-to-use LLMs quantization package with user-friendly apis
Visual Instruction Tuning: Large Language-and-Vision Assistant
Autonomous GPT-4 agent platform