Faster Whisper transcription with CTranslate2
The best way to use Hermes Agent from the web or from your phone
Offline Text To Speech synthesis for python
Open-source AI agent framework
A Lightweight Face Recognition and Facial Attribute Analysis
High-Quality Voice Cloning TTS for 600+ Languages
Generate audiobooks from e-books, voice cloning & 1107+ languages
SoTA open-source TTS
Qwen3-TTS is an open-source series of TTS models
Comprehensive Gradio WebUI for audio processing
Advanced LLM-powered brute-force tool combining AI intelligence
All-in-one WebUI for AI generative image and video creation
Open-source autonomous AI software engineer
Aider is AI pair programming in your terminal
Tokenizer-Free TTS for Multilingual Speech Generation
Deepfakes Software For All
Ready-to-use OCR with 80+ supported languages
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LTX-Video Support for ComfyUI
Python inference and LoRA trainer package for the LTX-2 audio–video
Text and image to video generation: CogVideoX and CogVideo
A Python wrapper you can't refuse
Fast stable diffusion on CPU and AI PC
lightweight package to simplify LLM API calls
AI-powered video clipping and highlight generation