Comprehensive Gradio WebUI for audio processing
Real-World Centric Foundation GUI Agents
Python inference and LoRA trainer package for the LTX-2 audio–video
Generate audiobooks from e-books, voice cloning & 1107+ languages
text and image to video generation: CogVideoX (2024) and CogVideo
SoTA open-source TTS
Use Microsoft Edge's online text-to-speech service from Python
Ready-to-use OCR with 80+ supported languages
Open-source autonomous AI software engineer
Powerful tool that lets you create and run intelligent agents
AI Toolkit for Healthcare Imaging
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Chemcrow
Reference PyTorch implementation and models for DINOv3
Framework for Telegram Bot API written in Python 3.7 with asyncio
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
Make websites accessible for AI agents
EPUB to audiobook converter, optimized for Audiobookshelf
Qwen2.5-VL is the multimodal large language model series
State-of-the-art TTS model under 25MB
The official gpt4free repository
A sound cloning tool with a web interface, using your voice
An experimental version of DeepSeek model
A python tool that uses GPT-4, FFmpeg, and OpenCV
HY-Motion model for 3D character animation generation