A community-supported supercharged version of paperless
Python inference and LoRA trainer package for the LTX-2 audio–video
Comprehensive Gradio WebUI for audio processing
No fortress, purely open ground. OpenManus is Coming
Ready-to-use OCR with 80+ supported languages
text and image to video generation: CogVideoX (2024) and CogVideo
Generate audiobooks from e-books, voice cloning & 1107+ languages
AI Toolkit for Healthcare Imaging
Use Microsoft Edge's online text-to-speech service from Python
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open-source autonomous AI software engineer
SoTA open-source TTS
Chemcrow
Powerful tool that lets you create and run intelligent agents
Open-source multi-speaker long-form text-to-speech model
The Open Source Cowork Desktop to Unlock Your Exceptional Productivity
State-of-the-art TTS model under 25MB
A command-line productivity tool powered by AI large language models
Reference PyTorch implementation and models for DINOv3
Open Source Document Management System for Digital Archives
Framework for Telegram Bot API written in Python 3.7 with asyncio
An experimental version of DeepSeek model
Make websites accessible for AI agents
The official gpt4free repository
Qwen2.5-VL is the multimodal large language model series