The most powerful local music generation model
Autoregressive Model Beats Diffusion
Create videos with Stable Diffusion
OCR model for complex documents with layout-aware structured outputs
The most accurate natural language detection library for Python
Toolkit for conversational AI
Unleashing 10,000+ Word Generation from Long Context LLMs
Stable Diffusion WebUI optimized for AMD GPUs with editing tools
Context-aware desktop AI assistant that understands screen content
Capable of understanding text, audio, vision, video
Document content and metadata extraction microservice
A Web UI for easy subtitle using whisper model
StreamSpeech is a seamless model for offline speech recognition
An open source implementation of CLIP
End-to-end speech processing toolkit
Automated translation solution for visual novels
A very simple framework for state-of-the-art NLP
A high-quality PDF to Markdown tool based on large language model
Long-form streaming TTS system for multi-speaker dialogue generation
A Unified Framework for Text-to-3D and Image-to-3D Generation
lightweight package to simplify LLM API calls
A Model Context Protocol (MCP) server
LLM
Stanford NLP Python library for many human languages
Open source NLP guide with models, methods, and real use cases