A very simple framework for state-of-the-art NLP
Easy-to-use and powerful NLP library with Awesome model zoo
A sound cloning tool with a web interface, using your voice
StreamSpeech is a seamless model for offline speech recognition
Speech-AI-Forge is a project developed around TTS generation model
Official Python inference and LoRA trainer package
Deep Research framework, combining language models with tools
Voice Recognition to Text Tool
Towards Human-Sounding Speech
Instant voice cloning by MIT and MyShell. Audio foundation model
An Open Source implementation of Notebook LM with more flexibility
The most powerful local music generation model
Knowledge Graph Generation from Any Text
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
Unified web UI for training and running open models locally
Han Language Processing
Create videos with Stable Diffusion
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Foundational model for human-like, expressive TTS
Flowly is 100x faster than OpenClaw
High-Resolution Image Synthesis with Latent Diffusion Models
Framework for building real-time voice and multimodal AI agents
StarVector is a foundation model for SVG generation
Free, high-quality text-to-speech API endpoint to replace OpenAI