Open source personal AI Assistant for Linux, Windows and Mac
Spark-TTS Inference Code
Multimodal embedding and reranking models built on Qwen3-VL
Create prompt-friendly codebase digests from any Git repository URL
Foundational model for human-like, expressive TTS
A Model Context Protocol (MCP) server
TextWorld is a sandbox learning environment for the training
Han Language Processing
The Python code to reproduce illustrations from Machine Learning Book
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
A python library that makes AMR parsing, generation and visualization
Stable Diffusion web UI
lightweight package to simplify LLM API calls
The most powerful local music generation model
A list of free LLM inference resources accessible via API
A Repo For Document AI
Unified web UI for training and running open models locally
Instant voice cloning by MIT and MyShell. Audio foundation model
A modular graph-based Retrieval-Augmented Generation (RAG) system
User toolkit for analyzing and interfacing with Large Language Models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Public opinion analysis system
Automatically translates the text of a video based on a subtitle file
Flowly is 100x faster than OpenClaw