Parse files for optimal RAG
Audiocraft is a library for audio processing and generation
Open source healthcare AI
Long-form streaming TTS system for multi-speaker dialogue generation
RAG-Anything: All-in-One RAG Framework
Marrying Grounding DINO with Segment Anything & Stable Diffusion
FastAPI framework, high performance, easy to learn, fast to code
Interface for OuteTTS models
MARS5 speech model (TTS) from CAMB.AI
A library to help you make the most out of your Pixoo 64
A Model Context Protocol (MCP) server
TextWorld is a sandbox learning environment for the training
The best free open source website change detection and restock service
LLM
Automatically translates the text of a video based on a subtitle file
Bidirectional token-classification model for identifiable info
HY-Motion model for 3D character animation generation
Search all of YouTube from the command line
Scalable data pre processing and curation toolkit for LLMs
Multi-lingual large voice generation model, providing inference
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Open-Sora: Democratizing Efficient Video Production for All
lightweight package to simplify LLM API calls
A modular graph-based Retrieval-Augmented Generation (RAG) system