Python Terminal Toolkit - a Spiced Up TUI Library
Lightweight Markdown-only skills for autonomous ML research
RAG-Anything: All-in-One RAG Framework
An Open Source text-to-speech system built by inverting Whisper
Synchronized Translation for Videos
Unifying 3D Mesh Generation with Language Models
A high-quality PDF to Markdown tool based on large language model
Controllable & emotion-expressive zero-shot TTS
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A Python library for extracting structured information
Knowledge Graph Generation from Any Text
Audiocraft is a library for audio processing and generation
Data Infrastructure providing an approach to multimodal AI workloads
Build multimodal language agents for fast prototype and production
Fast multimodal LLM for real-time voice interaction and AI apps
A Web UI for easy subtitle using whisper model
Lightning-fast, on-device TTS, running natively via ONNX
Collection of Gemma 3 variants that are trained for performance
LaTeX source and supporting code for Think Python, 2nd edition
Framework for building AI-powered interactive digital humans and agent
Towards Human-Sounding Speech
Zero-copy PDF text extraction library written in Zig
PersonaPlex code
StreamSpeech is a seamless model for offline speech recognition
go1pylib is a Python library designed to control the Go1 robot