A Model Context Protocol (MCP) server
Controllable & emotion-expressive zero-shot TTS
Management of Yandex Station and other smart home devices
Controllable and fast Text-to-Speech for over 7000 languages
Stable Diffusion web UI
Framework for building real-time voice and multimodal AI agents
Python library for scraping and analyzing online news articles easily
A python library that makes AMR parsing, generation and visualization
Fast stable diffusion on CPU and AI PC
Interface for OuteTTS models
A Repo For Document AI
CLI tool to extract (meta)data from PDF and manipulate PDF files
Python Terminal Toolkit - a Spiced Up TUI Library
Public opinion analysis system
Scalable data pre processing and curation toolkit for LLMs
A modular graph-based Retrieval-Augmented Generation (RAG) system
Data Infrastructure providing an approach to multimodal AI workloads
High-Resolution Image Synthesis with Latent Diffusion Models
A Coverage-Guided, Native Python Fuzzer
Industrial-strength Natural Language Processing (NLP)
A sound cloning tool with a web interface, using your voice
Enhances Tesseract OCR output using LLMs (local or API)
Code and models for ICML 2024 paper, NExT-GPT
Extract audio and video content and organize it into a Markdown note
StreamSpeech is a seamless model for offline speech recognition