The most powerful local music generation model
Controllable & emotion-expressive zero-shot TTS
Management of Yandex Station and other smart home devices
Controllable and fast Text-to-Speech for over 7000 languages
Python library for scraping and analyzing online news articles easily
A Python library that makes AMR parsing, generation and visualization simple
Framework for building real-time voice and multimodal AI agents
CLI tool to extract (meta)data from PDF and manipulate PDF files
Python Terminal Toolkit - a Spiced Up TUI Library
A Repo For Document AI
A Coverage-Guided, Native Python Fuzzer
Scalable data preprocessing and curation toolkit for LLMs
A modular graph-based Retrieval-Augmented Generation (RAG) system
Public opinion analysis system
Interface for OuteTTS models
Stable Diffusion web UI
Data Infrastructure providing an approach to multimodal AI workloads
Fast stable diffusion on CPU and AI PC
Enhances Tesseract OCR output using LLMs (local or API)
Code and models for ICML 2024 paper, NExT-GPT
Extract audio and video content and organize it into a Markdown note
StreamSpeech is a seamless model for offline speech recognition
High-Resolution Image Synthesis with Latent Diffusion Models
Personal mini-web in text
A sound cloning tool with a web interface, using your voice