Open Source Document Management System for Digital Archives
Cross-platform AI language practice app
The official Python SDK for the ElevenLabs API
Underthesea - Vietnamese NLP Toolkit
Generate audiobooks from e-books
A nearly-live implementation of OpenAI's Whisper
An easy 1-click way to create beautiful artwork on your PC using AI
Converts text to speech in realtime
Stable Diffusion web UI
The behavior guidance framework for customer-facing LLM agents
Implementation of Video Diffusion Models
A TTS that fits in your CPU (and pocket)
A community-supported supercharged version of paperless
A Systematic Framework for Interactive World Modeling
Python implementation of TextRank algorithms
A Model Context Protocol (MCP) server
Spark-TTS Inference Code
An Open Source text-to-speech system built by inverting Whisper
Towards Human-Level Text-to-Speech through Style Diffusion
Easy-to-use and powerful NLP library with Awesome model zoo
Documentation for Google's Gen AI site - including Gemini API & Gemma
Collection of Gemma 3 variants that are trained for performance
Marrying Grounding DINO with Segment Anything & Stable Diffusion
State-of-the-art TTS model under 25MB
Video translation and dubbing tool powered by LLMs