Transforming Multimodal Content into Captivating Multilingual Audio
Statusline plugin for vim with prompts for several other applications
A general purpose syntax highlighter in pure Go
Free, high-quality text-to-speech API endpoint to replace OpenAI
A Model Context Protocol (MCP) server
Persian NLP Toolkit
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A library for converting HTML into PDFs using ReportLab
A Powerful Native Multimodal Model for Image Generation
Framework for building real-time voice and multimodal AI agents
Chat with it via text and voice
A Family of Open Sourced Music Foundation Models
TextWorld is a sandbox learning environment for the training
Compute distance between sequences
Snippet solution for Vim
Implementation of Phenaki Video, which uses Mask GIT
Easily compute clip embeddings and build a clip retrieval system
Flet enables developers to easily build realtime web and mobile apps
Mozc - a Japanese Input Method Editor designed for multi-platform
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A simple native web interface that uses ChatTTS to synthesize text
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Industrial-level controllable zero-shot text-to-speech system
Tools for manipulating datasets
Full git and GitHub integration with Sublime Text