High-performance inference server for text embeddings models API layer
A playground to generate images from any text prompt using SD
Hypernetworks that adapt LLMs for specific benchmark tasks
Advanced translator plugin that can be used to translate Unity games
Code for openai.fm, a demo for the OpenAI Speech API
TTS with kokoro and onnx runtime
Offline inference engine for art, real-time voice conversations
A robust, efficient, low-latency speech-to-text library
Mozc - a Japanese Input Method Editor designed for multi-platform
Canvas-based WYSIWYG rich text editor with advanced layout tools
Speech-AI-Forge is a project developed around TTS generation model
Official inference repo for FLUX.1 models
High-Quality Voice Cloning TTS for 600+ Languages
Tokenizer-Free TTS for Multilingual Speech Generation
Multimodal-Driven Architecture for Customized Video Generation
The home of the ICU project source code
Handwritten Text Recognition (HTR) system implemented with TensorFlow
A Family of Open Sourced Music Foundation Models
Lightning-fast, on-device TTS, running natively via ONNX
super expressive prompting model based on ltx2.3
Simple React Component That Makes Titles More Readable
A fast TTS architecture with conditional flow matching
JavaScript OCR and text extraction for images and PDFs
A high-quality rapid TTS voice cloning model
Framework for building realtime multimodal voice AI agents apps