High-quality multi-lingual text-to-speech library by MyShell.ai
Comprehensive Gradio WebUI for audio processing
Speech to Text to Speech, sends text as OSC messages
Transcribe any audio to text, translate and edit subtitles 100% locall
Open source text-to-speech tool, supports extra-long text
OCR software, free and offline
A lightweight text-to-speech model with zero-shot voice cloning
Official inference repo for FLUX.2 models
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Subtitle Creation Assistant
OCR offline image text recognition command line windows program
Robust Speech Recognition via Large-Scale Weak Supervision
Modest natural-language processing
SOTA Open Source TTS
A generative speech model for daily dialogue
Discourse Network Analyzer (DNA)
Use Microsoft Edge's online text-to-speech service from Python
A simple native web interface that uses ChatTTS to synthesize text
Python library and CLI tool to interface with Google Translate
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
MTEB: Massive Text Embedding Benchmark
Underthesea - Vietnamese NLP Toolkit
Generating Immersive, Explorable, and Interactive 3D Worlds
Image generation model with single-stream diffusion transformer
Qwen-Image is a powerful image generation foundation model