A lightweight text-to-speech model with zero-shot voice cloning
Wan2.2: Open and Advanced Large-Scale Video Generative Model
SOTA Open Source TTS
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Converts text to speech in real time
Python & command-line tool to gather text on the Web
Ready-to-use OCR with 80+ supported languages
Official inference repo for FLUX.2 models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Voice Recognition to Text Tool
Comprehensive Markdown plugin built for Django
A Python parametric CAD scripting framework based on OCCT
Text and image to video generation: CogVideoX and CogVideo
Tokenizer-Free TTS for Multilingual Speech Generation
Compute distance between sequences
A text-to-speech, speech-to-text and speech-to-speech library
Faster Whisper transcription with CTranslate2
High-accuracy RAG for answering questions from scientific documents
A Python utility / library to sort imports
A simple, high-quality voice conversion tool focused on ease of use
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
CLIP: predict the most relevant text snippet given an image
Statusline plugin for Vim with prompts for several other applications
Label Studio is a multi-type data labeling and annotation tool
Unified web UI for training and running open models locally