Automatic Speech Recognition with Word-level Timestamps
Python & command-line tool to gather text on the Web
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Converts text to speech in realtime
SOTA Open Source TTS
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Ready-to-use OCR with 80+ supported languages
Comprehensive Markdown plugin built for Django
Voice Recognition to Text Tool
Official inference repo for FLUX.2 models
Compute distance between sequences
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A python parametric CAD scripting framework based on OCCT
Tokenizer-Free TTS for Multilingual Speech Generation
A text-to-speech, speech-to-text and speech-to-speech library
Text and image to video generation: CogVideoX and CogVideo
A Python utility / library to sort imports
High accuracy RAG for answering questions from scientific documents
Statusline plugin for vim with prompts for several other applications
Faster Whisper transcription with CTranslate2
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A simple, high-quality voice conversion tool focused on ease of use
CLIP, Predict the most relevant text snippet given an image
A pure-python PDF library capable of splitting, merging, cropping
Persian NLP Toolkit