A very simple framework for state-of-the-art NLP
AI-powered tool for generating, optimizing, and translating subtitles
Easily compute clip embeddings and build a clip retrieval system
OCR software, free and offline
Python binding to the Apache Tika™ REST services
Automatic Speech Recognition with Word-level Timestamps
Advanced NLP with spaCy: A free online course
Audiocraft is a library for audio processing and generation
An opinionated CLI to transcribe Audio files w/ Whisper on-device
End-to-end speech processing toolkit
Public opinion analysis system
Faster Whisper transcription with CTranslate2
Use Microsoft Edge's online text-to-speech service from Python
Pretrained model hub for Keras 3
Stable Diffusion web UI
Open source no-code system for text annotation and building of text
Deep Research framework, combining language models with tools
Framework for building realtime multimodal voice AI agents apps
Fast and customizable framework for automatic ML model creation
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Chinese XLNet pre-trained model
Bidirectional token-classification model for identifiable info
Voice Recognition to Text Tool
Shared repository for open-sourced projects from the Google AI Lang
Dealing with all unstructured data, such as reverse image search