Python binding to the Apache Tika™ REST services
The official Python SDK for the ElevenLabs API
Recognition and resolution of numbers, units, date/time, etc.
Provides line-oriented text file editing capabilities
Large Language Model Text Generation Inference
Module for automatic summarization of text documents and HTML pages
High-performance inference server and API layer for text embedding models
Hypernetworks that adapt LLMs for specific benchmark tasks
Document (PDF, Word, PPTX ...) extraction and parsing API
Oobabooga - The definitive Web UI for local AI, with powerful features
NLP Cloud serves high-performance pre-trained or custom models for NER
A playground to generate images from any text prompt using SD
TTS with Kokoro and ONNX Runtime
Speech-to-text, text-to-speech, and speaker recognition
AI tool that removes hardcoded subtitles and text from videos locally
Python tool for converting files and office documents to Markdown
Open source annotation tool for machine learning practitioners
Comprehensive Gradio WebUI for audio processing
OCRmyPDF adds an OCR text layer to scanned PDF files
Use Microsoft Edge's online text-to-speech service from Python
A robust, efficient, low-latency speech-to-text library
Official inference repo for FLUX.1 models
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Offline text-to-speech synthesis for Python
Focus on prompting and generating