The official Python SDK for the ElevenLabs API
Python binding to the Apache Tika™ REST services
Large Language Model Text Generation Inference
Recognition and resolution of numbers, units, date/time, etc.
Provides line-oriented text file editing capabilities
Document (PDF, Word, PPTX ...) extraction and parse API
High-performance inference server for text embeddings models API layer
Oobabooga - The definitive Web UI for local AI, with powerful features
Module for automatic summarization of text documents and HTML pages
Hypernetworks that adapt LLMs for specific benchmark tasks
NLP Cloud serves high performance pre-trained or custom models for NER
A playground to generate images from any text prompt using SD
Comprehensive Gradio WebUI for audio processing
Python tool for converting files and office documents to Markdown
Speech-to-text, text-to-speech, and speaker recognition
TTS with kokoro and onnx runtime
OCRmyPDF adds an OCR text layer to scanned PDF files
A robust, efficient, low-latency speech-to-text library
Awesome multilingual OCR toolkits based on PaddlePaddle
Offline Text To Speech synthesis for python
Python library and CLI tool to interface with Google Translate
Use Microsoft Edge's online text-to-speech service from Python
A generative speech model for daily dialogue
A text-to-speech, speech-to-text and speech-to-speech library
Official inference repo for FLUX.1 models