Python binding to the Apache Tika™ REST services
The official Python SDK for the ElevenLabs API
Provides line-oriented text file editing capabilities
Large Language Model Text Generation Inference
A Gradio web UI for running Large Language Models like LLaMA
NLP Cloud serves high-performance pre-trained or custom models for NER
Parse files for optimal RAG
High-quality multi-lingual text-to-speech library by MyShell.ai
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
File Parser optimised for LLM Ingestion with no loss
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Code for running inference and finetuning with the SAM 3 model
Awesome multilingual OCR toolkits based on PaddlePaddle
Focus on prompting and generating
A robust, efficient, low-latency speech-to-text library
Python library and CLI tool to interface with Google Translate
Comprehensive Gradio WebUI for audio processing
Python implementation of TextRank algorithms
A text-to-speech, speech-to-text and speech-to-speech library
Robust Speech Recognition via Large-Scale Weak Supervision
Offline Text To Speech synthesis for Python
Python tool for converting files and office documents to Markdown
Use Microsoft Edge's online text-to-speech service from Python
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML