Python binding to the Apache Tika™ REST services
Provides line-oriented text file editing capabilities
Large Language Model Text Generation Inference
A Gradio web UI for running Large Language Models like LLaMA
NLP Cloud serves high-performance pre-trained or custom models for NER
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
High-quality multi-lingual text-to-speech library by MyShell.ai
Comprehensive Gradio WebUI for audio processing
Awesome multilingual OCR toolkits based on PaddlePaddle
Focus on prompting and generating
Python implementation of TextRank algorithms
Speech recognition module for Python
Robust Speech Recognition via Large-Scale Weak Supervision
The behavior guidance framework for customer-facing LLM agents
Ready-to-use OCR with 80+ supported languages
A robust, efficient, low-latency speech-to-text library
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
State-of-the-art TTS model under 25MB
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Code for running inference and finetuning with SAM 3 model
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A generative speech model for daily dialogue