Python binding to the Apache Tika™ REST services
Provides line-oriented text file editing capabilities
Large Language Model Text Generation Inference
A gradio web UI for running Large Language Models like LLaMA
NLP Cloud serves high-performance pre-trained or custom models for NER
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
High-quality multilingual text-to-speech library by MyShell.ai
Comprehensive Gradio WebUI for audio processing
Awesome multilingual OCR toolkits based on PaddlePaddle
Python implementation of TextRank algorithms
Focus on prompting and generating
Speech recognition module for Python
Robust Speech Recognition via Large-Scale Weak Supervision
Ready-to-use OCR with 80+ supported languages
The behavior guidance framework for customer-facing LLM agents
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
A robust, efficient, low-latency speech-to-text library
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Wan2.2: Open and Advanced Large-Scale Video Generative Model
State-of-the-art TTS model under 25MB
A generative speech model for daily dialogue
A Powerful Native Multimodal Model for Image Generation
Dataset of GPT-2 outputs for research in detection, biases, and more