Python binding to the Apache Tika™ REST services
Recognition and resolution of numbers, units, date/time, etc.
Provides line-oriented text file editing capabilities
Large Language Model Text Generation Inference
A Gradio web UI for running Large Language Models like LLaMA
NLP Cloud serves high-performance pre-trained or custom models for NER
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
High-quality multi-lingual text-to-speech library by MyShell.ai
Comprehensive Gradio WebUI for audio processing
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Focus on prompting and generating
Speech recognition module for Python
Python implementation of TextRank algorithms
Robust Speech Recognition via Large-Scale Weak Supervision
The behavior guidance framework for customer-facing LLM agents
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Wan2.2: Open and Advanced Large-Scale Video Generative Model
A robust, efficient, low-latency speech-to-text library
State-of-the-art TTS model under 25 MB
A Powerful Native Multimodal Model for Image Generation
A generative speech model for daily dialogue
Dataset of GPT-2 outputs for research in detection, biases, and more