A Gradio web UI for running Large Language Models like LLaMA
Provides line-oriented text file editing capabilities
Large Language Model Text Generation Inference
Python binding to the Apache Tika™ REST services
NLP Cloud serves high-performance pre-trained or custom models for NER
Focus on prompting and generating
OCRmyPDF adds an OCR text layer to scanned PDF files
Robust Speech Recognition via Large-Scale Weak Supervision
A GUI tool for extracting hard-coded subtitles (hardsub) from videos
Stable Diffusion web UI
A deep learning toolkit for Text-to-Speech, battle-tested in research
Open-Sora: Democratizing Efficient Video Production for All
Awesome multilingual OCR toolkits based on PaddlePaddle
State-of-the-art TTS model under 25MB
Ready-to-use OCR with 80+ supported languages
Open Source Document Management System for Digital Archives
Central interface to connect your LLMs with external data
Comprehensive Gradio WebUI for audio processing
Speech recognition module for Python
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Web interface for generating images using Stable Diffusion models
InvokeAI is a leading creative engine for Stable Diffusion models
Label Studio is a multi-type data labeling and annotation tool
LLM abstractions that aren't obstructions