Provides line-oriented text file editing capabilities
A Gradio web UI for running Large Language Models like LLaMA
Large Language Model Text Generation Inference
NLP Cloud serves high-performance pre-trained or custom models for NER
Python binding to the Apache Tika™ REST services
Focus on prompting and generating
Robust Speech Recognition via Large-Scale Weak Supervision
Wan2.2: Open and Advanced Large-Scale Video Generative Model
OCRmyPDF adds an OCR text layer to scanned PDF files
3D reconstruction software
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Stable Diffusion web UI
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Label Studio is a multi-type data labeling and annotation tool
Qwen3 is the large language model series developed by the Qwen team
Comprehensive Gradio WebUI for audio processing
Open-source personal AI assistant for Linux, Windows and Mac
Ready-to-use OCR with 80+ supported languages
Open-Sora: Democratizing Efficient Video Production for All
Speech recognition module for Python
Models for the spaCy Natural Language Processing (NLP) library
Awesome multilingual OCR toolkits based on PaddlePaddle
A deep learning toolkit for Text-to-Speech, battle-tested in research
Open Source Document Management System for Digital Archives
InvokeAI is a leading creative engine for Stable Diffusion models