Large Language Model Text Generation Inference
High-performance inference server for text embeddings models API layer
Oobabooga - The definitive Web UI for local AI, with powerful features
Document (PDF, Word, PPTX ...) extraction and parse API
Hypernetworks that adapt LLMs for specific benchmark tasks
Provides line-oriented text file editing capabilities
Module for automatic summarization of text documents and HTML pages
AI tool that removes hardcoded subtitles and text from videos locally
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Comprehensive Gradio WebUI for audio processing
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Generate audiobooks from EPUBs, PDFs and text with captions
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Focus on prompting and generating
A text-to-speech, speech-to-text and speech-to-speech library
MTEB: Massive Text Embedding Benchmark
Wan2.1: Open and Advanced Large-Scale Video Generative Model
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Open source no-code system for text annotation and building of text
SOTA Open Source TTS
Code for running inference and finetuning with SAM 3 model
Video-based AI memory library. Store millions of text chunks in MP4
A generative speech model for daily dialogue
Implementation of Imagen, Google's Text-to-Image Neural Network