An Open Source implementation of Notebook LM with more flexibility
Open-Sora: Democratizing Efficient Video Production for All
Chinese version of Google open source project style guide
An open phone agent model & framework
Build Vision Agents quickly with any model or video provider
Provides line-oriented text file editing capabilities
Comprehensive Markdown plugin built for Django
Open source AI VTuber platform with voice chat and Live2D avatars
Pre-trained Deep Learning models and demos
Module for automatic summarization of text documents and HTML pages
Large Language Model Text Generation Inference
Oobabooga - The definitive Web UI for local AI, with powerful features
Agent Skill for generating 2D sprite sheets and maps as transparent PNGs
Document (PDF, Word, PPTX ...) extraction and parsing API
High-performance inference server and API layer for text embedding models
Hypernetworks that adapt LLMs for specific benchmark tasks
Python bindings for MuPDF's rendering library
Ready-to-use OCR with 80+ supported languages
First-class Sublime Text AI assistant with gpt-5, Opus 4.6, Gemini 3
AI tool that removes hardcoded subtitles and text from videos locally
The right way to check the weather
DICOM to PNG converter for easy viewing and sharing
Comprehensive Gradio WebUI for audio processing
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
OCRmyPDF adds an OCR text layer to scanned PDF files