Public repository for Agent Skills
Fast and memory-efficient exact attention
OCRmyPDF adds an OCR text layer to scanned PDF files
Comprehensive Gradio WebUI for audio processing
OCR software, free and offline
Python tool for converting files and office documents to Markdown
The agent that grows with you
Powerful AI language model (MoE) optimized for efficiency/performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model
3D reconstruction software
Effortless data labeling with AI support from Segment Anything
Improve your Baduk skills by training with KataGo
Robust Speech Recognition via Large-Scale Weak Supervision
NVR with realtime local object detection for IP cameras
Agentic, Reasoning, and Coding (ARC) foundation models
AI video generator optimized for low VRAM and older GPUs use
Awesome multilingual OCR toolkits based on PaddlePaddle
Open-source, high-performance AI model with advanced reasoning
Official inference repo for FLUX.1 models
Deep Research framework, combining language models with tools
Powerful Android AI agent with tools, automation, and Linux shell
The most powerful local music generation model
Data manipulation and transformation for audio signal processing
Native and Compact Structured Latents for 3D Generation
OBLITERATE THE CHAINS THAT BIND YOU