Native and Compact Structured Latents for 3D Generation
3D reconstruction software
Robust Speech Recognition via Large-Scale Weak Supervision
Effortless data labeling with AI support from Segment Anything
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Official Python inference and LoRA trainer package
An open source implementation of CLIP
Official inference repo for FLUX.1 models
Fast and memory-efficient exact attention
OCR software, free and offline
Wan2.1: Open and Advanced Large-Scale Video Generative Model
NVR with realtime local object detection for IP cameras
Generate audiobooks from e-books
Lightweight Face Recognition and Facial Attribute Analysis
OBLITERATE THE CHAINS THAT BIND YOU
Awesome multilingual OCR toolkits based on PaddlePaddle
Improve your Baduk skills by training with KataGo
AI video generator optimized for low-VRAM and older GPUs
Python tool for converting files and office documents to Markdown
YOLOv5 is the world's most loved vision AI
AI Fully Automated Short Video Engine
A community-supported supercharged version of paperless
The agent that grows with you
Faster Whisper transcription with CTranslate2
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos