Object detection architectures and models pretrained on the COCO data
Structure-from-Motion and Multi-View Stereo
The agent that grows with you
Agentic, Reasoning, and Coding (ARC) foundation models
Speech Recognition Toolkit
Advanced language and coding AI model
Awesome multilingual OCR toolkits based on PaddlePaddle
A free, open source, and extensible speech-to-text application
Official code repo for the O'Reilly Book
Document Management System and Content Management System
GFPGAN aims at developing Practical Algorithms
The most powerful local music generation model
A telegram bot that will give instant stream links for telegram files
1 min voice data can also be used to train a good TTS model
ClawdBot one-click deployment tool
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Vector Database for the next generation of AI applications
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Uncensored, open-source alternative to Higgsfield AI
The open source coding agent
Open-source vector similarity search for Postgres
Official inference repo for FLUX.2 models
NVR with realtime local object detection for IP cameras
Simple and powerful voice changer for Linux, written with Python & GTK
Wan2.1: Open and Advanced Large-Scale Video Generative Model