Wan2.2: Open and Advanced Large-Scale Video Generative Model
Fast and memory-efficient exact attention
Image polygonal annotation with Python
Official inference repo for FLUX.1 models
Awesome multilingual OCR toolkits based on PaddlePaddle
AI Fully Automated Short Video Engine
Improve your Baduk skills by training with KataGo
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Powerful AI language model (MoE) optimized for efficiency/performance
A theoretical reconstruction of the Claude Mythos architecture
A Lightweight Face Recognition and Facial Attribute Analysis Library
An open source implementation of CLIP
NVR with realtime local object detection for IP cameras
OCR software, free and offline
Generate audiobooks from e-books
Comprehensive Gradio WebUI for audio processing
Advanced language and coding AI model
OBLITERATE THE CHAINS THAT BIND YOU
Qwen3-Coder is the code version of Qwen3
An enhancement tool for CodexApp that aims to make Codex easier to use
Effortless data labeling with AI support from Segment Anything
Official inference repo for FLUX.2 models
Just 1 minute of voice data can be used to train a good TTS model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Code for running inference and finetuning with the SAM 3 model