Taming Stable Diffusion for Lip Sync
A beautiful, powerful, self-hosted ROM manager and player
Expressive Portrait Image Animation for Live Streaming
24 Lessons, 12 Weeks, Get Started as a Web Developer
Smart video converter using YOLOv8 and FFmpeg
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Jupyter magics and kernels for working with remote Spark clusters
Azure command-line interface
Transform your favorite cities into beautiful, minimalist designs
A Python library for extracting structured information
Static Analyzer for Solidity
Doom-based AI research platform for reinforcement learning
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A computer vision closed-loop learning platform
GitLab automatic code review tool based on large models
Agent Skill for generating 2D sprite sheets and map, transparent PNG
General-purpose image editing model that delivers high-fidelity
[CVPR 2026 Oral] VGGT Omega
Misc; latest version of waifu2x; 2D video to stereo 3D video
ComfyUI nodes for LivePortrait
Python package for AutoML on Tabular Data with Feature Engineering
Python module that helps you build complex pipelines of batch jobs
GPT Image 2 prompt gallery, image prompt library, agentic skill
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Motion-controllable Video Generation via Latent Trajectory Guidance