Taming Stable Diffusion for Lip Sync
Chinese and English multimodal conversational language model
A beautiful, powerful, self-hosted rom manager and player
Expressive Portrait Image Animation for Live Streaming
About 24 Lessons, 12 Weeks, Get Started as a Web Developer
Smart video converter using YOLOv8 and FFmpeg
Jupyter magics and kernels for working with remote Spark clusters
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Phi-3.5 for Mac: Locally-run Vision and Language Models
GitLab automatic code review tool based on large models
Agent Skill for generating 2D sprite sheets and map, transparent PNG
Transform your favorite cities into beautiful, minimalist designs
A Python library for extracting structured information
Static Analyzer for Solidity
A computer vision closed-loop learning platform
The book "Performance Analysis and Tuning on Modern CPU"
Qwen3-omni is a natively end-to-end, omni-modal LLM
Python package for AutoML on Tabular Data with Feature Engineering
Azure command-line interface
Python module that helps you build complex pipelines of batch jobs
Static site generator for .NET API documentation
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Just a Better Chatbot. Powered by MCP Client & Workflows
General-purpose image editing model that delivers high-fidelity