The most powerful local music generation model
Speech recognition module for Python
ChatGLM-6B: An Open Bilingual Dialogue Language Model
3D reconstruction software
Taming Stable Diffusion for Lip Sync
AI video generator optimized for low VRAM and older GPUs use
Toolkit to help you get started with Spec-Driven Development
Create Customized Software using Natural Language Idea
AI tool for real-time monitoring and analysis of Goofish listings
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Industrial-level controllable zero-shot text-to-speech system
Multi-agent autonomous startup system for Claude Code
HivisionIDPhotos: a lightweight and efficient AI ID photos tools
The Multi-Agent Framework
GLM-4 series: Open Multilingual Multimodal Chat LMs
High-Resolution Image Synthesis with Latent Diffusion Models
1B text generation model based on the HRM architecture
A unified library of SOTA model optimization techniques
The first real AI developer
DeepCode: Open Agentic Coding
An Efficient Agentic Model for Computer Use
A Unified Library for Parameter-Efficient Learning
Qwen2.5-VL is the multimodal large language model series
Backlog-row-first content production system for teams
Bidirectional token-classification model for identifiable info