Math-specific large language models in our Qwen2 series
Capable of understanding text, audio, vision, and video
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Qwen2.5-VL is the multimodal large language model series
The most powerful local music generation model
A Systematic Framework for Interactive World Modeling
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Wan2.2: Open and Advanced Large-Scale Video Generative Model
One minute of voice data is enough to train a good TTS model
Integrate Magisk root and Google Apps into WSA
AI agents autonomously run and improve ML experiments overnight
State-of-the-art TTS model under 25MB
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A Django plugin for creating AJAX-driven forms in Bootstrap modals
macOS Security Compliance Project
Deep learning library
A central control plane for AWS permissions and access
Deep learning optimization library: makes distributed training easy
Designed for text embedding and ranking tasks
Node.js native addon build tool
Agent S: an open agentic framework that uses computers like a human
Blender pipeline for photorealistic training image generation
High-Fidelity and Controllable Generation of Textured 3D Assets
AI coding workstation: Claude Code + web UI + 5 AI CLIs + headless