Parallax is a distributed model serving framework
Persistent context and multi-instance coordination
Claude Code skill for generating production-quality SVG+PNG technical
Schema-Guided Reasoning (SGR) has agentic system design
Simplest working implementation of Stylegan2
Generate high-definition story short videos with one click using AI
A text-to-speech, speech-to-text and speech-to-speech library
Your Personal Research Multi-Tool
Physical Symbolic Optimization
Foundational model for human-like, expressive TTS
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Ultimate meta-skill for generating best-in-class Claude Code skills
LLM based autonomous agent that does online comprehensive research
One-click deployment (including offline integration package)
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
A Production-ready Reinforcement Learning AI Agent Library
Machine Learning Pipelines for Kubeflow
Synthetic data generators for tabular and time-series data
OCR expert VLM powered by Hunyuan's native multimodal architecture
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Large Multimodal Models for Video Understanding and Editing
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Open Multilingual Multimodal Chat LMs
Guiding Instruction-based Image Editing via Multimodal Large Language
Beyond the Imitation Game collaborative benchmark for measuring