Training data (data labeling, annotation, workflow) for all data types
MOSS-TTS-Nano is an open-source multilingual tiny speech generation
Benchmarking synthetic data generation methods
Adding guardrails to large language models
A fast TTS architecture with conditional flow matching
Qwen3-TTS is an open-source series of TTS models
Uncover insights, surface problems, monitor, and fine tune your LLM
AI coding workstation: Claude Code + web UI + 5 AI CLIs + headless
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Security Scanner for Agent Skills
100–200× Acceleration for Video Diffusion Models
My personal Claude Code and OpenAI Codex setup
Automatic Speech Recognition with Word-level Timestamps
Multi-lingual large voice generation model, providing inference
Synthetic data curation for post-training and data extraction
Hunyuan Translation Model Version 1.5
Multimodal Diffusion with Representation Alignment
Official inference repo for FLUX.2 models
Ultimate meta-skill for generating best-in-class Claude Code skills
Miso TTS is an 8 billion, highly emotive text-to-speech model
SDG is a specialized framework
The most powerful local music generation model
HY-Motion model for 3D character animation generation
Faster Whisper transcription with CTranslate2
AnyTool: Universal Tool-Use Layer for AI Agents