Use Microsoft Edge's online text-to-speech service from Python
Clippy, now with some AI
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Mobile and Web client for Codex and Claude Code, with realtime voice
Open source Claude Artifacts – built with Llama 3.1 405B
Qwen3-TTS is an open-source series of TTS models
Tokenizer-Free TTS for Multilingual Speech Generation
Huashu Design · HTML-native design skill for Claude Code
VGGFace2 Dataset for Face Recognition
A graphical frontend to tesseract-ocr
Open source AI VTuber platform with voice chat and Live2D avatars
Implementation of TurboQuant (ICLR 2026)
Collection of CVPR 2026 Papers and Open Source Projects
An experimental version of DeepSeek model
Universal LLM Deployment Engine with ML Compilation
Full System Prompts, Internal Tools & AI Models
Reference PyTorch implementation and models for DINOv3
RGBD video generation model conditioned on camera input
Open-source orchestration for zero-human companies
Synchronized Translation for Videos
A Systematic Framework for Interactive World Modeling
From Images to High-Fidelity 3D Assets
Qwen3.6 is the large language model series developed by Qwen team
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD