Fast multimodal LLM for real-time voice interaction and AI apps
Autoregressive Model Beats Diffusion
Diffusion Transformer with Fine-Grained Chinese Understanding
A Python utility / library to sort imports
Statusline plugin for vim with prompts for several other applications
Real-time voice interactive digital human
Powerful Android AI agent with tools, automation, and Linux shell
Sample code and notebooks for Generative AI on Google Cloud
Unified Multimodal Understanding and Generation Models
LLM abstractions that aren't obstructions
Azure command-line interface
Framework for building AI-powered interactive digital humans and agent
End-to-end speech processing toolkit
A TTS model capable of generating ultra-realistic dialogue
A wiki system with complex functionality for simple integration
AutoGluon: AutoML for Image, Text, and Tabular Data
Qwen3-ASR is an open-source series of ASR models
User toolkit for analyzing and interfacing with Large Language Models
Ark pixel font - Open source Pan-CJK pixel font
Multimodal AI chat app with dynamic conversation routing
AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework
Visual Causal Flow
SOTA discrete acoustic codec models with 40/75 tokens per second
A formatter for Python files
Easy to use Python library for creating 2D arcade games