Qwen3-TTS is an open-source series of TTS models
Contexts Optical Compression
My personal Claude Code configuration
High-Resolution Image Synthesis with Latent Diffusion Models
Tool for exploring and debugging transformer model behaviors
Repo for SeedVR2 & SeedVR
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Fast and Universal 3D reconstruction model for versatile tasks
Official implementation of Watermark Anything with Localized Messages
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Collection of Gemma 3 variants that are trained for performance
High-resolution models for human tasks
Inference script for Oasis 500M
This repository contains the official implementation of FastVLM
Foundational Models for State-of-the-Art Speech and Text Translation
The ChatGPT Retrieval Plugin lets you easily find personal documents
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Safety reasoning models built upon gpt-oss
A fast, local neural text to speech system
Open-source, high-performance Mixture-of-Experts large language model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
ChatGPT integration with Unity Editor
Code release for "Masked-attention Mask Transformer
VaultGemma: 1B DP-trained Gemma variant for private NLP tasks
Text-to-image model optimized for artistic quality and safe generation