Clean and efficient FP8 GEMM kernels with fine-grained scaling
Qwen3.5 is the large language model series developed by Qwen team
Official repository for LTX-Video
Recovering the Visual Space from Any Views
Qwen3-Coder is the code version of Qwen3
Block Diffusion for Ultra-Fast Speculative Decoding
Audio foundation model excelling in audio understanding
Tiny vision language model
Blazeface is a lightweight model that detects faces in images
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
Repo for external large-scale work
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A minimal PyTorch re-implementation of the OpenAI GPT
An implementation of model parallel GPT-2 and GPT-3-style models
Dual LSTM Encoder for Dialog Response Generation
React app for inspecting, building and debugging with the Realtime API
Compact 8B multimodal instruct model optimized for edge deployment