Kimi K2 is the large language model series developed by Moonshot AI
Image generation model with single-stream diffusion transformer
Audio foundation model excelling in audio understanding
Generating Immersive, Explorable, and Interactive 3D Worlds
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Qwen-Image is a powerful image generation foundation model
ICLR2024 Spotlight: curation/training code, metadata, distribution
New family of code large language models (LLMs)
Tooling for the Common Objects In 3D dataset
Capable of understanding text, audio, vision, video
Implementation of model parallel autoregressive transformers on GPUs
An advanced bilingual image editing with semantic control