Kimi K2 is the large language model series developed by Moonshot AI
Image generation model with a single-stream diffusion transformer
Generating Immersive, Explorable, and Interactive 3D Worlds
Audio foundation model excelling in audio understanding
Qwen-Image is a powerful image generation foundation model
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Tooling for the Common Objects In 3D dataset
ICLR 2024 Spotlight: curation/training code, metadata, distribution
New family of code large language models (LLMs)
Capable of understanding text, audio, vision, and video
Implementation of model parallel autoregressive transformers on GPUs
An advanced bilingual image editing model with semantic control