State-of-the-art TTS model under 25MB
Let's make video diffusion practical
Image generation model with single-stream diffusion transformer
FlashMLA: Efficient Multi-head Latent Attention Kernels
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Accurate × Fast × Comprehensive
A 0.1B Omni model trained from scratch
Multimodal model achieving SOTA performance
Repo of Qwen2-Audio chat & pretrained large audio language model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Open-source, high-performance Mixture-of-Experts large language model
RoBERTa Chinese pre-training model: RoBERTa for Chinese
The official PyTorch implementation of our paper
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
Compact hybrid reasoning language model for intelligent responses
T5-Small: Lightweight text-to-text transformer for NLP tasks
Llama 3.2-1B: Multilingual, instruction-tuned model for mobile AI
Summarization model fine-tuned on CNN/DailyMail articles
Efficient English embedding model for semantic search and retrieval
NVFP4 DiffusionGemma model for fast multimodal text generation
Unified multimodal Gemma model for local coding and reasoning
High-performance MoE model with MLA, MTP, and multilingual reasoning
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
An advanced bilingual image editing model with semantic control