Image generation model with single-stream diffusion transformer
FlashMLA: Efficient Multi-head Latent Attention Kernels
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Accurate × Fast × Comprehensive
A 0.1B Omni model trained from scratch
Multimodal model achieving SOTA performance
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-source, high-performance Mixture-of-Experts large language model
RoBERTa Chinese pre-training model: RoBERTa for Chinese
The official pytorch implementation of our paper
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
Compact hybrid reasoning language model for intelligent responses
T5-Small: Lightweight text-to-text transformer for NLP tasks
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Summarization model fine-tuned on CNN/DailyMail articles
Efficient English embedding model for semantic search and retrieval
NVFP4 DiffusionGemma model for fast multimodal text generation
Unified multimodal Gemma model for local coding and reasoning
High-performance MoE model with MLA, MTP, and multilingual reasoning
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
An advanced bilingual image editing with semantic control
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
Frontier-scale 675B multimodal base model for custom AI training
Small 3B-base multimodal model ideal for custom AI on edge hardware
Ultra-efficient 3B multimodal instruct model built for edge deployment