Open deep learning compiler stack for cpu, gpu
Lightweight multimodal translation model for 55 languages
Compact 8B multimodal instruct model optimized for edge deployment
Small 3B-base multimodal model ideal for custom AI on edge hardware
Efficient 14B multimodal instruct model with edge deployment and FP8
OpenAI’s open-weight 120B model optimized for reasoning and tooling
Quantized 675B multimodal instruct model optimized for NVFP4