Port of Facebook's LLaMA model in C/C++
Agentic, Reasoning, and Coding (ARC) foundation models
Powerful AI language model (MoE) optimized for efficiency/performance
Phi-3.5 for Mac: Locally-run Vision and Language Models
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Image generation model with single-stream diffusion transformer
Open-source, high-performance AI model with advanced reasoning
Contexts Optical Compression
RGBD video generation model conditioned on camera input
Open-weight, large-scale hybrid-attention reasoning model
Official inference repo for FLUX.2 models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal Diffusion with Representation Alignment
A Customizable Image-to-Video Model based on HunyuanVideo
Code for running inference and finetuning with the SAM 3 model
An experimental version of the DeepSeek model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
A Powerful Native Multimodal Model for Image Generation
From Images to High-Fidelity 3D Assets
Open-source large language model family from Tencent Hunyuan
Code for running inference with the SAM 3D Body model (3DB)
Pokee Deep Research Model Open Source Repo
Towards self-verifiable mathematical reasoning
Industrial-level controllable zero-shot text-to-speech system
Reference PyTorch implementation and models for DINOv3