Port of Facebook's LLaMA model in C/C++
Phi-3.5 for Mac: Locally-run Vision and Language Models
New set of lightweight state-of-the-art, open foundation models
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Block Diffusion for Ultra-Fast Speculative Decoding
26m function call model that runs on incredibly small devices
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Clean and efficient FP8 GEMM kernels with fine-grained scaling
ICLR2024 Spotlight: curation/training code, metadata, distribution
Memory-efficient and performant finetuning of Mistral's models
Blazeface is a lightweight model that detects faces in images
This repository contains the official implementation of research
PyTorch implementation of MAE
A library for Multilingual Unsupervised or Supervised word Embeddings
React app for inspecting, building and debugging with the Realtime API
Lightweight multimodal translation model for 55 languages
T5-Small: Lightweight text-to-text transformer for NLP tasks
Compact English sentence embedding model for semantic search tasks
Lightweight 24B agentic coding model with vision and long context
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Tiny pre-trained IBM model for multivariate time series forecasting
Custom BLEURT model for evaluating text similarity using PyTorch
OpenAI’s compact 20B open model for fast, agentic, and local use
Efficient MoE reasoning model for coding and math workloads