TT-NN operator library, and TT-Metalium low level kernel programming
Easily compute clip embeddings and build a clip retrieval system
Making RAG Simpler with Small and Open-Sourced Language Models
Towards self-verifiable mathematical reasoning
"Big Model" trains a visual multimodal VLM with 26M parameters
Netflix’s Workflow Orchestrator
A theoretical reconstruction of the Claude Mythos architecture
FlashMLA: Efficient Multi-head Latent Attention Kernels
The open source AI research agent
The open-source managed agents platform
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
From-scratch PyTorch implementation of Google's TurboQuant
Open-weight, large-scale hybrid-attention reasoning model
DeepSeek Coder: Let the Code Write Itself
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Autonomous AI agent that you can configure and build
Drop-in replacement for standard residual connections in Transformers
Unsupervised Learning for Image Registration
Multi-user UI for managing and running Stable Diffusion workflows tool
PyTorch3D is FAIR's library of reusable components for deep learning
Official inference framework for 1-bit LLMs
DeepSeek 4 Flash local inference engine for Metal
Training neural networks on Apple Neural Engine via APIs
Confidential Compute Open Network, Decentralized AI Inference on TON
OpenTinker is an RL-as-a-Service infrastructure for foundation models