WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Low-latency machine code generation
Port of OpenAI's Whisper model in C/C++
TT-NN operator library, and TT-Metalium low level kernel programming
A course of learning LLM inference serving on Apple Silicon
Python inference and LoRA trainer package for the LTX-2 audio–video
A curated collection of skills for AI coding agents
ByteHook is an Android PLT hook library
Machine learning on FPGAs using HLS
PyTorch code and models for V-JEPA self-supervised learning from video
Tensor library for machine learning
The python library for real-time communication
SAPIEN Manipulation Skill Framework
Vision AI browser agent for automation, testing, and extraction
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Interface for OuteTTS models
Hundreds of fully solved job interview questions
Datawhale members have compiled a book covering machine learning
A simple, open format for guiding coding agents
DeepSeek 4 Flash local inference engine for Metal
Anthropic's original performance take-home, now open for you to try
Instructions on how to use the Realtime API on Microcontrollers
Fast and efficient unstructured data extraction
A high-performance distributed file system
Generate high-definition story short videos with one click using AI