Instructions on how to use the Realtime API on Microcontrollers
Open-source Python framework for hybrid quantum-classical ml learning
Local RAG engine for private multimodal knowledge search on devices
UCCL is an efficient communication library for GPUs
Real-time NVIDIA GPU dashboard
A simple, performant and scalable Jax LLM
Implementation for MatMul-free LM
Run PyTorch LLMs locally on servers, desktop and mobile
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
New set of lightweight state-of-the-art, open foundation models
Official implementation of DreamCraft3D
Probabilistic reasoning and statistical analysis in TensorFlow
Give your OpenClaw AI agent a WhatsApp number
GLM-4-Voice | End-to-End Chinese-English Conversational Model
FlashMLA: Efficient Multi-head Latent Attention Kernels
A SOTA open-source image editing model
Bailing is a voice dialogue robot similar to GPT-4o
Lightning-fast, on-device TTS, running natively via ONNX
A text-to-speech, speech-to-text and speech-to-speech library
Enabling PyTorch on Google TPU
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
An engine-agnostic deep learning framework in Java
Learn How LLM Transformer Models Work with Interactive Visualization
Easy-to-use deep learning framework with 3 key features
AI assistent plugin for Kate editor