Use Microsoft Edge's online text-to-speech service from Python
Machine Learning Containers for NVIDIA Jetson and JetPack-L4T
A lightweight, lightning-fast, in-process vector database
Universal LLM Deployment Engine with ML Compilation
Build and deploy AI Agents on Cloudflare
Build Vision Agents quickly with any model or video provider
A TTS that fits in your CPU (and pocket)
Introduction to Machine Learning Systems
A workflow execution platform built on top of the fantastic Cloudflare
AI edge infrastructure for macOS. Run local or cloud models
26M function-calling model that runs on incredibly small devices
A Claude Code plugin that iteratively refines product specifications
Fast State-of-the-Art Static Embeddings
Hunyuan Translation Model Version 1.5
Realtime AI Voice Agents with SoTA Multimodal AI models on Arduino ESP
Make videos programmatically with React
Ultra-Efficient LLMs on End Device
Run a 1-billion-parameter LLM on a $10 board with 256MB RAM
Fast Multimodal LLM on Mobile Devices
High-Quality Voice Cloning TTS for 600+ Languages
Bailing is a voice dialogue robot similar to GPT-4o
Real-time voice interactive digital human
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
The repository provides code for running inference with SAM 2
Accurate × Fast × Comprehensive