A TTS that fits in your CPU (and pocket)
Make videos programmatically with React
Fast LLM speculative inference server for consumer hardware
Numerical differential equation solvers in JAX
Memory Management Kit for Agents
Automatic GitLab code review tool based on large language models
Dual LSTM Encoder for Dialog Response Generation
OpenAI’s compact 20B open model for fast, agentic, and local use
Free concurrent AI Chat, Search, and Read: an alternative to Sider
Compact 8B multimodal instruct model optimized for edge deployment
Ultra-efficient 3B multimodal instruct model built for edge deployment
Powerful 14B-base multimodal model — flexible base for fine-tuning