Research project. A Memory solution for users, teams, and applications
Port of OpenAI's Whisper model in C/C++
Official inference framework for 1-bit LLMs
Run a 1-billion-parameter LLM on a $10 board with 256MB RAM
Our first fully AI-generated deep learning system
Run OpenClaw on a $5 chip
AI video generator optimized for low-VRAM and older GPUs
FlashMLA: Efficient Multi-head Latent Attention Kernels
Framework for building, orchestrating, and deploying AI agents
Open-source large language model family from Tencent Hunyuan
Mooncake is the serving platform for Kimi
TT-NN operator library and TT-Metalium low-level kernel programming
Running a big model on a small laptop
Alibaba's high-performance LLM inference engine for diverse apps
Machine learning algorithms for advanced analytics
Intellect Modeling Kit: assisting research, diagnostics, consulting
Transformer-related optimization, including BERT and GPT
A High Performance Library for Sequence Processing and Generation
TinyML AI inference library
10x faster matrix and vector operations
Fast and robust map analyser for Brood War.
The IRC's Talking Robot
CRFSharp is a .NET (C#) implementation of Conditional Random Fields