WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
TT-NN operator library, and TT-Metalium low level kernel programming
A course of learning LLM inference serving on Apple Silicon
Low-code framework for building custom LLMs, neural networks
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
ChatGLM2-6B: An Open Bilingual Chat LLM
Datawhale members have compiled a book covering machine learning
ChatGLM3 series: Open Bilingual Chat LLMs | Open Source Bilingual Chat
Fast and efficient unstructured data extraction
Database system for building simpler and faster AI-powered application
Python bindings for the Transformer models implemented in C/C++
Llama 2 Everywhere (L2E)