ONNX Runtime: cross-platform, high-performance ML inferencing
ChatGLM3 series: Open Bilingual Chat LLMs
LiteRT, successor to TensorFlow Lite
NVR with real-time local object detection for IP cameras
A simple, performant and scalable JAX LLM
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
TT-NN operator library, and TT-Metalium low-level kernel programming
Ready-to-run cloud templates for RAG
A ranked list of awesome machine learning Python libraries
Google's open source distributed agent runtime
Numerical differential equation solvers in JAX
Making large AI models cheaper, faster and more accessible
OpenMMLab Model Deployment Framework
Reference implementation of the Transformer architecture optimized
Speculative-decoding accelerator for the 675B Mistral Large 3