FlashMLA: Efficient Multi-head Latent Attention Kernels
Workflow engine for Kubernetes
The open source AI research agent
Burn is a new comprehensive dynamic Deep Learning Framework
Edit Banana: A framework for converting statistical figures
Internet-scale Neural Networks
Retrofit.dart is a dio client generator using source_gen
From-scratch PyTorch implementation of Google's TurboQuant
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Safe and portable GPU abstraction in Rust, implementing WebGPU API
Easy-to-use and powerful homomorphic encryption library
Open-weight, large-scale hybrid-attention reasoning model
Low-level Python library used to interact with a Substra network
Python library for audio and music analysis
A modern model graph visualizer and debugger
Julia support for the oneAPI programming toolkit
The open-source managed agents platform
DeepSeek Coder: Let the Code Write Itself
Algorithms for detecting associations, dynamical influences
A self-hostable CDN for databases
Bringing large-language models and chat to web browsers
C-based Application Programming Interface (API)
Metering and Billing for AI, API and DevOps
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Open-source LLM load balancer and serving platform for hosting LLMs