Fast and memory-efficient exact attention
A simple but complete full-attention transformer
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Hub of ready-to-use datasets for ML models
A library for accelerating Transformer models on NVIDIA GPUs
Making large AI models cheaper, faster and more accessible
Gradient boosting framework based on decision tree algorithms
A Python library for audio
Running large language models on a single GPU
BitNet: Scaling 1-bit Transformers for Large Language Models
Python package built to ease deep learning on graphs
A fast image processing library with low memory needs
Modern custom scheduling for Anki based on Free Spaced Repetition
Topic Modelling for Humans
Go package for computer vision using OpenCV 4 and beyond
Kodezi Chronos is a debugging-first language model
A self-hostable CDN for databases
The Operator Splitting QP Solver
C++ library for high-performance inference on NVIDIA GPUs
Burn is a new comprehensive dynamic Deep Learning Framework
Models for the spaCy Natural Language Processing (NLP) library
Multilingual Automatic Speech Recognition with word-level timestamps
Statistical machine intelligence and learning engine
AIMET is a library that provides advanced quantization and compression
Core ML tools contain supporting tools for Core ML model conversion