Fast and memory-efficient exact attention
A simple but complete full-attention transformer
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Infrastructure to enable deployment of ML models
Hub of ready-to-use datasets for ML models
A library for accelerating Transformer models on NVIDIA GPUs
Gradient boosting framework based on decision tree algorithms
A Python library for audio
Making large AI models cheaper, faster and more accessible
BitNet: Scaling 1-bit Transformers for Large Language Models
Running large language models on a single GPU
Python package built to ease deep learning on graph
A modern Anki custom scheduling based on Free Spaced Repetition
Topic Modelling for Humans
Go package for computer vision using OpenCV 4 and beyond
A self-hostable CDN for databases
Kodezi Chronos is a debugging-first language model
Models for the spaCy Natural Language Processing (NLP) library
Burn is a new comprehensive dynamic Deep Learning Framework
Multilingual Automatic Speech Recognition with word-level timestamps
The Operator Splitting QP Solver
AIMET is a library that provides advanced quantization and compression
Core ML tools contain supporting tools for Core ML model conversion
Create HTML profiling reports from pandas DataFrame objects
Statistical machine intelligence and learning engine