Low-latency REST API for serving text-embeddings
A Unified Library for Parameter-Efficient Learning
PyTorch library of curated Transformer models and their components
C++ library for high performance inference on NVIDIA GPUs
Serving system for machine learning models
A general-purpose probabilistic programming system
Database system for building simpler and faster AI-powered application
A computer vision framework to create and deploy apps in minutes
Implementation of "Tree of Thoughts
The deep learning toolkit for speech-to-text