The unofficial Python package that returns responses from Google Bard
Optimizing inference proxy for LLMs
Libraries for applying sparsification recipes to neural networks
Visual Instruction Tuning: Large Language-and-Vision Assistant
The Triton Inference Server provides an optimized cloud
Open platform for training, serving, and evaluating language models
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Gaussian processes in TensorFlow
A library to communicate with ChatGPT, Claude, Copilot, Gemini
Adversarial Robustness Toolbox (ART) - Python Library for ML security
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
Everything you need to build state-of-the-art foundation models
Lightweight Python library for adding real-time multi-object tracking
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Build your chatbot within minutes on your favorite device
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers
OpenAI-style API for open large language models
A Unified Library for Parameter-Efficient Learning
Images to inference with no labeling
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT method
GPU environment management and cluster orchestration
Run any Llama 2 model locally with a Gradio UI on GPU or CPU from anywhere
Framework that is dedicated to making neural data processing
Open-source tool designed to enhance the efficiency of workloads