INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
DoWhy is a Python library for causal inference
Large Language Model Text Generation Inference
Data manipulation and transformation for audio signal processing
Uplift modeling and causal inference with machine learning algorithms
Efficient few-shot learning with Sentence Transformers
Python Package for ML-Based Heterogeneous Treatment Effects Estimation
A library to communicate with ChatGPT, Claude, Copilot, Gemini
A high-performance ML model serving framework that offers dynamic batching
State-of-the-art diffusion models for image and audio generation
Open-source tool designed to enhance the efficiency of workloads
A Unified Library for Parameter-Efficient Learning
Uncover insights, surface problems, monitor, and fine-tune your LLM
Trainable models and NN optimization tools
Probabilistic reasoning and statistical analysis in TensorFlow
OpenMMLab Model Deployment Framework
Multilingual Automatic Speech Recognition with word-level timestamps
Easy-to-use deep learning framework with 3 key features
Optimizing inference proxy for LLMs
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
A lightweight vision library for performing large object detection
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
PyTorch extensions for fast R&D prototyping and Kaggle farming
Unified Model Serving Framework
Framework that is dedicated to making neural data processing