A high-performance ML model serving framework, offers dynamic batching
Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere
Visual Instruction Tuning: Large Language-and-Vision Assistant
State-of-the-art Parameter-Efficient Fine-Tuning
Trainable models and NN optimization tools
PyTorch extensions for fast R&D prototyping and Kaggle farming
Multi-Modal Neural Networks for Semantic Search, based on Mid-Fusion
Tensor search for humans
A set of Docker images for training and serving models in TensorFlow
The unofficial python package that returns response of Google Bard
LLMFlows - Simple, Explicit and Transparent LLM Apps
Phi-3.5 for Mac: Locally-run Vision and Language Models
Probabilistic reasoning and statistical analysis in TensorFlow
A lightweight vision library for performing large object detection
Lightweight Python library for adding real-time multi-object tracking
A Unified Library for Parameter-Efficient Learning
Data manipulation and transformation for audio signal processing
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Gaussian processes in TensorFlow
A library to communicate with ChatGPT, Claude, Copilot, Gemini
AI interface for tinkerers (Ollama, Haystack RAG, Python)
Libraries for applying sparsification recipes to neural networks
Framework for Accelerating LLM Generation with Multiple Decoding Heads
Neural Network Compression Framework for enhanced OpenVINO
Efficient few-shot learning with Sentence Transformers