Low-latency REST API for serving text embeddings
Replace OpenAI GPT with another LLM in your app
A Unified Library for Parameter-Efficient Learning
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods
PyTorch library of curated Transformer models and their components
Integrate, train and manage any AI models and APIs with your database
MII makes low-latency and high-throughput inference possible
A toolkit to optimize Keras & TensorFlow ML models for deployment
A graphical manager for Ollama that can manage your LLMs
A computer vision framework to create and deploy apps in minutes
Implementation of "Tree of Thoughts"