Bring the notion of Model-as-a-Service to life
The official Python Library for the Groq API
An engine-agnostic deep learning framework in Java
Pruna is a model optimization framework built for developers
Build Production-ready Agentic Workflows with Natural Language
Prompt, run, edit, & deploy full-stack web applications using any LLM
The Triton Inference Server provides an optimized cloud
The TypeScript AI agent framework
The fast, Pythonic way to build Model Context Protocol servers
A GPU-accelerated library containing highly optimized building blocks
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Simple, unified interface to multiple Generative AI providers
ChatGPT interface with a better UI
An open-source RAG-based tool for chatting with your documents
A secure sandbox environment for malware developers and red teamers
The official Meta Llama 3 GitHub site
Inference code for CodeLlama models
LLM Chatbot Assistant for Openfire server
Powering Amazon custom machine learning chips
Light and Fast AI Assistant
Project showcasing Llama 3.3 70B HTML codegen abilities
GPT4V-level open-source multi-modal model based on Llama3-8B
A git prepare-commit-msg hook for authoring commit messages with GPT-3
A lightweight vision library for performing large object detection
A gallery that showcases on-device ML/GenAI use cases