Showing 10 open source projects for "llm api"

  • 1. Bifrost

    The fastest LLM gateway, with built-in OpenTelemetry (OTel) observability

    Bifrost is an LLM gateway that provides a unified, OpenAI-compatible API in front of many different model providers. It abstracts away the complexity of working directly with multiple backends (OpenAI, Anthropic, AWS Bedrock, Google Vertex, etc.), letting you plug in providers and switch between them without touching your client code. A request sketch follows below.
    Downloads: 2 This Week
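    Because Bifrost fronts every provider with an OpenAI-compatible API, a plain HTTP client is enough to exercise it. The Go sketch below is a minimal illustration; the port, path, and the provider-prefixed model name are assumptions, not Bifrost's documented defaults.

    ```go
    package main

    import (
    	"bytes"
    	"encoding/json"
    	"fmt"
    	"io"
    	"log"
    	"net/http"
    )

    func main() {
    	body, _ := json.Marshal(map[string]any{
    		// Switching backends is just a matter of naming a different model;
    		// the endpoint and client code stay the same (model name is hypothetical).
    		"model": "anthropic/claude-3-5-sonnet",
    		"messages": []map[string]string{
    			{"role": "user", "content": "Hello!"},
    		},
    	})
    	// Assumed local gateway address; check the project's docs for real defaults.
    	resp, err := http.Post("http://localhost:8080/v1/chat/completions",
    		"application/json", bytes.NewReader(body))
    	if err != nil {
    		log.Fatal(err)
    	}
    	defer resp.Body.Close()
    	out, _ := io.ReadAll(resp.Body)
    	fmt.Println(string(out))
    }
    ```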
  • 2. LocalAI

    Self-hosted, community-driven, local OpenAI-compatible API

    LocalAI is a drop-in replacement REST API compatible with the OpenAI API specification, for local inferencing. It is a free, open source OpenAI alternative that runs LLMs on consumer-grade hardware; no GPU is required. It runs ggml, GPTQ, ONNX, and TF-compatible models: llama, gpt4all, rwkv, whisper, vicuna, koala, gpt4all-j, cerebras, falcon, dolly, starcoder, and many others. A short query sketch follows below.
    Downloads: 18 This Week
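    Since LocalAI mirrors the OpenAI API, discovering what is loaded is a single GET against the OpenAI-style /v1/models route. A minimal Go sketch, assuming the server listens on localhost:8080 (the port is an assumption; check your deployment):

    ```go
    package main

    import (
    	"fmt"
    	"io"
    	"log"
    	"net/http"
    )

    func main() {
    	// Assumed local endpoint; LocalAI exposes OpenAI-style routes such as /v1/models.
    	resp, err := http.Get("http://localhost:8080/v1/models")
    	if err != nil {
    		log.Fatal(err)
    	}
    	defer resp.Body.Close()
    	body, _ := io.ReadAll(resp.Body)
    	fmt.Println(string(body)) // JSON list of available models
    }
    ```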
  • 3. KubeAI

    Private, OpenAI-compatible AI on Kubernetes

    KubeAI gets inferencing running on Kubernetes: LLMs, embeddings, and speech-to-text. It serves an OpenAI-compatible HTTP API, and admins configure ML models through the Model Kubernetes Custom Resource. KubeAI can be thought of as a model operator (see the Operator pattern) that manages vLLM and Ollama servers. An embeddings-request sketch follows below.
    Downloads: 0 This Week
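    Because the API is OpenAI-compatible, an in-cluster client can request embeddings with ordinary HTTP. In the Go sketch below, the service hostname, URL path, and model name are all assumptions for illustration; KubeAI's actual route prefix and defaults may differ.

    ```go
    package main

    import (
    	"bytes"
    	"encoding/json"
    	"fmt"
    	"io"
    	"log"
    	"net/http"
    )

    func main() {
    	body, _ := json.Marshal(map[string]any{
    		"model": "text-embedding-model", // hypothetical name defined via a Model custom resource
    		"input": "kubernetes operators in one sentence",
    	})
    	// Hypothetical in-cluster service URL; adjust to your deployment.
    	resp, err := http.Post("http://kubeai/openai/v1/embeddings",
    		"application/json", bytes.NewReader(body))
    	if err != nil {
    		log.Fatal(err)
    	}
    	defer resp.Body.Close()
    	out, _ := io.ReadAll(resp.Body)
    	fmt.Println(string(out))
    }
    ```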
  • 4. hfapigo

    Unofficial Go (Golang) bindings for the Hugging Face Inference API

    Go bindings for the Hugging Face Inference API that let you call any model available in the Model Hub directly. An API key is required for authorized access; to get one, create a Hugging Face profile. The sketch below shows the raw call such bindings wrap.
    Downloads: 0 This Week
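    To make the shape of the underlying API concrete, here is the raw HTTP request that bindings like hfapigo wrap, written with only the Go standard library. This deliberately avoids guessing hfapigo's own function names; the classic api-inference.huggingface.co endpoint and the payload shape are assumptions that may have changed since.

    ```go
    package main

    import (
    	"bytes"
    	"fmt"
    	"io"
    	"log"
    	"net/http"
    	"os"
    )

    func main() {
    	payload := bytes.NewReader([]byte(`{"inputs": "The answer to life is"}`))
    	// Classic Inference API route: POST /models/<model-id> with a bearer token.
    	req, _ := http.NewRequest("POST",
    		"https://api-inference.huggingface.co/models/gpt2", payload)
    	req.Header.Set("Authorization", "Bearer "+os.Getenv("HF_API_TOKEN"))
    	req.Header.Set("Content-Type", "application/json")

    	resp, err := http.DefaultClient.Do(req)
    	if err != nil {
    		log.Fatal(err)
    	}
    	defer resp.Body.Close()
    	out, _ := io.ReadAll(resp.Body)
    	fmt.Println(string(out))
    }
    ```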
  • 5. BricksLLM

    Enterprise-grade API gateway that helps you monitor and impose cost limits

    BricksLLM is an open-source framework for building and managing production-ready LLM (Large Language Model) applications. It provides tooling for prompt engineering, memory management, observability, and chaining in one unified developer experience, and is designed to reduce boilerplate and increase the maintainability of LLM-based workflows.
    Downloads: 0 This Week
  • 6. kgateway

    The Cloud-Native API Gateway and AI Gateway

    kgateway is a mature, cloud-native API and ingress gateway that provides unified API connectivity for services, microservices, serverless workloads, and AI-centric systems running on Kubernetes clusters. It implements the Kubernetes Gateway API and can operate both as a lightweight in-cluster microgateway and as a centralized gateway capable of handling billions of API calls with high performance and low latency. A routing sketch follows below.
    Downloads: 1 This Week
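    Since kgateway implements the Kubernetes Gateway API, routes are declared with standard resources such as HTTPRoute rather than a vendor-specific CRD. The Go sketch below builds a minimal HTTPRoute with the upstream gateway-api types and prints it as YAML; the gateway, hostname, and service names are placeholders, and the field shapes reflect the v1 API as I recall it, so verify against the library version you use.

    ```go
    package main

    import (
    	"fmt"
    	"log"

    	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    	gatewayv1 "sigs.k8s.io/gateway-api/apis/v1"
    	"sigs.k8s.io/yaml"
    )

    func main() {
    	port := gatewayv1.PortNumber(8080)
    	route := gatewayv1.HTTPRoute{
    		TypeMeta: metav1.TypeMeta{
    			APIVersion: "gateway.networking.k8s.io/v1",
    			Kind:       "HTTPRoute",
    		},
    		ObjectMeta: metav1.ObjectMeta{Name: "llm-route"}, // placeholder name
    		Spec: gatewayv1.HTTPRouteSpec{
    			CommonRouteSpec: gatewayv1.CommonRouteSpec{
    				// Attach the route to a Gateway managed by kgateway (placeholder name).
    				ParentRefs: []gatewayv1.ParentReference{{Name: "my-gateway"}},
    			},
    			Hostnames: []gatewayv1.Hostname{"api.example.com"},
    			Rules: []gatewayv1.HTTPRouteRule{{
    				// Send matching traffic to a backend Service (placeholder).
    				BackendRefs: []gatewayv1.HTTPBackendRef{{
    					BackendRef: gatewayv1.BackendRef{
    						BackendObjectReference: gatewayv1.BackendObjectReference{
    							Name: "llm-service",
    							Port: &port,
    						},
    					},
    				}},
    			}},
    		},
    	}

    	out, err := yaml.Marshal(route)
    	if err != nil {
    		log.Fatal(err)
    	}
    	fmt.Println(string(out))
    }
    ```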
  • 7. MCP Go

    A Go implementation of the Model Context Protocol (MCP)

    mcp-go is a Go implementation of the Model Context Protocol (MCP), designed to enable seamless integration between Large Language Model (LLM) applications and external data sources and tools. It abstracts the complexities of the protocol and of server management, allowing developers to focus on building robust tools. The library is high-level and user-friendly, facilitating the development of MCP servers in Go; a server sketch follows below.
    Downloads: 1 This Week
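    A sketch of a minimal MCP server in the style of mcp-go's documented examples: register one tool and serve over stdio. The helper names (NewMCPServer, NewTool, RequireString, ServeStdio, ...) are from memory and mcp-go's API has changed between versions, so treat this as an assumption-laden outline rather than copy-paste code.

    ```go
    package main

    import (
    	"context"
    	"fmt"

    	"github.com/mark3labs/mcp-go/mcp"
    	"github.com/mark3labs/mcp-go/server"
    )

    func main() {
    	// Create a server with a name and version.
    	s := server.NewMCPServer("demo", "0.1.0")

    	// Declare a tool with a typed, required string argument.
    	tool := mcp.NewTool("greet",
    		mcp.WithDescription("Greets the named user"),
    		mcp.WithString("name", mcp.Required(), mcp.Description("Who to greet")),
    	)

    	// The handler receives the parsed tool call and returns a result.
    	s.AddTool(tool, func(ctx context.Context, req mcp.CallToolRequest) (*mcp.CallToolResult, error) {
    		name, err := req.RequireString("name")
    		if err != nil {
    			return mcp.NewToolResultError(err.Error()), nil
    		}
    		return mcp.NewToolResultText(fmt.Sprintf("Hello, %s!", name)), nil
    	})

    	// Speak MCP over stdin/stdout, the usual transport for local servers.
    	if err := server.ServeStdio(s); err != nil {
    		fmt.Println(err)
    	}
    }
    ```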
  • 8. Memobase

    Fast backend for long-term AI user memory via structured profiles

    Memobase is an open source backend system that enables long-term user memory for AI applications by capturing and structuring information about users across interactions. Its design centers on creating user profiles and recording event timelines, allowing AI systems to remember, understand, and evolve in their behaviour toward individual users over time. Rather than relying purely on traditional embedding-based retrieval or RAG, Memobase builds memory from those profile and timeline structures; a data-model sketch follows below.
    Downloads: 5 This Week
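    To make the profile-plus-timeline idea concrete, here are hypothetical Go structs modeling the two pieces the description names: slowly-changing user facts plus an append-only event log. These types are invented for illustration and are not Memobase's actual schema or API.

    ```go
    package main

    import (
    	"fmt"
    	"time"
    )

    // Profile holds slowly-changing facts the system has learned about a user.
    type Profile struct {
    	UserID string
    	Facts  map[string]string // e.g. "language" -> "Go", "timezone" -> "UTC+2"
    }

    // Event is one timeline entry recording something that happened in a session.
    type Event struct {
    	At      time.Time
    	Summary string
    }

    func main() {
    	p := Profile{UserID: "u123", Facts: map[string]string{"favorite_editor": "vim"}}
    	timeline := []Event{{At: time.Now(), Summary: "asked about Kubernetes ingress"}}
    	// An AI app would load the profile and recent events into the prompt context.
    	fmt.Println(p, timeline)
    }
    ```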
  • 9. HasMCP

    Convert an API into an MCP server in seconds

    HasMCP connects your existing APIs to Large Language Models. Its Automated OpenAPI Mapping translates API documentation into LLM-usable tools, eliminating manual coding. For security, Native MCP Elicitation Auth manages complex authentication flows like OAuth2 so that user credentials are never exposed. To improve efficiency, Context Window Optimization prunes API responses using JMESPath and Goja (JS) logic, reducing costs and improving response times. A pruning sketch follows below.
    Downloads: 0 This Week
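    The response-pruning idea is easy to demonstrate: apply a JMESPath expression to keep only the fields an LLM actually needs from a large API response. The Go sketch below uses the community go-jmespath library; the sample data and expression are illustrative, not HasMCP's own configuration.

    ```go
    package main

    import (
    	"encoding/json"
    	"fmt"
    	"log"

    	"github.com/jmespath/go-jmespath"
    )

    func main() {
    	raw := []byte(`{"items":[{"id":1,"name":"a","debug":{"trace":"..."}},{"id":2,"name":"b","debug":{"trace":"..."}}]}`)
    	var data any
    	if err := json.Unmarshal(raw, &data); err != nil {
    		log.Fatal(err)
    	}
    	// Keep only id and name, dropping the bulky debug payloads before the
    	// response is handed to the model.
    	pruned, err := jmespath.Search("items[].{id: id, name: name}", data)
    	if err != nil {
    		log.Fatal(err)
    	}
    	out, _ := json.Marshal(pruned)
    	fmt.Println(string(out)) // [{"id":1,"name":"a"},{"id":2,"name":"b"}]
    }
    ```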
  • 10. aqueduct LLM

    Aqueduct allows you to run LLM and ML workloads on any infrastructure

    Aqueduct is an open-source MLOps framework for defining and deploying machine learning and LLM workloads on any cloud infrastructure. You write code in vanilla Python, run it on whichever cloud infrastructure you like, and gain visibility into the execution and performance of your models and predictions. Aqueduct's Python-native API allows you to define ML tasks in regular Python code.
    Downloads: 0 This Week