llm api free download

Showing 33 open source projects for "llm api"

1

LLM Gateway

Route, manage, and analyze your LLM requests across multiple providers

LLM Gateway is an open-source middleware that consolidates interactions with multiple LLM providers—such as OpenAI, Anthropic, Google Vertex AI—behind a single, unified API compatible with OpenAI's spec. Designed for both self-hosted and cloud use, it enables developers to route requests dynamically, secure and manage API keys, monitor token usage and costs, and analyze performance metrics.

Downloads: 0 This Week

Last Update: 2025-12-18
See Project
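
The "single, unified API" idea in the LLM Gateway entry above can be pictured as a lookup from the requested model name to an upstream provider. The following is a minimal sketch under stated assumptions, not LLM Gateway's actual code: the `ROUTES` table, the `resolveRoute` function, the model-name prefixes, and the placeholder base URLs are all hypothetical.

```typescript
// Hypothetical sketch: route a request to a provider based on the model name.
// Provider names, prefixes, and URLs are placeholders, not LLM Gateway's real config.
interface ProviderRoute {
  provider: string;
  baseUrl: string;
}

const ROUTES: Array<{ prefix: string; route: ProviderRoute }> = [
  { prefix: "gpt-", route: { provider: "openai", baseUrl: "https://openai.example.com/v1" } },
  { prefix: "claude-", route: { provider: "anthropic", baseUrl: "https://anthropic.example.com/v1" } },
  { prefix: "gemini-", route: { provider: "google-vertex", baseUrl: "https://vertex.example.com/v1" } },
];

// Return the first route whose prefix matches the start of the model name.
function resolveRoute(model: string): ProviderRoute {
  for (const entry of ROUTES) {
    if (model.slice(0, entry.prefix.length) === entry.prefix) {
      return entry.route;
    }
  }
  throw new Error(`No provider configured for model "${model}"`);
}
```

A caller would send every request to the gateway in one format and let something like `resolveRoute("claude-example")` pick the upstream provider; an unknown model name fails loudly instead of being sent to a default.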
2

promptfoo

Evaluate and compare LLM outputs, catch regressions, improve prompts

Ensure high-quality LLM outputs with automatic evals. Use a representative sample of user inputs to reduce subjectivity when tuning prompts. Use built-in metrics, LLM-graded evals, or define your own custom metrics. Compare prompts and model outputs side-by-side, or integrate the library into your existing test/CI workflow. Use OpenAI, Anthropic, and open-source models like Llama and Vicuna, or integrate custom API providers for any LLM API.

Downloads: 3 This Week

Last Update: 4 days ago
See Project
3

Firecrawl

Turn entire websites into LLM-ready markdown or structured data

Crawl and convert any website into LLM-ready markdown or structured data. Built by Mendable.ai and the Firecrawl community. Includes powerful scraping, crawling, and data extraction capabilities. Firecrawl is an API service that takes a URL, crawls it, and converts it into clean markdown or structured data. We crawl all accessible subpages and give you clean data for each.

Downloads: 6 This Week

Last Update: 2026-02-02
See Project
4

WebLLM

Bringing large-language models and chat to web browsers

WebLLM is a modular, customizable JavaScript package that brings language model chats directly to web browsers with hardware acceleration. Everything runs inside the browser with no server support and is accelerated with WebGPU. This opens up a lot of fun opportunities to build AI assistants for everyone and to preserve privacy while enjoying GPU acceleration. WebLLM offers a minimalist and modular interface to access the chatbot in the browser. The WebLLM package itself does not come...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
5

Arthur Bench

Bench is a tool for evaluating LLMs for production use cases

Bench is a tool for evaluating LLMs for production use cases. Whether you are comparing different LLMs, considering different prompts, or testing generation hyperparameters like temperature and number of tokens, Bench provides one touch point for all your LLM performance evaluation.

Downloads: 1 This Week

Last Update: 2024-02-16
See Project
6

Langflow

Low-code app builder for RAG and multi-agent AI applications

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Downloads: 13 This Week

Last Update: 2026-01-23
See Project
7

Chainlit

Build Python LLM apps in minutes

Chainlit is an open-source Python package that makes it incredibly fast to build and share LLM apps. Integrate the Chainlit API in your existing code to spawn a ChatGPT-like interface in minutes! Integrate seamlessly with an existing code base or start from scratch in minutes. Understand the intermediary steps that produced an output at a glance. Deep dive into prompts in the Prompt Playground to understand where things went wrong and iterate.

Downloads: 0 This Week

Last Update: 2026-01-20
See Project
8

PasteGuard

Masks sensitive data and secrets before they reach AI

PasteGuard is an open-source privacy proxy that protects sensitive information like personal data and API secrets by detecting and masking them before they reach large language model APIs such as OpenAI or Anthropic Claude. It sits between an application and the LLM provider, automatically replacing names, emails, tokens, and other personally identifiable information (PII) with placeholders so that external services never see raw sensitive values, and then optionally unmasking them in the returned output. ...

Downloads: 0 This Week

Last Update: 2 days ago
See Project
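
The mask-then-unmask flow described in the PasteGuard entry above can be sketched as follows. This is an illustration only, not PasteGuard's implementation: the `maskText` and `unmaskText` functions, the placeholder format, and the two regular expressions are assumptions, and real PII detection needs much more than two patterns.

```typescript
// Hypothetical sketch of masking sensitive values before a prompt leaves the app.
// Patterns and placeholder format are illustrative, not PasteGuard's actual rules.
interface MaskResult {
  masked: string;
  mapping: Record<string, string>; // placeholder -> original value
}

const PATTERNS: Array<{ label: string; regex: RegExp }> = [
  { label: "EMAIL", regex: /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g },
  { label: "API_KEY", regex: /\bsk-[A-Za-z0-9]{16,}\b/g },
];

function maskText(input: string): MaskResult {
  const mapping: Record<string, string> = {};
  const seen: Record<string, string> = {}; // original value -> placeholder
  let masked = input;
  for (const pattern of PATTERNS) {
    let counter = 0;
    masked = masked.replace(pattern.regex, (match) => {
      // Reuse the same placeholder when a value repeats.
      if (seen[match] !== undefined) return seen[match];
      counter += 1;
      const placeholder = `[[${pattern.label}_${counter}]]`;
      seen[match] = placeholder;
      mapping[placeholder] = match;
      return placeholder;
    });
  }
  return { masked, mapping };
}

// Restore original values in the model's reply using the saved mapping.
function unmaskText(output: string, mapping: Record<string, string>): string {
  let restored = output;
  for (const placeholder of Object.keys(mapping)) {
    restored = restored.split(placeholder).join(mapping[placeholder]);
  }
  return restored;
}
```

In this sketch the mapping never leaves the proxy, so the external provider only sees placeholders such as `[[EMAIL_1]]`, and unmasking remains an optional final step on the returned text.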
9

Nanobrowser

Open-Source Chrome extension for AI-powered web automation

Nanobrowser is an open-source AI web automation tool that runs in your browser. A free alternative to OpenAI Operator with flexible LLM options and a multi-agent system. Nanobrowser, as a Chrome extension, delivers premium web automation capabilities while keeping you in complete control. No subscription fees or hidden costs. Just install it and use your own API keys, so you pay only for what you use. Everything runs in your local browser. Your credentials stay with you and are never shared with any cloud service. ...

Downloads: 2 This Week

Last Update: 2025-11-22
See Project
10

RWKV Runner

A RWKV management and startup tool, full automation, only 8MB

RWKV (pronounced as RwaKuv) is an RNN with GPT-level LLM performance that can also be trained directly like a GPT transformer (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, fast training, lower VRAM use, "infinite" ctxlen, and free text embedding. Moreover, it is 100% attention-free. The default configs enable custom CUDA kernel acceleration, which is much faster and consumes much less VRAM. If you encounter possible compatibility...

Downloads: 1 This Week

Last Update: 2026-02-01
See Project
11

TUUI

A desktop MCP client designed as a tool unitary utility integration

Tuui is a desktop chat application built around the Model Context Protocol (MCP), designed as a unified tool to streamline AI interactions by orchestrating LLM APIs across various vendors, with many components generated or transformed through AI workflows. This repository is essentially an LLM chat desktop application based on MCP. It also represents a bold experiment in creating a complete project using AI. Many components within the project have been directly converted or generated from...

Downloads: 1 This Week

Last Update: 2026-02-14
See Project
12

Memobase

Fast backend for long-term AI user memory via structured profiles

Memobase is an open source backend system that enables long-term user memory functionality for AI applications by capturing and structuring information about users across interactions. Its design centers on creating user profiles and recording event timelines, allowing AI systems to remember, understand, and evolve in their behaviour toward individual users over time. Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline...

Downloads: 5 This Week

Last Update: 15 hours ago
See Project
13

APIPark

APIPark is the #1 open-source AI Gateway and Developer Portal

APIPark is an open-source, all-in-one AI gateway and API developer portal that helps developers and enterprises easily manage, integrate, and deploy AI services. No matter which AI model you use, APIPark provides a one-stop integration solution. It unifies the management of all authentication information and tracks the costs of API calls. It also standardizes the request data format for all AI models, so switching AI models or modifying prompts won’t affect your app or microservices,...

Downloads: 0 This Week

Last Update: 2025-10-26
See Project
14

TONL

TONL (Token-Optimized Notation Language)

...It provides a serialization format that significantly reduces token usage compared with traditional JSON, which can result in lower costs and more efficient prompt size utilization in LLM-driven systems. TONL isn’t just a format — it includes a rich API for querying, indexing, modifying, and streaming data, along with tools for schema validation and TypeScript code generation. The platform comes with a complete command-line interface that supports interactive dashboards and cross-platform usage in browsers and server environments, and its high test coverage gives developers confidence in stability.

Downloads: 0 This Week

Last Update: 2026-02-07
See Project
15

Dify

One API for plugins and datasets, one interface for prompt engineering

Dify is an easy-to-use LLMOps platform designed to empower more people to create sustainable, AI-native applications. With visual orchestration for various application types, Dify offers out-of-the-box, ready-to-use applications that can also serve as Backend-as-a-Service APIs. Unify your development process with one API for plugins and datasets integration, and streamline your operations using a single interface for prompt engineering, visual analytics, and continuous improvement....

Downloads: 5 This Week

Last Update: 2026-02-11
See Project
16

BrowserAI

Run local LLMs like llama, deepseek, kokoro etc. inside your browser

BrowserAI is a cutting-edge platform that allows users to run large language models (LLMs) directly in their web browser without the need for a server. It leverages WebGPU for accelerated performance and supports offline functionality, making it a highly efficient and privacy-conscious solution. The platform provides a developer-friendly SDK with pre-configured popular models, and it allows for seamless switching between MLC and Transformer engines. Additionally, it supports features such as...

Downloads: 2 This Week

Last Update: 2025-05-21
See Project
17

Suna

Suna - Open Source Generalist AI Agent

Suna is an open-source generalist AI agent developed by Kortix AI. Designed to assist users in accomplishing real-world tasks through natural conversation, Suna combines powerful capabilities with an intuitive interface. It serves as a digital companion for research, data analysis, and everyday challenges, integrating tools like browser automation, file management, web crawling, command-line execution, website deployment, and API integration. Suna's architecture comprises a FastAPI-based...

Downloads: 3 This Week

Last Update: 2025-04-25
See Project
18

Open Responses

Specification for multi-provider, interoperable LLM interfaces

Open Responses is an open-source implementation of an API compatible with the OpenAI Responses API that lets developers self-host a drop-in alternative endpoint for AI interactions while preserving compatibility with existing Agents SDKs and model workflows. It enables you to run a local or private server that speaks the standard Responses API, so tools, applications, and agents built against that API can operate without contacting OpenAI’s cloud and can instead route calls to any large...

Downloads: 1 This Week

Last Update: 6 days ago
See Project
19

Mastra

The TypeScript AI agent framework

...At its core, Mastra provides abstractions for agents, workflows, tools, memory, retrieval, and model routing, so developers can focus on specifying behavior rather than wiring infrastructure from scratch. Model routing lets you connect to dozens of providers (OpenAI, Anthropic, Gemini, and others) through a single standardized interface, while agents orchestrate LLM calls and tools to solve open-ended tasks with internal reasoning loops. When explicit control is needed, Mastra’s workflow engine uses a graph-style API (.then(), .branch(), .parallel()) to orchestrate multi-step processes.

Downloads: 10 This Week

Last Update: 2 days ago
See Project
20

Gemini Fullstack LangGraph Quickstart

Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph

gemini-fullstack-langgraph-quickstart is a fullstack reference application from Google DeepMind’s Gemini team that demonstrates how to build a research-augmented conversational AI system using LangGraph and Google Gemini models. The project features a React (Vite) frontend and a LangGraph/FastAPI backend designed to work together seamlessly for real-time research and reasoning tasks. The backend agent dynamically generates search queries based on user input, retrieves information via the...

Downloads: 1 This Week

Last Update: 2 days ago
See Project
21

RisuAI

Make your own story. User-friendly software for LLM roleplaying

RisuAI (or Risu) is a cross-platform AI roleplay chat application, available as both a desktop and web solution, that offers creative story-building and character interaction with support for multiple APIs, in-chat assets, regex capabilities, and more. It supports OpenAI, Claude, Gemini, DeepInfra, Ooba, OpenRouter, and more. It can display an image of the current character that matches the character's expression, and it lets you modify the model's output with regex to build a custom GUI, among other things.

Downloads: 7 This Week

Last Update: 2 days ago
See Project
22

Groq AppGen

Project showcasing Llama 3.3 70B HTML codegen abilities

Groq AppGen is an interactive web application (built with Next.js and TypeScript) that uses Groq’s LLM API to generate or modify web application code based on natural-language prompts. Essentially, you tell the app what kind of web app or page you want (in plain English), and groq-appgen will produce HTML/JSX code scaffolding, layout, and optionally application logic accordingly. It supports iterative feedback: you can refine your prompt, adjust parameters or requirements, and have the app regenerate or update the code — facilitating rapid prototyping and experimentation. ...

Downloads: 4 This Week

Last Update: 2025-12-12
See Project
23

Portkey AI Gateway

A blazing fast AI Gateway with integrated guardrails

Portkey AI Gateway aims to offer a blazing fast, secure, and flexible gateway for interacting with a wide variety of models and enforcing guardrails. It presents a single, friendly API through which you can route to 200+ LLMs, while applying configurable input/output guardrails to enforce policies or restrict certain content. It supports automatic retries, fallbacks, load balancing across providers or keys, and request timeouts to avoid latency spikes. The gateway is multimodal: it can...

Downloads: 0 This Week

Last Update: 2026-01-12
See Project
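
The retry and fallback behavior mentioned in the Portkey AI Gateway entry above can be illustrated with a small sketch. This is not Portkey's API: `Target`, `completeWithFallback`, and the `maxAttempts` field are hypothetical names, and the provider calls are modeled as synchronous functions to keep the example short, whereas real network calls would be asynchronous and would also need timeouts and backoff.

```typescript
// Hypothetical sketch: try each provider in order, retrying before falling back.
// Provider calls are synchronous stand-ins for what would be async network requests.
type ProviderCall = (prompt: string) => string;

interface Target {
  name: string;
  call: ProviderCall;
  maxAttempts: number; // attempts on this target before moving to the next one
}

function completeWithFallback(
  prompt: string,
  targets: Target[],
): { provider: string; text: string } {
  const errors: string[] = [];
  for (const target of targets) {
    for (let attempt = 1; attempt <= target.maxAttempts; attempt++) {
      try {
        return { provider: target.name, text: target.call(prompt) };
      } catch (err) {
        // Record the failure and either retry this target or fall through to the next.
        errors.push(`${target.name} attempt ${attempt}: ${String(err)}`);
      }
    }
  }
  throw new Error("All providers failed:\n" + errors.join("\n"));
}
```

The order of `targets` encodes priority in this sketch: the first entry is the preferred provider, and later entries are only reached after the earlier ones exhaust their attempts.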
24

Groq TypeScript / Node.js

The official Node.js / TypeScript library for the Groq API

Groq TypeScript / Node.js (also often referred to as “groq-sdk” on npm) is the official Node.js / TypeScript client library for Groq’s REST API, enabling JavaScript/TypeScript developers to integrate LLM and AI-powered services into web backends, serverless functions, or frontend apps. It exports strongly-typed interfaces for models, chat completions, file uploads (e.g. for audio transcription), and other endpoints, allowing for better type safety and developer experience when using Groq from TypeScript. ...

Downloads: 0 This Week

Last Update: 2025-12-18
See Project
25

Llama Coder

Open source Claude Artifacts – built with Llama 3.1 405B

Llama Coder is an open-source tool that lets you generate small applications (often React or web apps) from a single natural-language prompt using the Llama 3 family of models. It’s framed as an open-source “Claude Artifacts”-style experience: you describe the app you want, the tool calls an LLM hosted on Together.ai, and you get back a runnable code artifact. The project includes a web interface where you can enter prompts, see generated code, and run or tweak the result directly in the browser. Technically, it is built using a modern TypeScript/Next.js stack and integrates with Together’s API, making it a good blueprint for building your own AI-powered developer tools. ...

Downloads: 15 This Week

Last Update: 2026-02-08
See Project