fastapi free download

ChatGLM2-6B

ChatGLM2-6B: An Open Bilingual Chat LLM

...The context window extends up to 32K (FlashAttention), and Multi-Query Attention improves speed and memory use. The repo includes Python APIs, CLI & web demos, OpenAI-style/FASTAPI servers, and quantized checkpoints for lightweight local deployment on GPUs or CPU/MPS.

Downloads: 2 This Week

Last Update: 19 hours ago

See Project

LangChain Extract

Did you say you like data?

...The project implements a lightweight web service that allows developers to define extraction schemas and apply them to various sources such as plain text, HTML, or PDF documents. Built using FastAPI and the LangChain framework, the application exposes a REST API that can process documents and return structured outputs that match user-defined JSON schemas. Developers can create reusable “extractors” that define what type of information should be pulled from a document, along with example prompts that improve extraction quality through in-context learning.

Downloads: 0 This Week

Last Update: 2026-03-09

See Project

LLaMA Efficient Tuning

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon

Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)

Downloads: 0 This Week

Last Update: 2025-12-31

See Project

Gemini Fullstack LangGraph Quickstart

Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph

gemini-fullstack-langgraph-quickstart is a fullstack reference application from Google DeepMind’s Gemini team that demonstrates how to build a research-augmented conversational AI system using LangGraph and Google Gemini models. The project features a React (Vite) frontend and a LangGraph/FastAPI backend designed to work together seamlessly for real-time research and reasoning tasks. The backend agent dynamically generates search queries based on user input, retrieves information via the Google Search API, and performs reflective reasoning to identify knowledge gaps. It then iteratively refines its search until it produces a comprehensive, well-cited answer synthesized by the Gemini model. ...

Downloads: 4 This Week

Last Update: 19 hours ago

See Project

Infinity

Low-latency REST API for serving text-embeddings

Infinity is a high-throughput, low-latency REST API for serving vector embeddings, supporting all sentence-transformer models and frameworks. Infinity is developed under MIT License. Infinity powers inference behind Gradient.ai and other Embedding API providers.

Downloads: 0 This Week

Last Update: 2025-08-22

See Project

autollm

Ship RAG based LLM web apps in seconds

...The project focuses on simplifying the usual stack of model selection, document ingestion, vector storage, querying, and API deployment into a more unified developer experience. Its core idea is that a developer can create a query engine from a document set in just a few lines and then turn that same engine into a FastAPI application almost instantly. AutoLLM supports a broad range of language models and vector databases, which makes it useful for teams that want flexibility without rewriting their application architecture every time they switch providers. The framework also includes built-in readers for multiple content sources such as PDFs, DOCX files, notebooks, websites, and other document types, which helps shorten the time between raw data and a working knowledge application.

Downloads: 0 This Week

Last Update: 2026-03-10

See Project

Langcorn

Serving LangChain LLM apps automagically with FastApi

LangCorn is an API server that enables you to serve LangChain models and pipelines with ease, leveraging the power of FastAPI for a robust and efficient experience.

Downloads: 0 This Week

Last Update: 2023-11-06

See Project

LangChain Apps on Production with Jina

Langchain Apps on Production with Jina & FastAPI

...And if you prefer, you can also deploy your LangChain apps on your own infrastructure to ensure data privacy. With long chain-serve, you can craft REST/WebSocket APIs, spin up LLM-powered conversational Slack bots, or wrap your LangChain apps into FastAPI packages on the cloud or on-premises.

Downloads: 0 This Week

Last Update: 2023-08-25

See Project

Search Results for "fastapi"

Showing 8 open source projects for "fastapi"

ChatGLM2-6B

LangChain Extract

LLaMA Efficient Tuning

Gemini Fullstack LangGraph Quickstart

Infinity

autollm

Langcorn

LangChain Apps on Production with Jina

Search Results for "fastapi"

Showing 8 open source projects for "fastapi"

ChatGLM2-6B

LangChain Extract

LLaMA Efficient Tuning

Gemini Fullstack LangGraph Quickstart

Infinity

autollm

Langcorn

LangChain Apps on Production with Jina

Related Categories