Page 2 | rag free download

Showing 126 open source projects for "rag"

View related business solutions

Artificial Intelligence Linux Clear Filters & Widen Search

Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

LightRAG is a lightweight Retrieval-Augmented Generation (RAG) framework designed for efficient document retrieval and response generation. It is optimized for speed and lower resource consumption, making it ideal for real-time applications.

Downloads: 0 This Week

Last Update: 1 day ago
See Project
2

Hands-On Large Language Models

Official code repo for the O'Reilly Book

...The repository is structured into chapters that align with the educational progression of the book — covering everything from foundational topics like tokens, embeddings, and transformer architecture to advanced techniques such as prompt engineering, semantic search, retrieval-augmented generation (RAG), multimodal LLMs, and fine-tuning. Each chapter contains executable Jupyter notebooks that are designed to be run in environments like Google Colab, making it easy for learners to experiment interactively with models, visualize attention patterns, implement classification and generation tasks.

Downloads: 55 This Week

Last Update: 2026-04-24
See Project
3

EmoLLM

Pre & Post-training & Dataset & Evaluation & Depoly & RAG

...Its repository includes multiple model variants and training configurations spanning several underlying model families, including InternLM, Qwen, DeepSeek, Mixtral, LLaMA, and others, which shows that the initiative is structured as a broad ecosystem rather than a single release. The project also covers more than just model weights, with material for datasets, fine-tuning, evaluation, deployment, demos, RAG, and related subprojects such as its psychological digital assistant work.

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
4

UltraRAG

Less Code, Lower Barrier, Faster Deployment

UltraRAG 2.0 is a low-code, MCP-enabled RAG framework that aims to lower the barrier to building complex retrieval pipelines for research and production. It provides end-to-end recipes—from encoding and indexing corpora to deploying retrievers and LLMs—so users can reproduce baselines and iterate rapidly. The toolkit comes with built-in support for popular RAG datasets, large corpora, and canonical baselines, plus documentation that walks from “quick start” to debugging and case analysis. ...

Downloads: 0 This Week

Last Update: 2026-04-09
See Project
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
5

Verba

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Welcome to Verba: The Golden RAGtriever, a community-driven open-source application designed to offer an end-to-end, streamlined, and user-friendly interface for Retrieval-Augmented Generation (RAG) out of the box. In just a few easy steps, explore your datasets and extract insights with ease, either locally with Ollama and Huggingface or through LLM providers such as Anthrophic, Cohere, and OpenAI. This project is built with and for the community, please be aware that it might not be maintained with the same urgency as other Weaviate production applications.

Downloads: 0 This Week

Last Update: 2025-07-14
See Project
6

AI Engineering Hub

In-depth tutorials on LLMs, RAGs and real-world AI agent applications

The AI Engineering Hub repository is a large open-source collection of hands-on projects, tutorials, and real-world AI engineering resources designed to help developers learn and build with modern AI technologies, especially large language models (LLMs), retrieval-augmented generation (RAG), and agent-based systems. It includes more than 90 production-ready projects across skill levels, organized into beginner, intermediate, and advanced categories to guide users progressively from simple experiments to complex AI workflows. Projects range from OCR applications and local chatbot UIs to multimodal RAG systems and multi-agent automation pipelines, making the hub valuable both as a learning resource and as a practical reference. ...

Downloads: 0 This Week

Last Update: 2026-06-08
See Project
7

Coze Studio

An AI agent development platform with all-in-one visual tools

Coze Studio is ByteDance’s open‑source, visual AI agent development platform. It offers no-code/low-code workflows to build, debug, and deploy conversational agents, integrating prompting, RAG-based knowledge bases, plugin systems, and workflow orchestration. Developed in Go (backend) and React/TypeScript (frontend), it uses a containerized microservices architecture suitable for enterprise deployment.

Downloads: 2 This Week

Last Update: 2026-01-20
See Project
8

LlamaParse

Parse files for optimal RAG

LlamaParse is a GenAI-native document parser that can parse complex document data for any downstream LLM use case (RAG, agents). Load in 160+ data sources and data formats, from unstructured, and semi-structured, to structured data (API's, PDFs, documents, SQL, etc.) Store and index your data for different use cases. Integrate with 40+ vector stores, document stores, graph stores, and SQL db providers.

Downloads: 1 This Week

Last Update: 2026-02-13
See Project
9

SurfSense

Connect any LLM to your internal knowledge sources

...Team collaboration is a core focus, with real-time shared chats, role-based access control, and comment threads enabling organized workflows. The platform also supports advanced retrieval augmented generation (RAG) capabilities, enabling powerful search and citation features that help answer questions with contextually relevant data.

Downloads: 6 This Week

Last Update: 2026-06-18
See Project
$300 Free Credits to Build on Google Cloud
New to Google Cloud? Get $300 in credits to explore Compute Engine, BigQuery, Cloud Run, Gemini Enterprise Agent Platform, and more.

Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query petabytes in BigQuery, or build agents with Gemini Enterprise Agent Platform. Once your credits are used, keep building with 20+ always-free tier products including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. No commitment required—just sign up and start building.

Claim $300 Free
10

Agents Towards Production

Code-first tutorials covering every layer of GenAI agents

Agents Towards Production is an opinionated, code-first playbook for taking AI agents from prototype to production-ready systems. Instead of focusing only on toy examples, it dives into every layer of an agent stack: orchestration, memory, RAG, tool and API integration, security, observability, deployment, evaluation, and UI. The repository is built around runnable tutorials, each in its own folder, often sponsored by or built in collaboration with infrastructure providers like LangChain, Redis, Bright Data, Contextual AI, Tavily, Runpod, Portia, and others. These tutorials show how to implement things like secure tool calling with OAuth, dual-memory architectures, production RAG agents, multi-agent communication protocols, GPU deployment, containerization with Docker, FastAPI endpoints, and Streamlit chat UIs. ...

Downloads: 0 This Week

Last Update: 2026-06-17
See Project
11

Neuron AI

The PHP Agentic Framework to build production-ready AI driven apps

Neuron AI is a PHP agentic framework for building production-ready AI applications that connect models, memory, vector databases, and tools into working agents. It is designed for developers who want to create systems such as RAG pipelines, multi-agent workflows, and business process automations without having to hand-build every integration from scratch. The framework provides an Agent class that can be extended to inherit core capabilities like memory, tools, function calling, and retrieval-augmented generation. Its design is modular, so developers can swap model providers with minimal changes to their application code, which makes it practical for teams that need flexibility across vendors. ...

Downloads: 5 This Week

Last Update: 2 days ago
See Project
12

MaxKB

Open-source platform for building enterprise-grade agents

MaxKB (Max Knowledge Brain) is an open-source platform for building enterprise-grade AI agents with strong knowledge retrieval, RAG pipelines, and workflow orchestration. It focuses on practical deployments such as customer support, internal knowledge bases, research assistants, and education, bundling tools for data ingestion, chunking, embedding, retrieval, and answer synthesis. The system exposes flexible tool-use (including MCP), supports multi-model backends, and provides dashboards for dataset management and evaluation. ...

Downloads: 7 This Week

Last Update: 2026-06-17
See Project
13

Generative AI for Beginners (Version 3)

21 Lessons, Get Started Building with Generative AI

...The course covers everything from model selection, prompt engineering, and chat/text/image app patterns to secure development practices and UX for AI. It also walks through modern application techniques such as function calling, RAG with vector databases, working with open source models, agents, fine-tuning, and using SLMs. Each lesson includes a short video, a written guide, runnable samples for Azure OpenAI, the GitHub Marketplace Model Catalog, and the OpenAI API, plus a “Keep Learning” section for deeper study.

Downloads: 5 This Week

Last Update: 4 days ago
See Project
14

Opik

Debug, evaluate, and monitor your LLMapps, RAG systems, and agentic AI

Confidently evaluate, test, and monitor LLM applications. Opik is an open-source platform for evaluating, testing, and monitoring LLM applications. Built by Comet. Record, sort, search, and understand each step your LLM app takes to generate a response. Manually annotate, view, and compare LLM responses in a user-friendly table. Log traces during development and in production. Run experiments with different prompts and evaluate against a test set. Choose and run pre-configured evaluation...

Downloads: 5 This Week

Last Update: 8 hours ago
See Project
15

DeepEval

...DeepEval incorporates the latest research to evaluate LLM outputs based on metrics such as G-Eval, hallucination, answer relevancy, RAGAS, etc., which uses LLMs and various other NLP models that run locally on your machine for evaluation. Whether your application is implemented via RAG or fine-tuning, LangChain, or LlamaIndex, DeepEval has you covered. With it, you can easily determine the optimal hyperparameters to improve your RAG pipeline, prevent prompt drifting, or even transition from OpenAI to hosting your own Llama2 with confidence.

Downloads: 2 This Week

Last Update: 2026-05-28
See Project
16

Lecca.io

Lecca.io | AI Agents & Automations

Lecca.io is an AI platform that allows you to configure and deploy Large Language Models (LLMs) equipped with powerful tools and workflows. Build, customize, and automate your AI agents with ease.

Downloads: 0 This Week

Last Update: 2025-05-18
See Project
17

Self-hosted AI Package

Run all your local AI together in one package

...The stack typically includes Ollama for running local large language models, n8n as a low-code workflow automation platform, Supabase for database and vector storage, Open WebUI for interacting with models, Flowise for agent building, and additional services like SearXNG, Neo4j, and Langfuse for search, knowledge graphs, and observability. This integrated setup allows users to experiment with RAG pipelines, automated workflows, AI agents, and project data management without relying on external hosted services, increasing flexibility and privacy. The repository comes with example workflows (such as Local RAG AI Agent workflows) and environment configurations that help streamline setup and encourage customization.

Downloads: 3 This Week

Last Update: 2026-02-01
See Project
18

Bionic GPT

Bionic is an on-premise replacement for ChatGPT

...The interface is intentionally familiar, offering a ChatGPT-like experience with customizable branding, fast Rust-based performance, and conversation history management. Beyond chat, Bionic focuses heavily on enterprise RAG by letting users create AI assistants that work with their own documents, share those assistants across teams, and configure embeddings, chunking, and system prompts through the UI. The platform supports a wide variety of document types, includes data isolation features for teams, and layers in security measures such as RBAC, row-level security in Postgres, strong content security policy settings, and minimal container builds.

Downloads: 0 This Week

Last Update: 2026-04-20
See Project
19

AgentGuide

AI Agent Development Guide, LangGraph in Action, Advanced RAG

AgentGuide is an open-source learning resource designed to provide a structured pathway for understanding and building AI agents. The project aggregates tutorials, research papers, frameworks, and practical resources related to agent development with large language models. Instead of presenting scattered resources, the repository organizes them into a systematic learning roadmap that guides learners from foundational concepts to advanced AI agent systems. The guide covers topics such as...

Downloads: 0 This Week

Last Update: 2026-06-09
See Project
20

Generative AI

Sample code and notebooks for Generative AI on Google Cloud

Generative AI is a comprehensive collection of code samples, notebooks, and demo applications designed to help developers build generative-AI workflows on the Vertex AI platform. It spans multiple modalities—text, image, audio, search (RAG/grounding) and more—showing how to integrate foundation models like the Gemini family into cloud projects. The README emphasises getting started with prompts, datasets, environments and sample apps, making it ideal for both experimentation and production-ready usage. The repository architecture is organised into folders like gemini/, search/, vision/, audio/, and rag-grounding/, which helps developers locate use cases by modality. ...

Downloads: 3 This Week

Last Update: 7 days ago
See Project
21

Ax

Build LLM powered Agents and "Agentic workflows"

Build intelligent agents quickly — inspired by the power of "Agentic workflows" and the Stanford DSPy paper. Seamlessly integrates with multiple LLMs and VectorDBs to build RAG pipelines or collaborative agents that can solve complex problems. Advanced features streaming validation, multi-modal DSPy, etc. We've renamed from "llmclient" to "ax" to highlight our focus on powering agentic workflows. We agree with many experts like "Andrew Ng" that agentic workflows are the key to unlocking the true power of large language models and what can be achieved with in-context learning. ...

Downloads: 1 This Week

Last Update: 23 hours ago
See Project
22

ChatOllama

ChatOllama is an open-source AI chatbot

...The platform also includes higher-level capabilities such as AI agents, document-backed knowledge bases, real-time voice chat, and Model Context Protocol integration for external tools. Its RAG functionality allows document upload and knowledge-base-driven interaction, while vector database support adds more scalable retrieval options. Deployment is streamlined with Docker Compose, and the project also includes internationalization and modular feature toggles for controlling what parts of the system are enabled. As a result, ChatOllama feels less like a single chatbot and more like a flexible self-hosted AI workspace.

Downloads: 0 This Week

Last Update: 2026-05-28
See Project
23

RAPTOR

The official implementation of RAPTOR

RAPTOR is a retrieval architecture designed to improve retrieval-augmented generation systems by organizing documents into hierarchical structures that enable more effective context retrieval. Traditional RAG systems typically retrieve small text chunks independently, which can limit a model’s ability to understand broader document context. RAPTOR addresses this limitation by recursively embedding, clustering, and summarizing documents to create a tree-structured hierarchy of information. Each level of the tree represents summaries at different levels of abstraction, allowing retrieval to operate at both detailed and high-level conceptual layers. ...

Downloads: 0 This Week

Last Update: 2026-03-06
See Project
24

Pathway AI Pipelines

Ready-to-run cloud templates for RAG

Pathway AI Pipelines is a collection of ready-to-deploy AI pipeline templates designed to help developers rapidly build production-grade retrieval-augmented generation and enterprise search applications. The project provides end-to-end examples that connect live data sources to LLM workflows, enabling applications to stay synchronized with continuously changing information. It supports numerous connectors including local files, Google Drive, SharePoint, Kafka, PostgreSQL, and real-time APIs,...

Downloads: 0 This Week

Last Update: 2026-06-10
See Project
25

MiniRAG

Making RAG Simpler with Small and Open-Sourced Language Models

MiniRAG is a lightweight retrieval-augmented generation tool designed to bring the benefits of RAG workflows to smaller datasets, edge environments, and constrained compute settings by simplifying embedding, indexing, and retrieval. It extracts text from documents, codes, or other structured inputs and converts them into embeddings using efficient models, then stores these vectors for fast nearest-neighbor search without requiring huge databases or separate vector servers.

Downloads: 0 This Week

Last Update: 2026-02-03
See Project