rag-search is a lightweight Retrieval-Augmented Generation API service designed to provide structured semantic search and answer generation through a simple FastAPI backend. The project integrates web search, vector embeddings, and reranking logic to retrieve relevant context before passing it to a language model for response generation. It is built to be easily deployable, requiring only environment configuration and dependency installation to run a functional RAG service. The system supports configurable filtering, scoring thresholds, and reranking options, allowing developers to fine-tune retrieval quality. Its architecture is modular, separating handlers, services, and utilities to support customization and extension. Overall, rag-search serves as a practical starter backend for teams building AI search or question-answering applications on their own data.

Features

  • FastAPI-based RAG backend service
  • Vector database integration for semantic retrieval
  • Configurable reranking and filtering controls
  • Support for external web search providers
  • Token-based API authentication
  • Modular Python service architecture

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow rag-search

rag-search Web Site

Other Useful Business Software
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
Try Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of rag-search!

Additional Project Details

Programming Language

Python

Related Categories

Python Search Software, Python Semantic Search Tool

Registered

2026-03-03