Showing 241 open source projects for "search"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    Agent Reach

    Agent Reach

    Give your AI agent eyes to see the entire internet

    Agent Reach is a command-line tool that gives AI agents access to public internet content across multiple platforms. It is designed for agent workflows that need to read and search sources like Twitter, Reddit, YouTube, GitHub, Bilibili, and XiaoHongShu without relying on paid platform APIs. The project focuses on giving AI tools broader visibility into social, video, code, and community data through one interface. It can be useful for research agents, automation workflows, competitive monitoring, content analysis, and social listening. ...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 2
    PipesHub

    PipesHub

    Workplace AI platform for enterprise search and workflow automation

    PipesHub AI is an open-source, enterprise-grade workplace AI platform designed to unify search, knowledge management, and workflow automation across distributed organizational systems. It connects to a wide range of enterprise tools such as Google Workspace, Slack, Jira, and Confluence, aggregating data into a centralized knowledge layer that can be queried using natural language. The platform uses knowledge graphs and ranking algorithms to provide context-rich answers along with traceable sources, improving transparency and trust in AI-generated insights. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Databend

    Databend

    Cloud-native open source data warehouse for analytics and AI queries

    ...This architecture enables cost-efficient storage and elastic scaling for workloads that involve large datasets and complex queries. Databend provides a unified engine capable of handling analytics, vector search, and full-text search within a single platform. Databend supports SQL-based workflows and enables real-time data ingestion, transformation, and analysis through streaming and task orchestration features. With its cloud-native design and distributed architecture, Databend can run both as a self-hosted system or within managed environments to power data analytics, AI workloads, and large-scale data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    kg-gen

    kg-gen

    Knowledge Graph Generation from Any Text

    ...The framework addresses common problems in automatic knowledge graph construction, particularly sparsity and duplication of entities, by applying a clustering and entity-resolution process that merges semantically similar nodes. This allows the generated graphs to be denser, more coherent, and easier to use for downstream tasks such as retrieval-augmented generation, semantic search, and reasoning systems.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    SurfSense

    SurfSense

    Connect any LLM to your internal knowledge sources

    ...Team collaboration is a core focus, with real-time shared chats, role-based access control, and comment threads enabling organized workflows. The platform also supports advanced retrieval augmented generation (RAG) capabilities, enabling powerful search and citation features that help answer questions with contextually relevant data.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 6
    OpenRecall

    OpenRecall

    OpenRecall is a fully open-source, privacy-first alternative

    OpenRecall is an open-source, privacy-first system designed to capture, index, and make searchable a user’s entire digital activity history, effectively acting as a personal memory layer for computing environments. It works by taking periodic screenshots of a user’s screen and applying local AI processing, including OCR and semantic analysis, to extract and structure information from both text and images. This data is then indexed into a searchable database, allowing users to retrieve past...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    BCEmbedding

    BCEmbedding

    Netease Youdao's open-source embedding and reranker models

    ...BCEmbedding also provides integrations for popular RAG frameworks, making it easier to add semantic search and reranking to AI applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    ModernBERT

    ModernBERT

    Bringing BERT into modernity via both architecture changes and scaling

    ...The goal of the project is to bring BERT-style models up to date with the capabilities of modern large language models while preserving the strengths of bidirectional encoder architectures used for tasks such as classification, retrieval, and semantic search. ModernBERT introduces architectural improvements that enhance both training efficiency and inference performance, making the model more suitable for modern large-scale machine learning pipelines. The repository also includes FlexBERT, a modular framework that allows developers to experiment with different encoder building blocks and configurations when constructing new models.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    MCP ZoomEye

    MCP ZoomEye

    A Model Context Protocol server that provides network asset info

    The ZoomEye MCP Server is a Model Context Protocol server that provides network asset information based on query conditions, allowing Large Language Models to obtain data by querying ZoomEye using dorks and other search parameters. ​
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop vibe-debugging. Icon
    Stop vibe-debugging.

    Plug Claude into your app's actual errors.

    AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.
    Free 30 days.
  • 10
    PixelRAG

    PixelRAG

    The beginning of scalable pixel-native search

    ...This approach preserves layout, tables, charts, diagrams, infographics, and other visual structure that traditional HTML or text parsing can miss. The project includes tools for rendering, chunking, embedding, indexing, and serving visual search indexes. It also provides a hosted API with a prebuilt Wikipedia index, plus local pipelines for building indexes from custom documents. PixelRAG can be used with Claude through the pixelbrowse skill, giving agents the ability to inspect pages visually instead of relying only on raw markup.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Jina

    Jina

    Build cross-modal and multimodal applications on the cloud

    Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    GLM-4.6

    GLM-4.6

    Agentic, Reasoning, and Coding (ARC) foundation models

    GLM-4.6 is the latest iteration of Zhipu AI’s foundation model, delivering significant advancements over GLM-4.5. It introduces an extended 200K token context window, enabling more sophisticated long-context reasoning and agentic workflows. The model achieves superior coding performance, excelling in benchmarks and practical coding assistants such as Claude Code, Cline, Roo Code, and Kilo Code. Its reasoning capabilities have been strengthened, including improved tool usage during inference...
    Downloads: 90 This Week
    Last Update:
    See Project
  • 13
    MemPalace

    MemPalace

    The highest-scoring AI memory system ever benchmarked

    ...Instead of relying on summarization or selective extraction like most memory tools, it takes a radically different approach by storing conversations in their entirety and making them retrievable through structured organization and semantic search. The system is inspired by the classical “memory palace” mnemonic technique, organizing information into hierarchical spaces such as wings, rooms, and halls, which allows AI agents to navigate past knowledge in a more contextual and intuitive way. It operates fully locally using tools like ChromaDB, meaning it requires no API keys, cloud services, or external dependencies once installed. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 14
    vLLM

    vLLM

    A high-throughput and memory-efficient inference and serving engine

    vLLM is a fast and easy-to-use library for LLM inference and serving. High-throughput serving with various decoding algorithms, including parallel sampling, beam search, and more.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 15
    The AI Scientist-v2

    The AI Scientist-v2

    Workshop-Level Automated Scientific Discovery via Agentic Tree Search

    ...It introduces a generalized framework that removes reliance on predefined templates, enabling broader applicability across multiple machine learning domains and more open-ended exploration of research problems. A key innovation is its progressive agentic tree search, which systematically explores experimental paths and is coordinated by an experiment manager agent that guides decision-making. The system also integrates automated review mechanisms, including vision-language feedback loops, to iteratively refine the quality of generated research outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SimpleMem

    SimpleMem

    SimpleMem: Efficient Lifelong Memory for LLM Agents

    SimpleMem is a lightweight memory-augmented model framework that helps developers build AI applications that retain long-term context and recall relevant information without overloading model context windows. It provides easy-to-use APIs for storing structured memory entries, querying those memories using semantic search, and retrieving context to augment prompt inputs for downstream processing. Unlike monolithic systems where memory management is ad-hoc, SimpleMem formalizes a memory lifecycle—write, index, retrieve, refine—so applications can handle user history, document collections, or dynamic contextual state systematically. It supports customizable embedding models, efficient vector indexes, and relevance weighting, making it practical for building assistants, personal agents, or domain-specific retrieval systems that need persistent knowledge.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Python Client For NLP Cloud

    Python Client For NLP Cloud

    NLP Cloud serves high performance pre-trained or custom models for NER

    NLP Cloud serves high performance pre-trained or custom models for NER, sentiment-analysis, classification, summarization, dialogue summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keywords and keyphrases extraction, text generation, image generation, blog post generation, source code generation, question answering, automatic speech recognition, machine translation, language detection, semantic search, semantic similarity, tokenization, POS tagging, embeddings, and dependency parsing. It is ready for production, served through a REST API. You can either use the NLP Cloud pre-trained models, fine-tune your own models, or deploy your own models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    memsearch

    memsearch

    A Markdown-first memory system, a standalone library for any AI agent

    memsearch is a markdown-first memory system designed to provide long-term memory capabilities for AI agents through structured storage and semantic retrieval. It enables agents to store, organize, and retrieve information using embeddings and hybrid search techniques, ensuring that relevant context is always available. The system supports advanced features such as reranking and progressive disclosure, which help prioritize the most useful information for a given query. It integrates with vector databases like Milvus, enabling scalable storage and retrieval of large datasets. Memsearch is designed to be agent-friendly, making it easy to plug into existing AI workflows and enhance reasoning capabilities. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19
    Sunfish

    Sunfish

    Sunfish: a Python Chess Engine in 111 lines of code

    ...It implements classic chess engine techniques such as alpha-beta pruning and efficient board representation while maintaining readability and simplicity. The project is often used as an educational tool for understanding game AI, search algorithms, and evaluation functions without the complexity of larger engines. It includes a simple UCI-compatible interface, allowing it to be integrated with graphical chess interfaces or used in terminal-based gameplay. The codebase is intentionally minimal, making it ideal for experimentation, modification, and learning purposes.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Director

    Director

    AI video agents framework for next-gen video interactions

    Director is a video database management system designed to organize, search, and retrieve large collections of video content efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Memobase

    Memobase

    Fast backend for long-term AI user memory via structured profiles

    ...Instead of relying purely on traditional embedding-based retrieval or RAG systems, Memobase uses profile and timeline structures to deliver memory that reflects user context efficiently and meaningfully. The system focuses on three principal performance metrics: high search performance, reduced large language model (LLM) costs through batch processing techniques, and low latency with minimal SQL operations. Memobase supports integration with existing LLM workflows via APIs and SDKs (including Python, Node, and Go), making it easy to adopt within diverse application stacks.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 22
    Papermerge

    Papermerge

    Open Source Document Management System for Digital Archives

    ...Instead of having piles of paper documents all over your desk, office or drawers - you can quickly scan them and configure your scanner to directly upload to Papermerge DMS. Store, organize and index scanned documents in PDF, JPEG and TIFF formats. Instantly find relevant information using full text, tags and metadata-based search. Papermerge is free and open-source software which means that transparency is the core value of our software development. Source code can be reviewed and improved by anyone from anywhere. Papermerge supports multiple users. Each user can be assigned different permissions to perform only a specific kind of action e.g. view only documents from a specific folder. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 23
    Fli

    Fli

    Google Flights MCP and Python Library

    ...This approach enables faster, more reliable, and more stable access to flight information, avoiding the fragility associated with HTML parsing and UI changes. The library supports a wide range of flight search capabilities, including filtering by airline, departure time, number of stops, cabin class, and sorting by price or duration, making it suitable for both casual queries and advanced travel analysis. In addition to its CLI interface, fli includes a Model Context Protocol (MCP) server that allows AI assistants to interact with flight data using structured tools, enabling natural language queries and automation workflows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DeepSearcher

    DeepSearcher

    Open Source Deep Research Alternative to Reason and Search

    DeepSearcher is an open-source “deep research” style system that combines retrieval with evaluation and reasoning to answer complex questions using private or enterprise data. It is designed around the idea that high-quality answers require more than top-k retrieval, so it orchestrates multi-step search, evidence collection, and synthesis into a comprehensive response. The project integrates with vector databases (including Milvus and related options) so organizations can index internal documents and query them with semantic retrieval. It also supports flexible embeddings, making it easier to choose different embedding models depending on domain requirements, latency targets, or accuracy goals. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    ArXiv MCP Server

    ArXiv MCP Server

    A Model Context Protocol server for searching and analyzing arXiv

    arxiv-mcp-server bridges AI assistants and the arXiv repository through a clean MCP interface, enabling search, metadata retrieval, and content access without bespoke scraping. With simple tools like “search” and “fetch,” an agent can find papers, pull abstracts, and download PDFs for downstream summarization or analysis. The project includes packaging and CI to publish to PyPI, plus tests and linting for reliability. Issue threads show feature requests such as extracting embedded LaTeX and improving markdown conversion, reflecting active community use in research flows. ...
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo