Showing 64 open source projects for "document search engine"

View related business solutions
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 1
    GitHub search with Manticore Search

    GitHub search with Manticore Search

    Demo: GitHub search with Manticore Search

    GitHub search with Manticore Search. The Manticore GitHub Issue Search tool allows users to search through GitHub issues using Manticore Search, a powerful full-text search engine designed for large datasets and real-time processing. It integrates Manticore's capabilities with GitHub to offer fast and efficient searches within repositories.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 2
    Meilisearch

    Meilisearch

    An open-source, lightning-fast, and hyper-relevant search engine

    An open-source, lightning-fast, and hyper-relevant search engine that fits effortlessly into your apps, websites, and workflow. Meilisearch is a flexible and powerful user-focused search engine that can be added to any website or application. Search-as-you-type returns answers in less than 50 milliseconds. That's faster than the blink of an eye! Deploy in a matter of minutes. Smart presets let you start searching through your data with zero configuration. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 3
    TNTSearch

    TNTSearch

    A fully featured full text search engine written in PHP

    TNTSearch is a full-text search engine written in PHP, designed to be integrated into Laravel and other PHP applications. It offers real-time, efficient indexing and searching of textual data using SQLite as its storage backend. TNTSearch is highly configurable and supports features like fuzzy searching, customizable ranking algorithms, and boolean search, making it a powerful tool for adding search functionality to websites and applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 9 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 5
    Sonic

    Sonic

    Fast, lightweight & schema-less search backend

    Sonic is a super fast and lightweight, schema-less search backend that can be used in place of super-heavy and full-featured search backends like Elasticsearch. It is able to normalize language search queries, auto-complete search queries and offer the most relevant results. Being an identifier index rather than a document index, when queried it provides IDs that can be used to refer to matched documents in an external database.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    ripgrep

    ripgrep

    Regex pattern directory search tool that respects your .gitignore

    ripgrep is a line-oriented search tool that actively searches the directory you're currently in for a regex pattern. By default, ripgrep will ignore your .gitignore and skip hidden files or directories and binary files automatically. ripgrep has first class support on Windows, macOS and Linux, with binary downloads available for every release. ripgrep is similar to other popular search tools like The Silver Searcher, ack and grep. ripgrep supports arbitrary input preprocessing filters which...
    Downloads: 45 This Week
    Last Update:
    See Project
  • 7
    Apache Lucene

    Apache Lucene

    Apache Lucene open-source search software

    The Apache Lucene™ project develops open-source search software. The project releases a core search library, named Lucene™ core, as well as PyLucene, a Python binding for Lucene. Lucene Core is a Java library providing powerful indexing and search features, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. The PyLucene sub-project provides Python bindings for Lucene Core. The Apache Software Foundation provides support for the Apache community of...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Zoekt

    Zoekt

    Fast trigram based code search

    zoekt is a fast, scalable code search engine written in Go that builds compressed indexes (shards) for source repositories and serves low-latency regex and substring queries. It optimizes for developer workflows: refresh indexes from Git, search across many repos instantly, and jump into hits with surrounding context and file navigation. The indexer uses efficient n-gram dictionaries and posting lists to keep query latency low, while storing metadata (branch, language, repo) for precise filtering. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software. Icon
    Axe Credit Portal - ACP- is axefinance’s future-proof AI-driven solution to digitalize the loan process from KYC to servicing, available as a locally hosted or cloud-based software.

    Banks, lending institutions

    Founded in 2004, axefinance is a global market-leading software provider focused on credit risk automation for lenders looking to provide an efficient, competitive, and seamless omnichannel financing journey for all client segments (FI, Retail, Commercial, and Corporate.)
    Learn More
  • 10
    CMD-Search

    CMD-Search

    Command Search. "How should I know"

    CMD+Search / Command Search
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Syncdrome

    Syncdrome

    Syncdrome: Fast, smart file catalog & search for your drives

    Syncdrome is a standalone tool that creates a comprehensive catalog of files from your personal hard drives. It enables rapid keyword searches, delivering results in a style similar to web search engines. When a drive is connected, you can directly open files with your system’s default applications. The application lets you designate a working folder for storing the file catalog, which can even be located on a cloud storage service like Dropbox or Google Drive for easy access from any...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TextSeek

    TextSeek

    Professional full-text desktop search tool

    TextSeek is a professional full-text desktop search tool. Unlike the filename search tool like Everything and Listary, TextSeek can search filename and file content easily and quickly. It supports PDF, Word, Excel, Powerpoint, RTF and other formats. The software can run directly, and no extra package is required to install.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    FSearch

    FSearch

    A fast file search utility for Unix-like systems based on GTK3

    FSearch is a blazing-fast file search utility for Linux inspired by Windows’ Everything Search Engine, built to allow users to locate files and directories instantly by name rather than content. It achieves its high performance by indexing the filesystem and updating that index in real time, so as you create, rename, or delete files the results stay up to date without noticeable lag.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 14
    Paul Graham GPT

    Paul Graham GPT

    RAG on Paul Graham's essays

    Paul Graham GPT is a specialized AI-powered search and chat app built on a corpus of essays from Paul Graham, giving users the ability to query and discuss his writings in a conversational way. The repo stores the full text of his essays (chunked), uses embeddings (e.g. via OpenAI embeddings) to allow semantic search over that corpus, and hosts a chat interface that combines retrieval results with LLM-based answering — enabling RAG (retrieval-augmented generation) over a fixed dataset. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Bookmark manager

    Bookmark manager

    Bookmark manager web application

    Web Page Application: https://shemeshg.github.io/desktop-search/ Github: https://github.com/shemeshg/desktop-search-code * Dropbox sync is performed manually (from admin screen) and not on interval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 19

    Summa

    Lucene/Solr based search engine and workflow system

    Important: This project has been moved to https://github.com/statsbiblioteket/summa/ Lucens (and Solr) based search engine with very flexible setup and workflow system. It supports incremental updates, hierarchical faceting and index lookup with low memory overhead. Note: Although Summa is open source, the focus is on features used at Statsbiblioteket. No explicit resources has been allocated for support of external users.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Recent Document Tracker (RecDocTracker)

    Recent Document Tracker (RecDocTracker)

    Track and Search your recent documents and folders, customizable

    Recent Document Tracker Implements a similar function to Windows 7 Jumplists by tracking recent documents and folders, then by using document type icons in the taskbar tray you can pop-up a menu of recent documents sorted by last access time (like Jumplists) allowing you to quickly search for the files/folders you need. You can also pin/exclude documents as well indicate which documents are open is MS Office applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    Personalized Search Engine

    Personalized Search Engine for Your Files

    MySearchEngine (Personalized Search Engine) is a Java software to search files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup so that users can choose which directories to index or remove from an existing index and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and make user experience more comfortable. eLibrary can also extract text content from files of many wildly used file types such as pdf, doc, ppt, and mp3 to improve the index quality.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Torrent Search Engine (WinApp)

    Torrent Search Engine (WinApp)

    Fast and powerful search engine

    Torrent Search Engine: TorrentSearch is a framework and script language for searching Bit Torrent websites and search engines for torrents.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    Torrent Search Engine

    Torrent Search Engine

    A fast cross-platform TorrentSearch client

    Torrent Search Engine: TorrentSearch is a framework and script language for searching Bit Torrent websites and search engines for torrents.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging and metadata functions to provide new user experience of desktop navigation and document management.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    JavaTB
    JavaTB is a Java ToolBox that goes beyond what operating systems provide for file management, networking, system monitoring and many more functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next