50 projects for "document search engine" with 2 filters applied:

  • 1
    Zoekt

    Fast trigram based code search

    zoekt is a fast, scalable code search engine written in Go that builds compressed indexes (shards) for source repositories and serves low-latency regex and substring queries. It optimizes for developer workflows: refresh indexes from Git, search across many repos instantly, and jump into hits with surrounding context and file navigation. The indexer uses efficient n-gram dictionaries and posting lists to keep query latency low, while storing metadata (branch, language, repo) for precise filtering. ...
    Downloads: 1 This Week
  • 2
    LogicalDOC Document Management - DMS

    smart and open source document management system

    LogicalDOC is both a document management and a collaboration system. The software is loaded with many functions and allows organizing, indexing, retrieving, controlling, and distributing important business documents securely and safely for any organization and individual. Gone are the days when companies used paper-based processes such as printing, mailing, and manual filing of paper documents; our document management system replaces all of this with electronic procedures that allow your...
    Downloads: 319 This Week
  • 3
    AZip

    AZip - A portable Zip Archive Manager

    AZip is a free Zip Archive Manager with unique features such as built-in content Search, smart archive Update, and archive Recompression. The goal is to keep the user interface as simple as possible and to bother the user as little as possible with options or useless gadgets. More details and screenshots on the AZip Web site: http://azip.sf.net Alire crate: https://alire.ada.dev/crates/azip Mirror: https://github.com/zertovitch/azip
    Downloads: 10 This Week
  • 4
    Interleave is a business process management application. It enables you to model your business process and make it available online. It's meant to replace processes which currently rely on paper or spreadsheets and it has a good workflow engine.
    Downloads: 2 This Week
  • 5
    Paul Graham GPT

    RAG on Paul Graham's essays

    Paul Graham GPT is a specialized AI-powered search and chat app built on a corpus of essays from Paul Graham, giving users the ability to query and discuss his writings in a conversational way. The repo stores the full text of his essays (chunked), uses embeddings (e.g. via OpenAI embeddings) to allow semantic search over that corpus, and hosts a chat interface that combines retrieval results with LLM-based answering — enabling RAG (retrieval-augmented generation) over a fixed dataset. The...
    Downloads: 0 This Week
  • 6
    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system for exploring the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop) search engine with enhanced functionality for file, email, and blog search. Look at our GitLab homepage for source code and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
  • 7
    YaCy Peer-to-Peer Search Engine

    Decentralized Web Search Engine

    YaCy is a free search engine that anyone can use to search the internet (www and ftp) or to create a search portal for others (internet or intranet). The scale of YaCy is limited only by the number of users, and it can index billions of web pages. In p2p mode it is fully decentralized: all users of the search engine network are equal, and it is not possible for anyone to censor the content of the distributed index.
    Downloads: 1 This Week
  • 8
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    OpenSearchServer is a powerful, enterprise-class search engine program. Using the web user interface, the crawlers (web, file, database, etc.), and the client libraries (REST/API, Ruby, Rails, Node.js, PHP, Perl), you can quickly and easily integrate advanced full-text search capabilities into your application: full-text search with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. ...
    Downloads: 3 This Week
  • 9
    TWiki Enterprise Collaboration Platform

    Open Source Enterprise Wiki and Web Application Platform

    Enterprise collaboration platform with over 400 extensions. Users without programming skills can create web applications. 50,000 small businesses, many Fortune 500 companies, and millions of people use TWiki, with installations of hundreds of thousands of pages. Visit http://twiki.org/ for the open source TWiki. The TWiki community is focusing on building the best collaboration platform for the workplace. We invite you to GET INVOLVED! More at http://bit.ly/twGetInv.
    Downloads: 11 This Week
  • 10
    xinco - Document Management System, DMS
    xinco [eXtensible INformation COre] is a powerful web-service-based Information and Document Management System (DMS) for files, text, URLs, and contacts, featuring ACLs, versioning, full-text search, and an FTP-like client (easy install, J2EE+MySQL/PostgreSQL).
    Downloads: 1 This Week
  • 11
    phpTransformer
    phpTransformer is the first enterprise web content management system (CMS) to fully support multiple languages for both interface and content. It stores data in a MySQL database and offers multiple themes, user and group permissions, and search-engine-friendly (SEO) design. phptransformer.com
    Downloads: 0 This Week
  • 12
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging, and metadata functions to provide a new user experience of desktop navigation and document management.
    Downloads: 0 This Week
  • 13
    CLucene is a C++ port of Lucene, the high-performance, full-featured text search engine written in Java. CLucene is faster than Lucene as it is written in C++.
    Downloads: 115 This Week
  • 14

    mar

    Mail ARchiver

    Open Source Mail Archiver & Search engine.
    Downloads: 0 This Week
  • 15

    edocias

    Electronic Document Index And Search

    EDocIAS (Electronic Document Index And Search) is a PHP-based tool for indexing and searching files of various types. Third-party tools (tesseract, xpdf, etc.) can be configured to support any type of file.
    Downloads: 0 This Week
  • 16
    Platform for parallel computation in the Amazon cloud, including machine learning ensembles written in R for computational biology and other areas of scientific research. Home to MR-Tandem, a Hadoop-enabled fork of the X!Tandem peptide search engine.
    Downloads: 0 This Week
  • 17
    Documents In My Pocket (dimp)
    A simple document management system (DMS). Documents and associated data are stored in a single file. Keep all your documents in your pocket on your USB stick.
    Downloads: 0 This Week
  • 18
    Document summarization system. After document content is added to the system, user queries generate a summary document containing the information available to the system.
    Downloads: 0 This Week
  • 19
    Ex-Crawler
    Ex-Crawler is divided into three subprojects (crawler daemon, distributed GUI client, and (web) search engine) which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
    Downloads: 0 This Week
  • 20
    A full-text indexing engine designed for XML files on a slow read-only medium.
    Downloads: 0 This Week
  • 21
    Webstats Solr is an attempt to make Apache access logs easier to data-mine. By adding a powerful search engine (Solr) as a backend and using JavaScript, HTML, and maybe PHP, I hope to make AWStats obsolete.
    Downloads: 0 This Week
  • 22
    jBingAPI is a Java library for querying Microsoft's Bing search engine (http://www.bing.com/) using its public API. jBingAPI just makes it a lot easier to communicate with this API.
    Downloads: 0 This Week
  • 23
    A Java implementation of a desktop search engine based on Apache Lucene. It indexes HTML, XML, OpenOffice (Writer, Calc, Impress), MS Word, and PDF documents as well as plain text files. For other, arbitrary file types, the file name can be indexed.
    Downloads: 0 This Week
  • 24
    Mustru is a desktop Q&A search engine based on Lucene. You can search local filesystems using natural language questions or boolean queries. A list of answers or hits will be returned. A web-based interface is included.
    Downloads: 0 This Week
  • 25
    Sabuesonix is a desktop search engine. It can explore your PDF, TXT, and HTML files (and more in the future) and create an index for quick document search.
    Downloads: 0 This Week