Showing 35 open source projects for "text based"

View related business solutions
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 1
    pg_textsearch

    pg_textsearch

    PostgreSQL extension for BM25 relevance-ranked full-text search

    ...By embedding search capabilities within the database, it simplifies architecture and reduces operational complexity. The project is particularly useful for applications that require fast and accurate text retrieval. Overall, pg_textsearch extends PostgreSQL into a more powerful platform for text-based data exploration.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Laravel Scout

    Laravel Scout

    Provides a driver based solution to searching your Eloquent models

    Laravel Scout provides a simple, driver-based solution for adding full-text search to your Eloquent models. Once Scout is installed and configured, it will automatically sync your model changes to your search indexes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Apache Lucene

    Apache Lucene

    Apache Lucene open-source search software

    The Apache Lucene™ project develops open-source search software. The project releases a core search library, named Lucene™ core, as well as PyLucene, a Python binding for Lucene. Lucene Core is a Java library providing powerful indexing and search features, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. The PyLucene sub-project provides Python bindings for Lucene Core. The Apache Software Foundation provides support for the Apache community of...
    Downloads: 2 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Searchmonkey III

    Searchmonkey III

    Real-time file search powered by ripgrep. No indexing. No stale result

    Searchmonkey III is a modern desktop search tool for finding text in your files instantly. Originally used by thousands of Linux users, it has been rebuilt from the ground up for macOS, Windows, and Linux. Unlike traditional search tools, it performs real-time searches directly on disk — no indexing, no background services, and no stale results. Powered by ripgrep, it delivers fast, accurate results with support for regular expressions, file filtering, and live previews. Available...
    Downloads: 23 This Week
    Last Update:
    See Project
  • 7
    Searcher

    Searcher

    Search utility for files

    This utility is intended to help in search or grep for multiple content in multiple locations for multiple file types. For help/usage, see the wiki page: https://sourceforge.net/p/searcher/wiki/Home/
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 8

    Path Lister

    Get list of files and folders tree

    Folder digging tool to retrieve the list of files based on a search criteria. You can get on the file name, or full path. You can get the file count based on a file mask.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9

    Shapes

    Graphical programming. Includes n-dimensional sorting.

    Write programs as graphical dataflow charts instead of text. Compile them to any programming language you want. Besides this project includes the most efficient tree-based sorting algorithm that is possible. Originally developed on a CTOS Color NGEN, at first in Pascal, later ported to C, finally - 20 years later - ported to Linux. Currently it's still not really system independent. But it's intended that further releases will cure this.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Paul Graham GPT

    Paul Graham GPT

    RAG on Paul Graham's essays

    Paul Graham GPT is a specialized AI-powered search and chat app built on a corpus of essays from Paul Graham, giving users the ability to query and discuss his writings in a conversational way. The repo stores the full text of his essays (chunked), uses embeddings (e.g. via OpenAI embeddings) to allow semantic search over that corpus, and hosts a chat interface that combines retrieval results with LLM-based answering — enabling RAG (retrieval-augmented generation) over a fixed dataset. The app uses a Postgres database (with pgvector) hosted on Supabase for its embedding store, making the backend relatively simple and accessible, and the frontend is again built with Next.js/TypeScript for a modern responsive UI. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Locate32

    Locate32

    Locate32 finds files and directories based on file names.

    ...The use of these databases provides very fast searching speed. The software includes a dialog-based application as well as console programs that can be used to update and access the databases. Locate32 is not primarily meant to find text or data within files, but it has a primitive feature to do this.
    Leader badge
    Downloads: 26 This Week
    Last Update:
    See Project
  • 13
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 14
    recoll-firefox-extension

    recoll-firefox-extension

    Index the web pages you visit with the Recoll text search tool

    Please note that this extension is superceded by a new one based on the WebExtensions API for Firefox 57 and later: https://addons.mozilla.org/en-US/firefox/addon/recoll-we/ The source for the new version is here: https://opensourceprojects.eu/p/recollwe/code/ This Firefox extension allows you to include the web pages that you visit in Firefox in the index built by the Recoll text search tool.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    Personalized Search Engine

    Personalized Search Engine for Your Files

    MySearchEngine (Personalized Search Engine) is a Java software to search files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup so that users can choose which directories to index or remove from an existing index and it can also suggest queries just like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and make user experience more comfortable. eLibrary...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Infofuze

    Data migration/conversion library based on STX and XSLT transformation

    Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    snippetsource

    snippetsource

    Portable source code librarian

    Snippetsource is a simple yet powerful repository to store code snippets or any other text content. SQLite is used as the database backend which makes fast indexed lookups possible.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    trovi

    trovi

    Trovi is a text search tool for PDF files

    Trovi is a tool to search for text in multiple PDF files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Documents In My Pocket (dimp)
    A simple document management system (DMS). Documents and associated datas are stored in a single file. Keep all your documents in your pocket on your USB stick.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    It is a desktop search (aiming learning materials) tool that provides full text search with a friendly GUI
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A fast and efficient method to find all terms in tons of HTML-files by exact search, part-word-search and fuzzy search. Time and memory fall considerably below the requests of common search engines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    UProm for Agile Development
    UProm Requirement Management System (traceability and requirement coverage). Includes : - An admin module to manage SCRUM Project and tree of requirements - A multi-user Web-based client - Tools for databases management (HSQLDB, PostGreSQL,...) UProm
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Sgrep (sorted grep) is a much faster alternative to traditional Unix grep when searching large files, because sgrep searches sorted input files using a fast binary search to find matching lines.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    A Java implementation of a desktop search engine based on Apache Lucene. It indexes HTML-, XML-, OpenOffice- (Writer, Calc, Impress), MS Word-, and PDF- documents as well as plain text files. For other, arbitrary file types the file name can be indexed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Nucular Archiving System for creating full text indices for fielded data. Python API, web, and command line interfaces. Fast. Very light weight. Concurrent read/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB