Showing 29 open source projects for "text based"

  • 1
    pg_textsearch

    PostgreSQL extension for BM25 relevance-ranked full-text search

    ...By embedding search capabilities within the database, it simplifies architecture and reduces operational complexity. The project is particularly useful for applications that require fast and accurate text retrieval. Overall, pg_textsearch extends PostgreSQL into a more powerful platform for text-based data exploration.
    Downloads: 1 This Week
  • 2
    Laravel Scout

    Provides a driver-based solution for searching your Eloquent models

    Laravel Scout provides a simple, driver-based solution for adding full-text search to your Eloquent models. Once Scout is installed and configured, it will automatically sync your model changes to your search indexes.
    Downloads: 0 This Week
  • 3
    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 8 This Week
  • 4
    Apache Lucene

    Apache Lucene open-source search software

    The Apache Lucene™ project develops open-source search software. The project releases a core search library, named Lucene™ core, as well as PyLucene, a Python binding for Lucene. Lucene Core is a Java library providing powerful indexing and search features, as well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. The PyLucene sub-project provides Python bindings for Lucene Core. The Apache Software Foundation provides support for the Apache community of...
    Downloads: 1 This Week
  • 5

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
  • 6
    Searchmonkey III

    Real-time file search powered by ripgrep. No indexing. No stale results.

    Searchmonkey III is a modern desktop search tool for finding text in your files instantly. Originally used by thousands of Linux users, it has been rebuilt from the ground up for macOS, Windows, and Linux. Unlike traditional search tools, it performs real-time searches directly on disk — no indexing, no background services, and no stale results. Powered by ripgrep, it delivers fast, accurate results with support for regular expressions, file filtering, and live previews. Available...
    Downloads: 25 This Week
  • 7

    Shapes

    Graphical programming. Includes n-dimensional sorting.

    Write programs as graphical dataflow charts instead of text, then compile them to any programming language you want. The project also includes a tree-based sorting algorithm that the author describes as the most efficient possible. Originally developed on a CTOS Color NGEN, first in Pascal and later ported to C, it was ported to Linux 20 years later. It is not yet fully system-independent, but later releases are intended to fix this.
    Downloads: 0 This Week
  • 8
    Paul Graham GPT

    RAG on Paul Graham's essays

    Paul Graham GPT is a specialized AI-powered search and chat app built on a corpus of Paul Graham's essays, letting users query and discuss his writings conversationally. The repo stores the full text of his essays (chunked), uses embeddings (e.g. via OpenAI embeddings) to allow semantic search over that corpus, and hosts a chat interface that combines retrieval results with LLM-based answering, enabling RAG (retrieval-augmented generation) over a fixed dataset. The app uses a Postgres database (with pgvector) hosted on Supabase as its embedding store, which keeps the backend relatively simple, and the frontend is built with Next.js/TypeScript. ...
    Downloads: 0 This Week
  • 9
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command-line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources in any combination and to any depth, for example text within a document within a zip archive. Here you...
    Downloads: 1 This Week
  • 10
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    OpenSearchServer is a powerful, enterprise-class search engine. Using the web user interface, the crawlers (web, file, database, etc.), and the client libraries (REST/API, Ruby, Rails, Node.js, PHP, Perl), you can quickly and easily integrate advanced full-text search capabilities into your application: full-text search with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on...
    Downloads: 17 This Week
  • 11
    recoll-firefox-extension

    Index the web pages you visit with the Recoll text search tool

    Please note that this extension is superseded by a new one based on the WebExtensions API for Firefox 57 and later: https://addons.mozilla.org/en-US/firefox/addon/recoll-we/ The source for the new version is here: https://opensourceprojects.eu/p/recollwe/code/ This Firefox extension allows you to include the web pages that you visit in Firefox in the index built by the Recoll text search tool.
    Downloads: 0 This Week
  • 12

    Personalized Search Engine

    Personalized Search Engine for Your Files

    MySearchEngine (Personalized Search Engine) is a Java program for searching files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup: users can choose which directories to index or remove from an existing index, and it can suggest queries, much like Google's "Did you mean" feature. Customizable indexing and query suggestion greatly improve search speed and make the user experience more comfortable. eLibrary...
    Downloads: 0 This Week
  • 13

    Infofuze

    Data migration/conversion library based on STX and XSLT transformation

    Infofuze is a Java library and server application that can be used to transform and combine data from various sources into a specific XML or other text output format that can be stored or indexed.
    Downloads: 0 This Week
  • 14
    snippetsource

    Portable source code librarian

    Snippetsource is a simple yet powerful repository to store code snippets or any other text content. SQLite is used as the database backend which makes fast indexed lookups possible.
    Downloads: 0 This Week
  • 15
    trovi

    Trovi is a text search tool for PDF files

    Trovi is a tool to search for text in multiple PDF files.
    Downloads: 0 This Week
  • 16
    Documents In My Pocket (dimp)
    A simple document management system (DMS). Documents and associated data are stored in a single file. Keep all your documents in your pocket on your USB stick.
    Downloads: 0 This Week
  • 17
    It is a desktop search tool (aimed at learning materials) that provides full-text search with a friendly GUI.
    Downloads: 0 This Week
  • 18
    UProm for Agile Development
    UProm Requirement Management System (traceability and requirement coverage). Includes: an admin module to manage Scrum projects and the tree of requirements; a multi-user web-based client; tools for database management (HSQLDB, PostgreSQL, ...).
    Downloads: 0 This Week
  • 19
    Sgrep (sorted grep) is a much faster alternative to traditional Unix grep when searching large files, because sgrep searches sorted input files using a fast binary search to find matching lines.
    Downloads: 2 This Week
  • 20
    A Java implementation of a desktop search engine based on Apache Lucene. It indexes HTML, XML, OpenOffice (Writer, Calc, Impress), MS Word, and PDF documents as well as plain text files. For other, arbitrary file types, the file name can be indexed.
    Downloads: 0 This Week
  • 21
    Nucular Archiving System for creating full text indices for fielded data. Python API, web, and command line interfaces. Fast. Very light weight. Concurrent read/writes with no possible locking issues. No server process. Proximity. Facets. Funny name.
    Downloads: 0 This Week
  • 22
    PyQFindReplace is a Python-based find-and-replace utility. It lets you select files and folders and replace text in several thousand files in one click. It has a pretty neat UI.
    Downloads: 0 This Week
  • 23
    Full text search engine - console tools and GUI frontends for users, program components and libraries for developers. Cross-platform, portable (Win32/64, .NET, Linux). Extensible architecture. Morphology of natural languages (English, Russian and French)
    Downloads: 0 This Week
  • 24

    DuMP3 - duplicate & similar file finder

    DuMP3 is a duplicate and similar file finder.

    DuMP3 is a duplicate and similar file finder. It finds exact duplicate binaries by hash, similar text files by substring content, images (JPG, BMP, GIF, PNG, etc) by color and audio files (MP3, WAV, OGG, etc) by wave data. Future: fonts, video.
    Downloads: 0 This Week
  • 25
    Kriterion is a document retrieval and categorization engine capable of full text searching. There is no need for keyword or context-based information.
    Downloads: 0 This Week
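
The Sgrep entry (item 19) attributes its speed to binary search over sorted input. A minimal Python sketch of that idea follows; it is not Sgrep's actual implementation. The function name `sorted_prefix_search`, the in-memory list of lines, and the choice to match the key as a line prefix are all assumptions made for illustration.

```python
from bisect import bisect_left


def sorted_prefix_search(lines, key):
    """Return every line that starts with `key`.

    Assumes `lines` is already sorted in ascending order (hypothetical
    in-memory stand-in for a sorted input file).
    """
    # Binary search for the first line that is >= key: O(log n) comparisons.
    index = bisect_left(lines, key)
    matches = []
    # In sorted input, all lines sharing a prefix are adjacent, so walk
    # forward only until the prefix stops matching.
    while index < len(lines) and lines[index].startswith(key):
        matches.append(lines[index])
        index += 1
    return matches


if __name__ == "__main__":
    data = sorted(["apple pie", "apricot", "banana", "band", "bandana", "cherry"])
    print(sorted_prefix_search(data, "ban"))  # ['banana', 'band', 'bandana']
```

Because matching lines sit next to each other in sorted input, locating the first match takes roughly log2(n) comparisons, whereas a linear scan has to read every line.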