Showing 22 open source projects for "document"

  • 1
    Sonic

    Fast, lightweight & schema-less search backend

    Sonic is a super fast and lightweight, schema-less search backend that can be used in place of super-heavy and full-featured search backends like Elasticsearch. It is able to normalize language search queries, auto-complete search queries and offer the most relevant results. Being an identifier index rather than a document index, when queried it provides IDs that can be used to refer to matched documents in an external database.
    Downloads: 5 This Week
  • 2

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
  • 3
    DocumentGrep

    Search text or a regular expression in multiple documents

    ...You can use regular expressions for the search (https://en.wikipedia.org/wiki/Regular_expression). This GUI and the command line tools work without indexing. Either the document is converted into text and processed by the RegExpr library of Andrey V. Sorokin, or it is handled by the CLI command itself (like pdfgrep). This GUI works well when searching several hundred documents, depending on the speed of your system and the length of the documents.
    Downloads: 7 This Week
  • 4
    PdfgrepGui

    This is a simple GUI for the command line tools grep and pdfgrep

    THIS PROJECT HAS MOVED TO: https://sourceforge.net/projects/documentgrep/ This program is a GUI for the command line tools grep and pdfgrep. pdfgrep searches text in multiple PDF files, and grep can search text in multiple text files. You can use regular expressions for the search (https://en.wikipedia.org/wiki/Regular_expression). This GUI and the command line tools work without indexing. The following options are used: -i (ignore case) and -F (fixed strings), -n (Print page number or...
    Downloads: 8 This Week
  • 5
    TextSeek

    Professional full-text desktop search tool

    TextSeek is a professional full-text desktop search tool. Unlike filename search tools such as Everything and Listary, TextSeek can search both filenames and file content easily and quickly. It supports PDF, Word, Excel, PowerPoint, RTF and other formats. The software runs directly; no extra packages need to be installed.
    Downloads: 17 This Week
  • 6
    Paul Graham GPT

    RAG on Paul Graham's essays

    Paul Graham GPT is a specialized AI-powered search and chat app built on a corpus of essays from Paul Graham, giving users the ability to query and discuss his writings in a conversational way. The repo stores the full text of his essays (chunked), uses embeddings (e.g. via OpenAI embeddings) to allow semantic search over that corpus, and hosts a chat interface that combines retrieval results with LLM-based answering — enabling RAG (retrieval-augmented generation) over a fixed dataset. The...
    Downloads: 0 This Week
  • 7
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult-to-access resources. It is a command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image metadata, scanned documents, Maven dependencies and web resources. CRGREP will search resources within resources in any combination and to any depth, for example text within a document within a zip archive, and so on. ...
    Downloads: 3 This Week
  • 8
    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system for exploring your personal information space, supporting you with the 'orienteering' search paradigm. DynaQ is a (desktop) search engine with enhanced functionality for file, email and blog search. See our GitLab homepage for source code and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
  • 9
    OpenSearchServer Search Engine

    An open source search engine with RESTful API and crawlers

    ...Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API, Ruby, Rails, Node.js, PHP, Perl), you will be able to quickly and easily integrate advanced full-text search capabilities into your application: full-text search with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
    Downloads: 12 This Week
  • 10
    Recent Document Tracker (RecDocTracker)

    Track and Search your recent documents and folders, customizable

    Recent Document Tracker implements a function similar to Windows 7 Jumplists by tracking recent documents and folders. Using document type icons in the taskbar tray, you can pop up a menu of recent documents sorted by last access time (like Jumplists), allowing you to quickly find the files and folders you need. You can also pin or exclude documents, as well as see which documents are open in MS Office applications.
    Downloads: 0 This Week
  • 11
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging and metadata functions to provide a new user experience of desktop navigation and document management.
    Downloads: 6 This Week
  • 12
    Wukong

    Highly customizable full-text search engine

    ...Supports Chinese word segmentation (concurrent word segmentation using the sego word segmentation package, speed 27MB/sec). Supports calculating the proximity distance of keywords in the text (token proximity). When a request to add a document to the index comes in, the main coroutine sends the text to be segmented to a word segmentation coroutine through a channel, and that coroutine segments the text and sends it to an indexer coroutine through another channel. The indexer coroutine builds an inverted index from the search keyword to the document, and the inverted index table is stored in memory for quick recall. ...
    Downloads: 0 This Week
  • 13
    Integra desktop search engine: a cross-platform, portable, embeddable, extensible search engine and search tools (both console and GUI) for developers and end users. Full Unicode support, easy UI localisation, plugins for document parsers and unpackers.
    Downloads: 0 This Week
  • 14
    Documents In My Pocket (dimp)
    A simple document management system (DMS). Documents and associated data are stored in a single file. Keep all your documents in your pocket on your USB stick.
    Downloads: 0 This Week
  • 15
    Quotero

    Quotero becomes Kimios

    Quotero was an open source Document Management System (DMS) developed in Java. It provided basic document management features and advanced collaborative features such as version control, comments, workflow, etc. Quotero has become Kimios. Please visit https://sourceforge.net/projects/kimios/ Web Site: http://www.kimios.com Issues: http://issues.kimios.com Wiki: http://wiki.kimios.com
    Downloads: 0 This Week
  • 16
    Document summarization system. Once document content has been added to the system, user queries generate a summary document containing the information available to the system.
    Downloads: 0 This Week
  • 17
    UProm for Agile Development
    UProm Requirement Management System (traceability and requirement coverage). Includes: an admin module to manage Scrum projects and trees of requirements; a multi-user web-based client; tools for database management (HSQLDB, PostgreSQL, ...).
    Downloads: 0 This Week
  • 18
    UltimateLauncher is an application and document launcher for Windows systems.
    Downloads: 0 This Week
  • 19
    BS File Explorer is a tabbed file explorer for Windows that uses the .NET Framework. Features include built-in media players, document editors, file type management, tabbed browsing, and a very quick and efficient file search feature.
    Downloads: 0 This Week
  • 20
    PDFcat - Portable Document (PDF) Catalog Manager
    Downloads: 0 This Week
  • 21
    Kriterion is a document retrieval and categorization engine capable of full text searching. There is no need for keyword or context-based information.
    Downloads: 0 This Week
  • 22
    GLScube is an Augmented File System designed to offer a rich interface to users' data and to shift both end users and developers from thinking about where a document is to what a document is about.
    Downloads: 0 This Week