Showing 30 open source projects for "documents"

  • 1
    PageLM

    PageLM is a community-driven version of NotebookLM

    ...It supports uploaded documents including PDF, DOCX, Markdown, and TXT, allowing users to ground questions and generated materials in source content. On the technical side, it supports multiple model providers, multiple embedding back ends, WebSocket streaming for real-time generation, persistent content storage, and structured markdown outputs.
    Downloads: 0 This Week
  • 2
    Concordia

    Crowdsourcing platform for full text transcription and tagging

    Concordia is a platform for crowdsourcing transcription and tagging of text in digitized images. It was developed by the Library of Congress so that volunteers of all backgrounds could transcribe and tag digitized images of manuscripts and typed materials from the Library’s collections that optical character recognition could not otherwise handle.
    Downloads: 0 This Week
  • 3
    eBook

    LaTeX book with examples, open-source eBook

    ...Plus, it's #opensource and freely available, creating a collaborative learning environment. In this book I try to show how you can find the pieces of TeX code you need by looking only at existing work. If you want to create high-quality documents with LaTeX, check out the LaTeX book on GitHub, and feel free to ask any questions you have.
    Downloads: 1 This Week
  • 4

    Proteomics Documents

    Open-Access Documents delivered to be used by the Proteomics community

    Open-Access Documents delivered to be used by the Proteomics community: - MIAPE-SP v1: download it at https://sourceforge.net/projects/proteomicsdocs.lp-csic-uab.p/files/MIAPE_SP_v1.pdf/download
    Downloads: 0 This Week
  • 5
    Epub.js

    Enhanced eBooks in the browser

    Epub.js is an open-source JavaScript library that allows any web page to render EPUB documents on any device with a modern browser. Epub.js contains a flexible rendering engine and provides a simple interface for common ebook functions such as styling, persistence and pagination. We release and maintain Epub.js on GitHub, with a growing developer community. Just include a single JavaScript file and point it at a hosted EPUB.
    Downloads: 3 This Week
  • 6

    Edition Visualization Technology

    Digital edition visualization software

    EVT is a tool that creates web-based diplomatic and/or interpretative editions starting from XML TEI-encoded documents. The current release, EVT v. 1.3, builds upon the feature set of the previous releases and adds a number of new features: * added a new navigation bar; * added support for translations in multiple languages; * added support for VisColl style sheets; * added support for comment and critical <note>s; * added support for verse numbering and visualization; * many bugs fixed! ...
    Downloads: 10 This Week
  • 7

    Digital Preservation Recorder

    NO LONGER MAINTAINED

    NO LONGER MAINTAINED, NO LONGER SUPPORTED. The Digital Preservation Recorder (DPR) was developed by the National Archives of Australia to manage a digital preservation workflow. It features antivirus integration and uses the Xena framework for preservation conversions of data objects.
    Downloads: 0 This Week
  • 8

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    WebDjVuTextEd lets you edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...), create, delete and edit text nodes, adjust their container boxes with the mouse, and run a spellchecker. The program does not read DjVu files directly; it requires exported XML text data and images. When used without a web server, you can open and save local files, but cannot take advantage of auto-save and spell checking.
    Downloads: 2 This Week
  • 9
    FALCON - Text Search Java Project

    JSON based text search Java Project

    What is it? "Falcon Search" is a Java API and tool for searching inside documents. It was originally started to search the content of PDF files under the project "HAWK Search". Searching with this tool is query-based, not word-based as in most document search tools or document readers. It also handles jumbled word order within a query and spelling mistakes. Techniques commonly used in this project are Natural Language Processing, Information Extraction and Question-Answering Architecture. Latest version: details can be found on the project website, http://geekdadaji.com. Contact details: Creator: Swapnil A Jadhav (saj1919); email: dadajibudhau@gmail.com; website: http://geekdadaji.com; license: CC BY-NC 4.0.
    Downloads: 0 This Week
  • 10

    Language Constructor

    Complete tool for constructing/manipulating languages in digital form

    ...It allows free experimentation with all aspects of the language, so it does not have to be made consistent on paper first. You can edit script, syntax, grammar, morphology, lexicon and phonology, as well as write documents in the language, as it might be too complex to be handled by current font technology. The information is stored in XML format for easy integration with other software.
    Downloads: 0 This Week
  • 11

    Dependency Discovery Tool

    A tool for discovering external dependencies in Office files.

    The Dependency Discovery Tool searches through binary Office files (.doc, .xls and .ppt) and tries to find any documents or files that are linked to the document. It is written in Java, using the Apache POI libraries (http://poi.apache.org). This project was part of a summer scholarship from the School of Engineering and Computer Science at Victoria University, Wellington (http://ecs.vuw.ac.nz) in conjunction with Archives New Zealand (http://archives.govt.nz).
    Downloads: 0 This Week
  • 12
    Shark Virtual Library

    The best way to store and share files

    Shark Virtual Library (shark-vlibrary) lets you share educational, cultural and other types of documents (e-books, research, articles, sounds...) through a Java web application. It is specially designed for educational purposes and institutions. Three kinds of users can manage and use the documents, which are saved in different sections that you can add.
    Downloads: 0 This Week
  • 13
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
  • 14
    Generates a dynamic web application supporting CRUD actions from XML documents. Credits to Ministry of Culture and Communication, France; UNESCO; Ecole Nationale des Chartes, France; PASS-TECH, France.
    Downloads: 0 This Week
  • 15
    Online img2epub is a web application which creates epub documents from uploaded images.
    Downloads: 0 This Week
  • 16
    The Document Availability Information Ontology (DAIA) describes the current availability of documents in libraries and similar institutions. Availability can be expressed in terms of specific services. See http://purl.org/NET/DAIA for a specification.
    Downloads: 0 This Week
  • 17
    Ontea - Pattern-based Semantic Annotation Platform. Ontea searches for or creates semantic metadata from text or documents using pattern-based approaches. The implementation currently includes regular expression (regex) patterns.
    Downloads: 1 This Week
  • 18
    The METS API is a Java API designed to aid developers in the processing and assembly of METS Documents.
    Downloads: 0 This Week
  • 19
    Digital space for building and confronting interpretations about documents
    Downloads: 112 This Week
  • 20
    This project defines the Simple API for Binary REpresentations (SABRE) for processing hierarchically structured, binary-oriented documents, comparable to the Simple API for XML (SAX). The library is used, for example, in the Java ISO Image Creator (JIIC).
    Downloads: 1 This Week
  • 21
    KAPDF 1.0 was released. KAPDF is a Free Libre Open Source PHP class for generating PDF documents without requiring external extensions. KAPDF supports UTF-8, Unicode, RTL languages and (X)HTML. For more information visit http://www.kukaka.org
    Downloads: 0 This Week
  • 22
    Make AsciiDoc part of your literate programming tool set. With eWEB you can weave and tangle literate programs written as AsciiDoc documents, using embedded WEB code snippets.
    Downloads: 0 This Week
  • 23
    A small XML tool that filters XML documents according to an XPath expression. It supports only part of XPath; the hope is to support full XPath expressions and XQuery in the future.
    Downloads: 0 This Week
  • 24
    Apostil is Java annotation software. The core of Apostil allows end users to view their documents, annotate them and keep track of their annotations. Plugins provide more functionality.
    Downloads: 0 This Week
  • 25
    Bibwiki is a special page for MediaWiki to manage BibTeX bibliographies. With Bibwiki it's easy to import records from various sources, manage digital documents, export lists of references via BibTeX and create lists of references in wiki articles.
    Downloads: 0 This Week
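
Several entries above (for example, entry 23) describe filtering XML documents with a subset of XPath. As a rough illustration of that general idea only, and not of any listed project's code, the sketch below uses Python's standard `xml.etree.ElementTree`, which itself supports only part of XPath. The sample document and the `filter_xml` helper are hypothetical names introduced for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document, invented for this illustration.
SAMPLE = """<library>
  <doc lang="en"><title>Alpha</title></doc>
  <doc lang="fr"><title>Beta</title></doc>
  <doc lang="en"><title>Gamma</title></doc>
</library>"""


def filter_xml(xml_text, path):
    """Return the text of every element matching a limited XPath expression.

    ElementTree implements only a subset of XPath (child and descendant
    steps, attribute predicates, positions), so full XPath or XQuery
    expressions are not accepted here.
    """
    root = ET.fromstring(xml_text)
    return [element.text for element in root.findall(path)]


# Select the titles of documents whose lang attribute is "en".
english_titles = filter_xml(SAMPLE, "./doc[@lang='en']/title")
print(english_titles)
```

A tool aiming at full XPath or XQuery support would need a more complete engine than this standard-library subset.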