Showing 169 open source projects for "documents"

View related business solutions
  • Retool your internal operations Icon
    Retool your internal operations

    Generate secure, production-grade apps that connect to your business data. Not just prototypes, but tools your team can actually deploy.

    Build internal software that meets enterprise security standards without waiting on engineering resources. Retool connects to your databases, APIs, and data sources while maintaining the permissions and controls you need. Create custom dashboards, admin tools, and workflows from natural language prompts—all deployed in your cloud with security baked in. Stop duct-taping operations together, start building in Retool.
    Build an app in Retool
  • Find Hidden Risks in Windows Task Scheduler Icon
    Find Hidden Risks in Windows Task Scheduler

    Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

    Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.
    Download Free Tool
  • 1
    TestEl is a Java-based learning analyzer for HTML (and possibly other) structured documents. It can be trained to detect structures in such documents and renders hits in XML.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    The first Free Text-to-Speech (TTS) synthesis system in Malayalam based on festival speech synthesis system. It gives Orca - a free domain screen reading software, the additional functionality to read Unicode Malayalam documents and websites as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Vedvarsha is an application for 2 purposes: 1. Handwariting script recognition that extracts recognized letters into documents. 2. OCR (Optical Character Recogniton) that works only for non-cursive and isolated characters. It depends upon libsyntactic,
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    edocias

    Electronic Document Index And Search

    EDocIAS (Electronic Document Index And Search) is a PHP-based tool for indexing and searching files of various types. Third-party tools (tesseract, xpdf, etc.) can be configured to support any type of file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Atera all-in-one platform IT management software with AI agents Icon
    Atera all-in-one platform IT management software with AI agents

    Ideal for internal IT departments or managed service providers (MSPs)

    Atera’s AI agents don’t just assist, they act. From detection to resolution, they handle incidents and requests instantly, taking your IT management from automated to autonomous.
    Learn More
  • 5
    A graphical tool to discover topics from collections of text documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    This project aims to implement in java the following text mining techniques: Text Language Detection, Keywords and keyphrases extraction, Text Classification, Text Clustering, Single or multiple documents Summarization, Plagiarism Detection.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SpatialML is a markup language for representing spatial expressions in natural language documents. The goal is to allow for better integration of text collections with resources such as databases that provide spatial information about a domain.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Leark is a Data Mining library developed in C#.NET. It contains several methods for ranking web documents described with a set of normalized features, and a feature selection algorithm. The methods are based on perceptron and clustering.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Simple Document management software written in PHP for linux-operating-systems supporting Bank-Account-Linking and OCR
    Downloads: 0 This Week
    Last Update:
    See Project
  • Nonprofit Budgeting Software Icon
    Nonprofit Budgeting Software

    Martus Solutions provides seamless budgeting, reporting, and forecasting tools that integrate with accounting systems for real-time financial insights

    Martus' collaborative and easy-to-use budgeting and reporting platform will save you hundreds of hours each year. It's designed to make the entire budgeting process easier and create unlimited financial transparency.
    Learn More
  • 10
    ANts P2P
    ANts P2P realizes a third generation P2P net. It protects your privacy while you are connected and makes you not trackable, hiding your identity (ip) and crypting everything you are sending/receiving from others.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    Master's Thesis subject: "Knowledge repositories for effective and secure services executing in agent environment." Goal: Developing optimal method for storing knowledge in distributed agent applications. Java code + LaTeX documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    TableSeer is a tool that automatically identifies tables in digital documents and extracts the contents in the cells of the tables as well as table metadata
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    SecureJDMS is the attempt to develop a secure DMS, based on an RCP and (even untrusted) DB for data storage. For now, its all about managing scanned documents and searching them by content (using OCR). All data sent and stored will be strongly encrypted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TextMine is for the Perl hacker who is grappling with the problems of managing unstructured text from various sources. You can use these text mining tools to search the Web, index text, extract entities, categorize your e-mail, and summarize documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    JOFS is a java project concerning the develop of an owl ontology to store meta informations about files and documents. It will provide a core to easily manage the ontology and an architecture to easily add to the core functionalities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    OntoExtractor is a way of building ontologies that proceeds in a bottom-up fashion, defining concepts as clusters of concrete XML objects. From a set of XML documents the application generates a taxonomy. OntoExtractor has been developed so far by the Kn
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    The aim of MIEX (Metadata and Information Extractor from small XML documents) is to create a wrapper for the Stanford Parser, to extract and store metadata (syntactic structures, relationships among words...) from simple XML documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    JWebPro: A Java tool that can interact with Google search and then process the returned Web documents in a couple of ways. The outputs can serve as inputs for NLP, IR, infor extraction, Web mining, online social network extraction/analysis applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, as text classification, text clustering or information retrieval.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Evidence-based Guideline and Decision Support System. Provides patient specific point of care reminders in order to aid physicians provide high quality care. Input/output in the form of HL7 CDA Level 2 documents. Knowledge is encoded using Arden Syntax.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Akshara Malayalam OCR is a project for the development of an OCR for printed and handwritten documents in Malayalam language. The inspiration is from similar OCR softwares in other languages etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The Infomap NLP software performs automatic indexing of words and documents from free-text corpora, using a variant of LSA to enable information retrieval and other applications. It was developed by the Infomap Project at Stanford University's CSLI.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Maiden-core is a Linux kernel robotics API module, extensible in Erc. The Maiden Project also documents and develops the Maiden architecture for mobile robots, a set of (optional) designs and standards for robots using the Maiden-core API.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A collection of open source software and documents on machine perception and machine learning. Includes a state of the art face detector (MPISearch), video labeling tools (Score), and tutorials (Kolmogorov Tutorials).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Tifftool is a high-performance tool to clean scanned documents in preparation for onscreen display or for OCR. Features include skew correction, orientation correction, despeckle, page alignment, split pages and batch processing.
    Downloads: 0 This Week
    Last Update:
    See Project