Showing 39 open source projects for "document analysis"

View related business solutions
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 1
    elasticsearc-php

    elasticsearc-php

    PHP low-level client for Elasticsearch

    Introducing Elasticsearch DSL library to provide objective query builder for Elasticsearch bundle and elasticsearch-php client. You can easily build any Elasticsearch query and transform it to an array. This agnostic package is a lightweight wrapper on top of the Elasticsearch PHP client. Its main goal is to allow for easier structuring of queries and indices in your application. It does not want to hide or replace the functionality of the Elasticsearch PHP client. Feature complete, object...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    MCM-ICM

    MCM-ICM

    Mathematical Contest resources

    MCM-ICM is a curated archive of Outstanding Winner (“O-奖/特等奖”) solution papers from the Mathematical Contest in Modeling and the Interdisciplinary Contest in Modeling, spanning the early 2000s through recent years. The repository is organized by year, with per-year folders that collect the top-ranked reports and, in later years, additional materials such as problem statements or problem notes when available. It has evolved from a single-maintainer project into a collaborative effort, with...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Lattice

    Lattice

    Free and Open-Source Qualitative Data Analysis Software

    Lattice is a free and open-source Computer-Assisted Qualitative Data Analysis Software (CAQDAS). Native, lightweight, and offline-first, it supports document imports in plain text from various formats. Features include a six-level code hierarchy, five memo types, attribute-typed documents, precision retrieval, and an analysis workspace with Code Frequency, Co-occurrence, Crosstab, Coding Coverage, and Word Cloud.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 4
    PySchool

    PySchool

    Installable / Portable Python Distribution for Everyone.

    PySchool is a free and open-source Python distribution intended primarily for students who learn Python and data analysis, but it can also used by scientists, engineering, and data scientists. It includes more than 150 Python packages (full edition) including numpy, pandas, scipy, sympy, keras, scikit-learn, matplotlib, seaborn, beautifulsoup4...
    Leader badge
    Downloads: 115 This Week
    Last Update:
    See Project
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 5
    IQcad

    IQcad

    A 2D CAD application for civil engineering , built with Free Pascal

    ...Features: 25+ entity types (Line, Polyline, Arc, Circle, Spline, Hatch, Text, Dimensions, Table, and more) AutoCAD-style command line with object snaps and grip editing Modify tools: Move, Copy, Rotate, Scale, Mirror, Offset, Trim, Extend, Chamfer, Fillet Layers, linetypes, blocks, and configurable styles Civil Engineering: TIN surfaces with contours and spot elevations Survey points with CSV import and point groups Tunnel section lines with deviation analysis Horizontal alignments with curves and spirals LandXML import/export File Formats: DXF (R12/R2000/R2018), LandXML, PDF, PNG, SVG,DWG ( using ODA converter or Librecad dwg reader) WMS server support Point Cloud import export. Section along alignment Modern ribbon UI with dark theme, multi-document tabs, and map background layers.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 6
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    AI learning

    AI learning

    AiLearning, data analysis plus machine learning practice

    We actively respond to the Research Open Source Initiative (DOCX) . Open source today is not just open source, but datasets, models, tutorials, and experimental records. We are also exploring other categories of open source solutions and protocols. I hope you will understand this initiative, combine this initiative with your own interests, and do what you can. Everyone's tiny contributions, together, are the entire open source ecosystem. We are iBooker, a large open-source community,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    IOA AM code

    Implementation of the core routine for AM analysis from the IOA AMWG

    ...It should not be taken as agreement from the IOA or the AMWG that any results produced by this code are recommended or agreed. The example software does not represent an analysis method and requires correct inputs parameters and data, and interpretation of the results. All users should have a suitable understanding of the IOA AMWG document on which this code is based. No user support is offered, although feedback may may be directed to WTAMCONSULT (at) IOA.ORG.UK. No responses, however, can be guarantee
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Gamera is a framework for the creation of structured document analysis applications by domain experts. It combines a programming library with GUI tools for the training and interactive development of recognition systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10

    Lute Tablature Toolkit for Gamera

    Optical Music Recognition for Tablature Notations

    A toolkit for the optical recognition of 16th century lute tablature prints. It is based on and requires the Gamera document image analysis framework (http://gamera.sf.net).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    libcrn is document image processing library written in C++11 for Linux, Windows, Mac OsX and Google Android. It is a toolbox that allows to create easily software such as OCRs and layout analysis tools.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12

    jLDADMM

    A Java package for the LDA and DMM topic models

    The Java package jLDADMM is released to provide alternative choices for topic modeling on normal or short texts. It provides implementations of the Latent Dirichlet Allocation topic model and the one-topic-per-document Dirichlet Multinomial Mixture model (i.e. mixture of unigrams), using collapsed Gibbs sampling. In addition, jLDADMM supplies a document clustering evaluation to compare topic models. See the usage of jLDADMM in its website at http://jldadmm.sourceforge.net/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DCTFinder

    DCTFinder

    Extract title and creation time from web page.

    Web pages do not offer reliable metadata concerning their creation date and time. However, getting the document creation time is a necessary step for allowing to apply temporal normalization systems to web pages. DCTFinder is a system that parses a web page and extracts from its content the title and the creation date of this web page. DCTFinder combines heuristic title detection, supervised learning with Conditional Random Fields (CRFs) for document date extraction, and rule-based creation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    iMS2Flux
    iMS2Flux is a command line based high-throughput processing tool set for stable isotope labelled mass spectral data targeting metabolic flux analysis. To get started simply download and unzip the iMS2Flux.zip file and follow the getting started document for your OS. Current version 7.2.1 (last updated 9/30/2014) - Completes support and correction functionality for a new user specified generic data class. See the change log for full details.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging and metadata functions to provide new user experience of desktop navigation and document management.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    FALCON - Text Search Java Project

    FALCON - Text Search Java Project

    JSON based text search Java Project

    ----------------- - What is it? - ----------------- The "Falcon Search" is a JAVA API and tool to search inside the documents. It was originally started to search the content in pdf files under the project "HAWK Search". Searching with this tool is query-based not word-based as in most of the document search tools OR document readers. It also takes care of jumbling of words within query and spelling mistakes. Commonly used techniques in this project are Natural Language...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Bika Open Source LIMS

    Bika Open Source LIMS

    Web based Open Source laboratory information management system (LIMS)

    Modern Open Source LIMS (Laboratory Information Management System) · Professionally supported by experts The Bika code was migrated to https://github.com/bikalims Getting Started: https://github.com/bikalims/bika.lims/blob/main/README.md Modern Bika releases are built on the Senaite LIMS core, the LIMS that originated as a Bika fork. It is therefore as new and modern as Senaite, frequently upgraded and has many very useful add-ons. Bika expands on Senaite's lean design by adding...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Texalyzer

    Text analyzer

    Analyzes text document using TF-IDF and optionally stopword list, and extracts important keywords.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Unsupervised TXT classifier

    Unsupervised TXT classifier

    Classify any two TXT documents, no training required - JAVA

    ...In a way, this is similar to clustering but not really a clustering algorithm since there is some training involved. The summarizer from Classifier4J has been adjusted to accept two inputs (lets call them A and B). Then, the summarizer gets trained with A to summarize a document B, and vice versa. This extracts a relevant structure for both documents (and thus avoids the over-training) which are then compared using the Vector-Space analysis to give a range of belonging of one document to another (and thus avoids the shortage of information). This method can be used to create the user-defined classes by merging texts of certain categories and then to calculate the relevant distances between the documents, but this is not necessary.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    FluxY

    FluxY

    FluxY – a processing tool set for stable isotope label MS data

    FluxY is a command line based high-throughput processing tool set for stable isotope labelled mass spectral data used for metabolic flux analysis. To get started simply download and unzip the FluxY.zip file and follow the getting started document in the Instructions folder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A system to perform analysis of large documents for the purpose of cataloging similar documents. Similarity is based upon contextual analysis of these documents done by identifying common words and proper nouns.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    XmlView
    GUI utility in pure Java for viewing and editing XML content; example of application built with Superficial http://superficial.sourceforge.net
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    SIDoBI is an automatic summarization system for documents in Indonesian language. It is an acronym for Sistem Ikhtisar Dokumen untuk Bahasa Indonesia. SIDoBI is built based on MEAD, a public domain portable multi-document summarization system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    OpenSHORE is an XML based Semantic Document Repository (SDR) with a free definable meta model that builds up a semantic network from sections and relations in documents. The acronym SHORE means Semantic Hypertext Object Repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    LSGTL means LLX’s Static Graph Template Library which is a light-weighted header-only template library developed mainly for static graph analysis. LSGTL is expected to be used in laboratories for research purposes mostly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
Auth0 Logo