Showing 86 open source projects for "file analysis"

  • 1
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 0 This Week
  • 2
    Unsupervised TXT classifier

    Classify any two TXT documents, no training required - JAVA

    ...This extracts a relevant structure from both documents (which avoids over-training); the two structures are then compared using vector-space analysis to give a degree of belonging of one document to the other (which avoids a shortage of information). The method can also be used to create user-defined classes by merging texts of certain categories and then calculating the distances between documents, but this is not required.
    Downloads: 0 This Week
  • 3
    DocCO

    Non-disjoint grouping of documents based on a word sequence approach

    This is a GUI for learning non-disjoint groups of documents, built on the Weka machine learning framework. It supports non-disjoint clustering of documents using both vectorial and sequential representations (a word sequence approach based on the WSK kernel). All data formats supported by WEKA can be used in DocCO. Data can be loaded from files, from databases, or from a specified URL. All the preprocessing techniques implemented in WEKA can be applied before learning.
    Downloads: 0 This Week
  • 4

    OMR Reader

    OMR Sheets Data Retrieval Software

    OMR Reader is an application that reads data from scanned images of OMR sheets. You first create a template for a particular form and save it as a .omr file. To create a template, use a form with all marks filled in; the program detects the marks automatically. Then choose the marks that belong to a group and name the group. Do this for all the option groups on the form, and save the group information in a .omr file. Different forms can have different .omr files. These files can later be...
    Downloads: 6 This Week
  • 5
    A Java-based framework that determines file origins on a file system by accessing system and application logs and tracking the evidence trail through metadata.
    Downloads: 9 This Week
  • 6
    PIVOT

    PIVOT is a simple yet flexible data visualization tool

    PIVOT is a simple yet flexible visualization tool based on Circos (Krzywinski et al., 2009), which offers fast and aesthetic visualization of data and information. The Protein Interaction Visualization and Observation Tool (PIVOT) was developed specifically for visualizing protein interactions. It is difficult to spot which proteins interact when given a large list of proteins, but with PIVOT it is easy to identify them at a glance. PIVOT displays an image...
    Downloads: 0 This Week
  • 7

    MionChrom

    Signal processing for chromatography

    MionChrom interprets ASCII data from GC-C-IRMS systems, plots it, identifies peaks and auto-integrates the results. The user can also integrate peaks manually with different tools. The results are reported in *.XLS, *.TXT and *.PNG. Note: MionChrom is soon to be replaced by MionMass, which is not an open source project.
    Downloads: 0 This Week
  • 8
    BlogTEX is an ad hoc blog post extraction algorithm written in Java for the TREC Blog08 dataset. It includes an optimized sentence model for clearly identifying sentence boundaries in each blog post. Its output can be customized through its config file.
    Downloads: 0 This Week
  • 9

    Ocean Sync

    Hadoop Management System

    OceanSync is a Hadoop Management System that lets users control a variety of aspects of Hadoop. It includes a graphical user interface for performing HDFS maintenance tasks and submitting new jobs to the cluster. The OceanSync product sits on top of any Hadoop architecture.
    Downloads: 0 This Week
  • 10
    This software lets a user generate test data, which is useful for testing Hadoop or other data processing clusters.
    Downloads: 0 This Week
  • 11
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full-featured editor for the rule language and the build process for UIMA descriptors are complemented by components for visualization, explanation, testing and rule learning.
    Downloads: 0 This Week
  • 12
    MediaWikiRevisionsExtractor extracts the history of a particular wiki page, computes the modifications made between successive revisions, and finally stores the whole set of modifications in a file.
    Downloads: 0 This Week
  • 13
    Scanner based Bioparsers
    Parsers for biological data based on scanner generators such as Flex (C), Re2c (C), Jflex (Java) and Ifickle (Tcl). These scanner generators provide easier maintenance and development, and higher speed, than handwritten scanners. Scanner output is SQL.
    Downloads: 0 This Week
  • 14
    A regexp testing tool that applies a group of regexps to huge arrays of data (millions or so) in order to investigate the search or search-and-replace possibilities of the regexp group.
    Downloads: 0 This Week
  • 15
    tinyTiM is a small and lightweight implementation of the Topic Maps API (TMAPI). The engine is implemented in Java.
    Downloads: 0 This Week
  • 16
    Ex-Crawler
    Ex-Crawler is divided into three subprojects (crawler daemon, distributed GUI client, and web search engine), which together provide a flexible and powerful search engine supporting distributed computing. More information: http://ex-crawler.sourceforge.net
    Downloads: 0 This Week
  • 17
    Optex Analyzer is software for analyzing and comparing algorithms that approximately solve optimization problems. Its GUI lets the user select a set of input files containing raw algorithm results. The analysis is presented as tables and charts.
    Downloads: 0 This Week
  • 18
    D.U.C.K (Determine segmentation of Unknown words by using Context Knowledge) is an NLP tool that aims to find the correct segmentation for unknown words in written Hebrew. It uses statistics from different scopes to determine the segmentation.
    Downloads: 0 This Week
  • 19
    SPASE Model is a collection of tools for working with structured data model information. The tools can convert the relational version of the data model into various expressions, including XSD, XMI and PDF documentation.
    Downloads: 0 This Week
  • 20
    KADis is a software architecture reconstruction utility that utilizes the Knowledge Discovery Meta-Model of the OMG. It uses Source Navigator NG to parse the architecture and can save the architecture as an XML file.
    Downloads: 0 This Week
  • 21
    Monte Carlo portfolio simulation. It can be used as a stand-alone command-line application: it takes a simple XML file with the needed data as input and creates a simple XML file with the output. It also has JNI and ISAPI interfaces.
    Downloads: 0 This Week
  • 22
    A very simple Equity Investment Strategy Analyser tool. What it does: maintains stock portfolios; simulates investment using NSE historical bhav data files; analyzes different investment strategies against index performance; graphs the results.
    Downloads: 0 This Week
  • 23
    The purpose of eAudit is to provide a comprehensive reporting capability relating to Electronic Auditing for various industries.
    Downloads: 0 This Week
  • 24
    Crawl a set of files, accumulating information on the temporal and spatial extent of the data in each file, for later search and retrieval.
    Downloads: 0 This Week
  • 25
    The Vyper Dynamic Server (VDS) framework is a pure-Java library for building backend and server platforms. VDS is specifically tailored for financial applications and is suited to many high-throughput, asynchronous and distributed environments.
    Downloads: 0 This Week