Showing 121 open source projects for "linux file parser"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    Fingrams Generator

    Fingrams Generator

    Stand-alone software tool able to generate and visualize fingrams.

    Stand-alone software tool able to generate and visualize fingrams. It handles fuzzy systems designed by whatever fuzzy modeling tool. This tool takes as input a standard configuration file easy to be generated from a fuzzy system. Currently, Fingrams can be generated for classification, regression fuzzy rule based systems and fuzzy association rules. GUAJE [http://www.softcomputing.es/guaje], and KEEL [http://www.keel.es] are able to automatically produce the required input file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    DJVU++

    DJVU++

    The DjVu complete solution,with OCR Technology(Arabic ,English).

    DjVu++ is a user-friendly program that used to manipulate DjVu file formats such as eBooks with a penalty of editing features. The program introduce a free replacement for the property PDF format with similar resolution and smaller file size DjVu++ also support OCR to handle text in scanned books and images. The program shows good performance for English. In addition to the Arabic language to lead free and commercial software in this area. The main features of DjVu++ program are: o...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4

    SuRankCo

    Supervised Ranking of Contigs in de novo Assemblies

    SuRankCo is a machine learning based software to score and rank contigs from de novo assemblies of next generation sequencing data. It trains with alignments of contigs with known reference genomes and predicts scores and ranking for contigs which have no related reference genome yet. For more details about SuRankCo and its functioning, please see "SuRankCo: Supervised Ranking of Contigs in de novo Assemblies" Mathias Kuhring, Piotr Wojtek Dabrowski, Andreas Nitsche and Bernhard Y....
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    The software annotates text with 41 broad semantic categories (Wordnet supersenses) for both nouns and verbs; i.e., it performs both sense disambiguation and named-entity recognition. The tagger implements a discriminatively-trained Hidden Markov Model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    This software convert speech to text using Java and Android application. With this software you can also search for text in Google. You can use offline speech to text with this application if you don't have Internet, you can find the steps in guide file. How to use: ----------------- 1- Install a software to convert the PC as router (EX: My Wifi Router) then connect your mobile with PC via wifi. 2- Install Smart Text to Speech.apk file on your phone. 3- Open "Smart Speech to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    jaivox

    jaivox

    Speech recognition application builder and library

    Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    ANNFiD

    A forensic file identification tool using neural networks

    Just carved a bunch of bytes and have no idea what they could be? Maybe ANNFiD can help. ANNFiD uses neural network to identify byte patterns. It can be trained and has a GUI to help in the process. The tool is still on a very early stage, but could improve exponentially with the help of the developer community
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Flamingo Project

    Flamingo Project

    Workflow Designer, Hive Editor, Pig Editor, File System Browser

    Flamingo is a open-source Big Data Platform that combine a Ajax Rich Web Interface + Workflow Engine + Workflow Designer + MapReduce + Hive Editor + Pig Editor. 1. Easy Tool for big data 2. Use comfortable in Hadoop EcoSystem projects 3. Based GPL V3 License Supporting Pig IDE, Hive IDE, HDFS Browser, Scheduler, Hadoop Job Monitoring, Workflow Engine, Workflow Designer, MapReduce.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    The project consists of a library based on Java Specification Request 94, release 1.1, i.e. is a java rule engine. Please refer to "Java Rule Engine API - JSR-94" document (file jsr94_spec.pdf) included in JSR-94 distribution.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11

    Quadriga

    Quadruple Network Management System

    Quadriga is a web-application that acts as a clearing-house for text annotations -- in the form of contextualized triples, or “quadruples,” that form complex graphs -- generated with the Vogon desktop application (https://sourceforge.net/projects/gobtan/), and as an environment for managing text-annotation projects. It relies on a central authentication system for user authentication, a dictionary service (Wordpower) and an authority file service (Conceptpower). Quadriga can connect to a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    LPCforSOS is a machine learning framework with a special focus on structured output spaces and pairwise learning. It supports currently multiclass, ordinal, hierarchical, multi-label and label ranking classification settings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    NetKit-SRL, or NetKit for short, is an open-source Network Learning Toolkit for statistical relational learning. The toolkit provides functionalities not found in any existing open source projects and integrates with the WEKA machine learning toolkit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    gannu

    gannu

    Java API and tools for performing NLP and other AI tasks

    Java API and tools for performing a wide range of AI tasks such as: word sense disambiguation (released), optimization (5 Evolutionary Algorithms Implemented ETA February 2014), opinion mining (ETA November 2014) and text wikification (ETA July 2014). Gannu includes some graphical interfaces for scientific purposes. When using Gannu please cite: *Jiménez, F. V., Gelbukh, A. F. & Sidorov, G. (2013). Simple Window Selection Strategies for the Simplified Lesk Algorithm for Word Sense...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Unsupervised TXT classifier

    Unsupervised TXT classifier

    Classify any two TXT documents, no training required - JAVA

    This program is made to address two most common issues with the known classifying algorithms. First, over-training and second, shortage of data for a training of categories. Instead, each TXT file is a category on its own, rather than an assigned category. In a way, this is similar to clustering but not really a clustering algorithm since there is some training involved. The summarizer from Classifier4J has been adjusted to accept two inputs (lets call them A and B). Then, the summarizer...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    TF-IDF.jar is a Java Archive file to measure TF-IDF of each document in a document collection (corpus). The jar can be used to (a) get all the terms in the corpus (b) get the document frequency (DF) and inverse document frequency (IDF) of all the terms in the corpus (c) get the TF-IDF of each document in the corpus (d) get each term with their frequency (no. of presence), term frequency (TF) and TF-IDF in every document
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    MSTParser

    MSTParser is a non-projective dependency parser that searches for maxi

    MSTParser is a non-projective dependency parser that searches for maximum spanning trees over directed graphs. Models of dependency structure are based on large-margin discriminative training methods. Projective parsing is also supported. mstparser 0.5.1 is now available via Maven Central. If you use Maven as your build tool, then you can add it as a dependency in your pom.xml file: <dependency> <groupId>net.sourceforge.mstparser</groupId> <artifactId>mstparser</artifactId> <version>0.5.1</version> </dependency>
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DocCO

    DocCO

    Non-disjoint groupping of Documents based on word sequence approach

    This is a GUI for learning non disjoint groups of documents based on Weka machine learning framework. It offers the possibility to make non disjoint clustering of documents using both vectorial and sequential representation (word sequence approach based on WSK kernel). All data format supported by WEKA could be used in DocCO. Data could be loaded from files, from databases or from specified URL. All the preprocessing techniques implemented in WEKA could be used before performing the learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    File-em

    File-'em is an automatic receipts organizer implemented in Java & SWT.

    File-'em (pronounced like phylum) is an open source alternative to the software behind NeatReceipts?®. It allows you to load in scanned receipts and automatically pulls the information out of the receipt using OCR and stores it in a SQLite database for easy reference, reports, and retrieval.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Audivolv

    Audivolv

    Artificial intelligence evolves musical instruments played with mouse

    Artificial Intelligence (AI) evolves musical instruments you play with the mouse from small blocks of Java code that operate on numbers, each block designed to keep all numbers in range -1 to 1 if they started in that range, and combinations of these evolved codes running 44100 times per second for high quality 44.1 khz audio. You can also write the Java code into its window. Uses Javassist internal compiler on the new Java code assembled by renaming variables to overlap eachother and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The Deep Email Miner Application is a software solution for the multistaged analysis of an Email Corpus. Social network analysis and text mining techniques are connected to enable an in depth view into the underlying information. The self-executable Version 1.1 jar file will now run on Java 1.5 or higher. A Windows executable file of Version 1.1 is also provided in the Files section. Documentation can be found on the project homepage.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Nen

    Nen

    neural network implementation in java

    3-layer neural network for regression and classification with sigmoid activation function and command line interface similar to LibSVM. Quick Start: "java -jar nen.jar"
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    HSSVM(Hyper-Sphere Support Vector Machines) is a software for solving multi-classification problem, implemented by Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    WordNet Browser using SQL database and Hibernate as Object-Relational Mapping
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    This project aims to implement in java the following text mining techniques: Text Language Detection, Keywords and keyphrases extraction, Text Classification, Text Clustering, Single or multiple documents Summarization, Plagiarism Detection.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB