Showing 35 open source projects for "flat file"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    TalkingBot

    TalkingBot

    A virtual parrot for Android platform that can listen and talk to you.

    A virtual personality for Android platform that can listen and talk to you (like a parrot) via the Google synthesizer. Idea born for fun and totally experimental, but feel free to modify it and, if you like, to help me improve it. This initial version only understands the Italian language! I'm sorry.. Please support this project by donating to this fundraiser: ------------------------------------------- Give birth to new travel...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ChoiceMaker
    Record matching software
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3

    SimpleAnnotator

    A simple tool to annotate a text.

    This tool allows the user to annotate by coloring portions of the text. This can be seen as a simple model of annotation. This tool has been built to complete particular experimentation on student behavior (annotation per example) facing particularly difficult content. We put it here as an open-source project.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    GDL Editor

    GDL Editor

    Computerized guideline editor for clinical decision support

    The GDL editor is multiplatform application that allows users to create, edit and run GDL files. GDL is a formal language designed to represent clinical knowledge for decision support. It is designed to be natural language- and reference terminology- agnostic by leveraging the designs of openEHR Reference Model and Archetype Model. The tool provides an editing and testing environment capable of generating forms based on the elements defined in the GDL. Requirements: Java Runtime...
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    The software annotates text with 41 broad semantic categories (Wordnet supersenses) for both nouns and verbs; i.e., it performs both sense disambiguation and named-entity recognition. The tagger implements a discriminatively-trained Hidden Markov Model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    LPCforSOS is a machine learning framework with a special focus on structured output spaces and pairwise learning. It supports currently multiclass, ordinal, hierarchical, multi-label and label ranking classification settings.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    NetKit-SRL, or NetKit for short, is an open-source Network Learning Toolkit for statistical relational learning. The toolkit provides functionalities not found in any existing open source projects and integrates with the WEKA machine learning toolkit.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    gannu

    gannu

    Java API and tools for performing NLP and other AI tasks

    Java API and tools for performing a wide range of AI tasks such as: word sense disambiguation (released), optimization (5 Evolutionary Algorithms Implemented ETA February 2014), opinion mining (ETA November 2014) and text wikification (ETA July 2014). Gannu includes some graphical interfaces for scientific purposes. When using Gannu please cite: *Jiménez, F. V., Gelbukh, A. F. & Sidorov, G. (2013). Simple Window Selection Strategies for the Simplified Lesk Algorithm for Word Sense...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    DocCO

    DocCO

    Non-disjoint groupping of Documents based on word sequence approach

    This is a GUI for learning non disjoint groups of documents based on Weka machine learning framework. It offers the possibility to make non disjoint clustering of documents using both vectorial and sequential representation (word sequence approach based on WSK kernel). All data format supported by WEKA could be used in DocCO. Data could be loaded from files, from databases or from specified URL. All the preprocessing techniques implemented in WEKA could be used before performing the learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    TextMarker
    TextMarker is now developed and hosted at Apache UIMA (http://uima.apache.org/textmarker.html). TextMarker is a UIMA-based tool for information extraction and more. The full featured editor of the rule language and the build process of UIMA descriptors are complemented with components for visualization, explanation, testing and rule learning.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    BorderFlow
    BorderFlow implements a general-purpose graph clustering algorithm. It maximizes the inner to outer flow ratio from the border of each cluster to the rest of the graph.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    D.U.C.K (Determine segmentation of Unknown words by using Context Knowledge)is an NLP tool, which aims to find the correct segmentation for unknown words in written Hebrew. Statistics from different scopes will be used to determine the segmentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    weka outlier is an implementation of outlier detection algorithms for WEKA. CODB (Class Outliers: Distance-Based) Algorithm is the first algorithm developed using WEKA framework.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Integrated Structure Evolution Environment
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    A modular language generator based on the theory of Functional Grammar (FG) by Simon C. Dik. Implemented using Java for the user interface, ANTLR for the input format parser and Prolog for the grammar and lexicon module, treating underlying linguistic st
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Java package to study a clustering model described in the paper \"Novel Clustering Algorithm Based Upon Games on Evolving Network\" by Q. Li, Z. Chen, Y. He and J-P. Jiang (in arxiv: http://arxiv.org/pdf/0812.5064v1), generalizations and similar issues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Vademecum is a Data Mining suite with a novel help function, semi-automated protocol, consequently modular construction that is also equipped with the most elaborated Data Mining methods.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    JWebPro: A Java tool that can interact with Google search and then process the returned Web documents in a couple of ways. The outputs can serve as inputs for NLP, IR, infor extraction, Web mining, online social network extraction/analysis applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Bios is a suite of syntactico-semantico analyzers that include the most common tools needed for the shallow analysis of English text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    JVnSegmenter is a Java-based and open-source Vietnamese word segmentation tool. The segmentation model was trained on about 8,000 sentences using Conditional Random Fields (FlexCRFs). This tool would be useful for Vietnamese NLP community.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    Qualiweb aims at providing semantic web metrics for modeling a website visitors needs according to a given taxonomy or document classification. Web metrics provided by Qualiweb give an indication of how successful each of the website topics have been.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    JTextPro: A Java-based Text Processing tool that includes sentence boundary detection (using maximum entropy classifier), word tokenization (following Penn conventions), part-of-speech tagging (using CRFTagger), and phrase chunking (using CRFChunker).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    CRFChunker: Conditional Random Fields Phrase Chunker (Phrase Chunking Tool) for English. The model was trained on sections 01..24 of WSJ corpus and using section 00 as the development test set (F1-score of 95.77). Chunking speed: 700 sentences/s
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    CRFTagger: Conditional Random Fields Part-of-Speech (POS) Tagger for English. The model was trained on sections 01..24 of WSJ corpus and using section 00 as the development test set (accuracy of 97.00%). Tagging speed: 500 sentences/s.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB