Showing 96 open source projects for "text analysis"

  • 1
    The NITE XML Toolkit supports the creation, analysis, and browsing of annotated multimodal, text, or spoken language corpora, and represents both timing and rich linguistic structure. It contains libraries for developers and some end user tools.
    Downloads: 1 This Week
  • 2
    A set of components for text mining in Java. The target audience is other text mining developers, who can use or extend these components.
    Downloads: 0 This Week
  • 3
    Ontea - Pattern-based Semantic Annotation Platform. Ontea searches for or creates semantic metadata from text or documents using pattern-based approaches. The implementation currently includes regular expression (regex) patterns
    Downloads: 0 This Week
  • 4
    A lyrical analysis and classification tool focused specifically on rhyming style in rap lyrics. Functions include phonetic transcription, rhyme visualization, and rapper classification.
    Downloads: 0 This Week
  • 5
    Contextor
    Contextor is a lightweight, simple-to-use Java-based library to help developers and researchers working with the general concept of a resource; for example, resources can be text resources, web resources, images, and videos.
    Downloads: 0 This Week
  • 6
    PubCurator is a biomedical text mining platform and validation helper built on top of Eclipse RCP.
    Downloads: 0 This Week
  • 7
    Optex Analyzer is software for analyzing and comparing algorithms that approximately solve optimization problems. Its GUI lets you select a set of input files containing raw algorithm results. The analysis is shown in tables and charts.
    Downloads: 0 This Week
  • 8
    This project is a compilation of tools and libraries, mainly in Java, to help with tasks related to text analytics. These tools range from simple wrappers to sophisticated mining tasks that can improve the productivity of researchers and engineers.
    Downloads: 0 This Week
  • 9
    A simple Java program that uses the PDFBox and Prefuse libraries to extract text from PDF files and display it as a graph. It can also link words based on their contexts.
    Downloads: 0 This Week
  • 10
    OpenDMAP (Open Source Direct Memory Access Parser) is a natural language processing (text mining) application: a semantic parser for information extraction.
    Downloads: 0 This Week
  • 11
    * Java classes for parsing text and converting it to XML or evaluating it in Java. The parser is controlled by a textual script whose syntax, named ZBNF, is close to Backus-Naur Form. * Some routines for conversion: C header or Java to XMI, XML documentation generation,
    Downloads: 0 This Week
  • 12
    DawNLITE is a Natural-Language-based Image Transmoding Engine. The software transforms an image to a video as recorded by a virtual camera panning and zooming over the image, following a natural language text description of the image.
    Downloads: 0 This Week
  • 13
    Provides a GUI for the grammatical structure and relations (as parsed by the Stanford Parser) of any text. Contains a grammatical relation editor to modify, import, and export grammatical relation definitions (tregex patterns and features).
    Downloads: 0 This Week
  • 14
    The Fiber project seeks to create a modular open source text mining tool that provides a contextual foundation for analysis in the dissemination of large quantities of text data.
    Downloads: 0 This Week
  • 15
    T-Rex (Trainable Relation Extraction) is a highly configurable, machine learning-based framework for information extraction from text, which includes tools for document classification, entity extraction, and relation extraction.
    Downloads: 0 This Week
  • 16
    The main purpose of AMATOOL is to create an application for semiautomatic markup of text using XML tags. Typical texts are archaeological reports or Middle Ages text scripts. It is a semiautomatic editor.
    Downloads: 0 This Week
  • 17
    GoldenOrb is a Java library, released under the Apache License V2.0, for correlation, summarization, and clustering of text information.
    Downloads: 0 This Week
  • 18
    iDocs is an intellectual document workflow project with text mining options.
    Downloads: 0 This Week
  • 19
    The Java Text Categorizing Library (JTCL) is a pure Java implementation of libTextCat, which in turn is "a library that was primarily developed for language guessing, a task on which it is known to perform with near-perfect accuracy."
    Downloads: 0 This Week
  • 20
    LACE stands for "Lucene Analyzer for CJK (Chinese/Japanese/Korean) & English". It is a simple tokenizer that can handle mixed English-CJK text. Chinese words are handled using a dictionary-based method.
    Downloads: 0 This Week
  • 21
    hypKNOWsys aims to develop a Java-based workbench for knowledge discovery and knowledge management. Currently, hypKNOWsys has released two intermediate tools: DIAsDEM Workbench (text mining for semantic tagging) and WUMprep (Web mining pre-processing)
    Downloads: 0 This Week
  • 22
    The UIMA Annotator (called BRUTUS - Business Rules from Unstructured Text and Unstructured Sources) is a component for the UIMA Framework that allows for capturing business knowledge formalized in Structured English syntax (based on OMG's SBVR) with MOF
    Downloads: 4 This Week
  • 23
    An approximate gazetteer for GATE (General Architecture for Text Engineering), based on Levenshtein distance. Strings can be matched and found even in texts with noise and errors. More info: http://bruno-wp.blogspot.com/search/label/Software
    Downloads: 0 This Week
  • 24
    Java Expert Rule Based Inference Language. Jerbil is an open source rule processing engine written in Java. Currently Jerbil supports a full set of processing functions with text-based and XML interfaces; a Java interface is planned.
    Downloads: 0 This Week
  • 25
    The Word Vector Tool is a simple but flexible Java library to create word vector representations of text documents. Word vectors can be used for various text processing tasks, such as text classification, text clustering, or information retrieval.
    Downloads: 1 This Week