Showing 154 open source projects for "text analysis"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    The Java Data Mining Package (JDMP) is a library that provides methods for analyzing data with the help of machine learning algorithms (e.g. clustering, classification, graphical models, neural networks, Bayesian networks, text processing, optimization).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Text Expander, Inverse summarizer

    Text Expander, Inverse summarizer

    Expand text, inverse summarizer

    IT WILL WORK WITH A JAVA DEVELOPMENT KIT 1.7 ONLY !!! This program is a data-miner and a knowledge-miner. It does exactly the opposite of what the text summarizers do. A text summarizer produces a shortened text given some text as an input. An inverse summarizer takes the shortened input, a similar or a same text and does the process in reverse. This results in an expanded text. It can be used with any text or notes that have the knowledge gaps. It is a great aid to any creative...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    Processing of smallRNA-Seq data to gather biologically relevant information requires application of multiple statistical and bioinformatics tools from different sources, each focusing on a specific step of the analysis pipeline. The analytical workflow can be challenging for the continuous interventions by the operator, a critical factor when large numbers of datasets need to be analyzed at once. To allow a flexible and comprehensive analysis of smallRNA-Seq data we designed a novel modular pipeline, called iMir, integrating multiple open source modules and resource in an automated workflow, devising different statistical approaches to analyze data rigorously. iMir comprises also a Graphical User Interface (GUI), so that the pipeline is particularly suited for biologist and early stage bioinformaticians and produces both graphics and text outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    VANTED
    VANTED - Visualization and Analysis of NeTworks containing Experimental Data At SourceForge the VANTED development history is preserved, only limited amount of development will proceed here. Please head on to the most recent developments, which can be observed at www.vanted.org.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    The basic function is to allow the user to take notes while watching a computer video file. The program combines a video playback window and a very simple text editor. Keyboard shortcuts can control playback, e.g. pause/resume, skip back, etc. MacOS Yosemite Problems - the latest version of MacOS has removed the QuickTimeForJava libraries on which this tool relies. If you copy the appropriate files from a Mavericks installation, it will work again. The files you will need to copy from...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    The software annotates text with 41 broad semantic categories (Wordnet supersenses) for both nouns and verbs; i.e., it performs both sense disambiguation and named-entity recognition. The tagger implements a discriminatively-trained Hidden Markov Model.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    Khawas

    An Arabic Corpora Processing Tool

    The new version is available at https://sourceforge.net/projects/ghawwasv4/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    SetFon focus is an interface web based for Praat resources (www.praat.org) wich focus speech sound annalysis; it is a gerent program for acoustic analysis PHP/Mysql based. Developed with the framework SIMP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 10
    SCAN
    SCAN (Smart Content Aggregation and Navigation) is a universal semantic content aggregator. It combines search, text analysis, tagging and metadata functions to provide new user experience of desktop navigation and document management.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 11
    The BioNLP UIMA Component Repository provides UIMA wrappers for novel and well-known 3rd-party NLP tools used in biomedical text prosessing, such as tokenizers, parsers, named entity taggers, and tools for evaluation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Twiline

    Transcribe audio files to written text

    Simple program to assist transcriptions from recorded interviews. Resembles somewhat the early F4. <br/>Keywords: transcription transcribe interview
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    FALCON - Text Search Java Project

    FALCON - Text Search Java Project

    JSON based text search Java Project

    ----------------- - What is it? - ----------------- The "Falcon Search" is a JAVA API and tool to search inside the documents. It was originally started to search the content in pdf files under the project "HAWK Search". Searching with this tool is query-based not word-based as in most of the document search tools OR document readers. It also takes care of jumbling of words within query and spelling mistakes. Commonly used techniques in this project are Natural Language...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    xSonify

    Java-based sonification data analysis

    xSonify is a Java-based sonification data analysis prototype for displaying science data as sounds, with an emphasis on supporting visually-impaired researchers. Alternatives include Sonification Sandbox <http://sonify.psych.gatech.edu/research/>, MathTrax and Earth Plus <http://prime.jsc.nasa.gov/>, Sonifyer <http://www.sonifyer.org> and SoniPy <http://sourceforge.net/projects/sonipy/>. More at the main forum for sonification, ICAD <http://icad.org>
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    HAWK - PDF Text Search Java Project

    HAWK - PDF Text Search Java Project

    No more support for this project - TAKE A LOOK AT FALCONSEARCH

    No more support for this project - TAKE A LOOK AT FALCONSEARCH "https://sourceforge.net/projects/falcontextsearch/"
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    Texalyzer

    Text analyzer

    Analyzes text document using TF-IDF and optionally stopword list, and extracts important keywords.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    Fast Matrix for Java

    General purpose matrix utilities for Java in Parallel Computing

    Fast Matrix for Java (fm4j) is a general-purpose matrix utility library for computing with dense matrices. fm4j encapsulated different underlying implementations and select the optimal one in run-time depending on the size of the input matrix. Moreover, fm4j employs Java (Tm) Concurrency to take advantage of the computation power of multi-cor processors.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    TextProcessor

    A Java package to preprocess text datasets for posterior text analysis

    ...The toolkit is also being extended for more advanced text analysis tasks based on natural language processing techniques.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    BioLemmatizer

    Lemmatization tool for morphological analysis of biomedical literature

    ...If you use the BioLemmatizer to support academic research, please cite the following paper: Haibin Liu, Tom Christiansen, William A Baumgartner Jr, and Karin Verspoor BioLemmatizer: a lemmatization tool for morphological processing of biomedical text Journal of Biomedical Semantics 2012, 3:3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    BioDare

    BioDare is Biological Data Repository focused on timeseries data

    BioDare (Biological Data Repository) was developed under the multi-site ROBuST project (http://hallidaylab.bio.ed.ac.uk/ROBuST.html) to support data exchange inside the project. It is a web application which allows data-sharing (including public dissemination), data-processing and analysis, with the main focus on time-series data produced in circadian experiments. The main features of BioDare are: - an online repository for experimental data accompanied by extensive metadata - generation of secondary data (normalized, detrended, averaged …) - graphical output of data, secondary data and rhythm analysis - simple text-based search throughout metadata - biology- and conditions-aware search for data - data aggregation and export - group-based privacy settings for collaborative research
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Voikko

    Voikko

    Library of linguistic tools

    Voikko is a spell checking, grammar checking, morphological analysis and hyphenation system. Spell checkers are available for multiple languages, other features for Finnish only.
    Downloads: 11 This Week
    Last Update:
    See Project
  • 23
    Anteater

    Anteater

    Annotation Tool to Extract Endangered Animals from Text Resources

    The goal of this project is the extraction the information listed below from texts downloaded from the Federal Register (https://www.federalregister.gov). The texts are mainly applications for permits, notices about given permits, etc. This software tool is developed by the Max Planck Institute for the History of Science (http://www.mpiwg-berlin.mpg.de) in collaboration with Dirk Wintergrün and Etienne Benson.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    DiffReport

    DiffReport

    Code Difference report

    Often I have seen some Huge Maintenance Projects it is always very difficult to track the incremental files for each release and If we want to do that we need to checkout both the branches and use some UI based tool to get the diff of the files finally we end up waiting in front of the PC for a long time and do this job. In many cases we spend more than 2 hrs/day. The time increases if there are more such parallel releases and at the end of the day 1 developer does it as full time job and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A simple java library for text and object oriented code. Among the different available packages, there are for text analysis (levenshtein and ngram fingerprinting), a grammar framework, simple object persistence (very light and dependence free), ...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB