Showing 30 open source projects for "text analysis"

View related business solutions
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Scanopy

    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless. Developers can compose custom pipelines...
    Downloads: 16 This Week
    Last Update:
    See Project
  • 2
    pprof

    pprof

    pprof is a tool for visualization and analysis of profiling data

    pprof is a profiling visualization and analysis tool that ingests profiles in the profile.proto format and generates human-readable and graph-based reports. It supports multiple profile types (CPU, heap, allocations, contention, etc.) and can present data as text tables, call graphs (via Graphviz/dot), flame graphs, and interactive web UIs. The tool helps developers find hot paths, quantify resource usage, and compare profiles across runs to validate performance changes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Elasticsearch

    Elasticsearch

    A Distributed RESTful Search Engine

    Elasticsearch is a distributed, RESTful search and analytics engine that lets you store, search and analyze with ease at scale. It lets you perform and combine many types of searches; it scales seamlessly, and offers answers incredibly fast with search results you can rank based on a variety of factors. Elasticsearch can be used for a wide variety of use cases, from maps and metrics to site search and workplace search, and with all data types.
    Downloads: 9 This Week
    Last Update:
    See Project
  • 4
    Obsidian Visual Skills Pack

    Obsidian Visual Skills Pack

    Generate Canvas, Excalidraw, and Mermaid diagrams from text

    LLM-TLDR is a Python-based tool designed to dramatically reduce the amount of code a large language model needs to read by extracting the essential structure and context from a codebase and presenting only the most relevant parts to the model. Traditional approaches often dump entire files into a model’s context, which quickly exceeds token limits; LLM-TLDR instead indexes project structure, traces dependencies, and summarizes code in a way that preserves semantic relevance while shrinking...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6

    Graph Plotter

    GP is a tool to analyse numerical data

    GP is aimed to be simple and fast tool to analyse numerical data. It does not claim to replace all other plotters.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    QUAST

    QUAST

    Quality Assessment Tool for Genome Assemblies

    ...QUAST computes several well-known metrics, including contig accuracy, the number of genes discovered, N50, and others, as well as introducing new ones, like NA50 (see details in the paper and manual). A comprehensive analysis results in summary tables (in plain text, tab-separated, and LaTeX formats) and colorful plots. The tool also produces web-based reports condensing all information in one easy-to-navigate file. QUAST and its three follow-up papers (MetaQUAST, Icarus, QUAST-LG) papers were published in Bioinformatics; the last paper (WebQUAST) is out in Nucl Acid Research.
    Leader badge
    Downloads: 51 This Week
    Last Update:
    See Project
  • 8
    PlateEditor

    PlateEditor

    PlateEditor, a free web application to work with multi-well plates

    PlateEditor is a free web application to work with multi-well plates, from creation of layouts to data visualization and aggregation. It is primarily designed as a tool to help scientists working in the field of biology to simplify and fasten the process of data analysis. It is fully client-side: files are never transferred to the server, ensuring complete security and privacy of research data. It doesn't require any installation. - PlateEditor is available at this...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Sweetviz

    Sweetviz

    Visualize and compare datasets, target values and associations

    Sweetviz is an open-source Python library that generates beautiful, high-density visualizations to kickstart EDA (Exploratory Data Analysis) with just two lines of code. Output is a fully self-contained HTML application. The system is built around quickly visualizing target values and comparing datasets. Its goal is to help quick analysis of target characteristics, training vs testing data, and other such data characterization tasks. Shows how a target value (e.g. "Survived" in the Titanic dataset) relates to other features. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 10
    SentimentAnalysis-Rick&Morty

    SentimentAnalysis-Rick&Morty

    Rick & Morty Sentiment Analysis - End-of-Degree Project - UNIR

    The remarkable progress in the field of Big Data has driven the development of new technologies in natural language processing and data analysis. Text mining is a fascinating application of data analysis that extracts relevant information from related writings in different linguistic contexts. And therefore, in natural language processing, sentiment analysis and classification stands out as a key application supported by text mining. Through the extraction of information from textual data, it becomes possible to identify and comprehend the sentiments and emotions conveyed. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    modnlp-plugins

    External plugins for modnlp/teccli

    This is a general project for modnlp/teccli plugins, with focus on text visualizaton.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DataMelt

    DataMelt

    Computation and Visualization environment

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, computational statistics, and data visualization. This Java multiplatform program is integrated with several scripting languages such as Jython (Python), Groovy, JRuby, BeanShell. DMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. Linear, non-linear...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    DynaQ

    DynaQ

    Innovative text document search. http://dynaq.opendfki.de for details.

    The goal of DynaQ is to develop an inquiry system to explore the personal information space, supporting you with the searching paradigm 'orienteering'. DynaQ is a (desktop)search engine with enhanced functionality for file, email and blog search. Look at our GitLab homepage for sourcecode and documentation: http://dynaq.opendfki.de
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Adele

    Adhoc Data Exploration - Live & Easy

    ...But Adele is designed for business people working with spreadsheet applications to analyse their data. There are many technical concepts in an easier way included. For example realtime OLAP, transformations, charts, analysis tools,... Connectors (e.g. JDBC, SAP ABAP, OData) can be used to pre-analyse the data and extract it without saving the data as text files. A plugin concept for enhancements are available. Enjoy! Its free for commercial use too. Adele runs without installation from USB stick for Windows, Linux and MacOSX. Last added changes: - data science tools (V1, IQR) - export to remote and desktop databases (mysql,sqlite, ms access) - internet features for emails and domains
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    quickplot

    quickplot

    interactive 2D plotter

    Quickplot is a fast interactive 2D plotter with infinite zooming, value picking, pipe input, and unlimited plots displayed. Quickplot is meant for looking at your data quickly and of secondary importance is making static pictures of your data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    circlize

    circlize

    Circular visualization in R

    circlize is an R package for creating circular visualizations (plots laid out in circular coordinate systems) in a very flexible way. It implements many types of plots using circular layouts: chord diagrams, circular heatmaps, arcs/links between sectors, genomic data visualization, etc. It provides low-level drawing functions as well as high-level functions to build complex visualizations. It’s often used in genomics, network analysis, or other fields where relationships among categories or...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    Processing of smallRNA-Seq data to gather biologically relevant information requires application of multiple statistical and bioinformatics tools from different sources, each focusing on a specific step of the analysis pipeline. The analytical workflow can be challenging for the continuous interventions by the operator, a critical factor when large numbers of datasets need to be analyzed at once. To allow a flexible and comprehensive analysis of smallRNA-Seq data we designed a novel modular pipeline, called iMir, integrating multiple open source modules and resource in an automated workflow, devising different statistical approaches to analyze data rigorously. iMir comprises also a Graphical User Interface (GUI), so that the pipeline is particularly suited for biologist and early stage bioinformaticians and produces both graphics and text outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ETU SQL for MySQL

    ETU SQL for MySQL

    Powerful tool for data analysis and editing

    Changelog (Version 7 - Update 1) New Discard Partialy Changes New Edit Row in Window New List of User Connections New Show Image on Form New Quickly View of Cell Values Download also: 1) ETU SQL for MS SQL : https://sourceforge.net/projects/etusqltango/ 2) ETU SQL for Oracle: https://sourceforge.net/projects/etusqlfororacle/ 3) ETU SQL for DB2: https://sourceforge.net/projects/etusqlfordb2/ 4) ETU SQL for PostgreSQL: https://sourceforge.net/projects/etusqlforpostegresql/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    VANTED
    VANTED - Visualization and Analysis of NeTworks containing Experimental Data At SourceForge the VANTED development history is preserved, only limited amount of development will proceed here. Please head on to the most recent developments, which can be observed at www.vanted.org.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    TeleScope

    TeleScope

    XML Data Stream Broker/Replicator

    TeleScope is the efficient intensive-load XML data stream broker, replicator and simple event processing platform (SEP) written in C for the Fedora 17-18, Slackware 13-14, Red Hat Enterprise Linux 6 (RHEL-6) Linux distributions. The platform is intended to be operated upon the single number/word values and is not meant to be deployed for full-text XML stream analysis. TeleScope has internal query language with a set of standard logical operators that allows to construct relatively complex query expressions. The platform features the pub-sub architecture and serves a set of simultaneously connected XML stream subscribers. The broker features Continuous Query engine over the XML stream. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    xSonify

    Java-based sonification data analysis

    xSonify is a Java-based sonification data analysis prototype for displaying science data as sounds, with an emphasis on supporting visually-impaired researchers. Alternatives include Sonification Sandbox <http://sonify.psych.gatech.edu/research/>, MathTrax and Earth Plus <http://prime.jsc.nasa.gov/>, Sonifyer <http://www.sonifyer.org> and SoniPy <http://sourceforge.net/projects/sonipy/>. More at the main forum for sonification, ICAD <http://icad.org>
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Framework for text mining, data integration and data analysis. Keywords: ontology and graph alignment, relation mining, warehouse, semantic database integration, bioinformatics, systems biology, microarray, Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    WaveSorter

    A powerful, versatile tool for offilne spike analysis and sorting

    WaveSorter emphasizes dynamic visualization and versatility. Slider controls let the user select any coefficient or sample from any of several transforms, which can then be plotted to either axis of a 2D histogram (scatterplot). Within the waveform space, cursor-based controls let the user select subregions of the waveform space or individual waveforms to view. The user may cluster waveforms manually or via one of several popular clustering programs. The classification along with waveform...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Plotmeister is a data exploration tool. It parses your ASCII data and generates a simple (text-based) table format. You can modify this table and eventually create nice looking figures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB