Showing 173 open source projects for "batch text processing"

View related business solutions
  • $300 in Free Credit Across 150+ Cloud Services Icon
    $300 in Free Credit Across 150+ Cloud Services

    VMs, containers, AI, databases, storage | build anything. No commitment to start.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale with Google Cloud.
    Start Building Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • 1
    gannu

    gannu

    Java API and tools for performing NLP and other AI tasks

    Java API and tools for performing a wide range of AI tasks such as: word sense disambiguation (released), optimization (5 Evolutionary Algorithms Implemented ETA February 2014), opinion mining (ETA November 2014) and text wikification (ETA July 2014). Gannu includes some graphical interfaces for scientific purposes. When using Gannu please cite: *Jiménez, F. V., Gelbukh, A. F. & Sidorov, G. (2013). Simple Window Selection Strategies for the Simplified Lesk Algorithm for Word Sense...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    WM Hyperintensities Segmentation Toolbox

    WM Hyperintensities Segmentation Toolbox

    Open Source White Matter Hyperintensities Segmentation Toolbox

    Wisconsin White Matter Hyperintensity Segmentation [W2MHS] and Quantification Toolbox is an open source MatLab toolbox designed for detecting and quantifying White Matter Hyperintensities (WMH) in Alzheimer’s and aging related neurological disorders. WMHs arise as bright regions on T2- weighted FLAIR images. They reflect comorbid neural injury or cerebral vascular disease burden. Their precise detection is of interest in Alzheimer’s disease (AD) with regard to its prognosis. Our toolbox...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    BioLemmatizer

    Lemmatization tool for morphological analysis of biomedical literature

    ...If you use the BioLemmatizer to support academic research, please cite the following paper: Haibin Liu, Tom Christiansen, William A Baumgartner Jr, and Karin Verspoor BioLemmatizer: a lemmatization tool for morphological processing of biomedical text Journal of Biomedical Semantics 2012, 3:3.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    Java application for training and deploying text processing applications such as part-of-speech taggers, based on a re-implementation of Brill's algorithm in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 5
    LinqYedict

    LinqYedict

    Translate Chinese to English

    Translate Chinese to English using CEDICT (cantonese dictionary). Demonstrate the speed of C# and Linq. Copy the chinese text from any browser/application to Windows clipboard and see the translation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6

    BioDare

    BioDare is Biological Data Repository focused on timeseries data

    BioDare (Biological Data Repository) was developed under the multi-site ROBuST project (http://hallidaylab.bio.ed.ac.uk/ROBuST.html) to support data exchange inside the project. It is a web application which allows data-sharing (including public dissemination), data-processing and analysis, with the main focus on time-series data produced in circadian experiments. The main features of BioDare are: - an online repository for experimental data accompanied by extensive metadata - generation of secondary data (normalized, detrended, averaged …) - graphical output of data, secondary data and rhythm analysis - simple text-based search throughout metadata - biology- and conditions-aware search for data - data aggregation and export - group-based privacy settings for collaborative research
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    fit2dcorr

    SAXS 2D > 1D data reduction software, wrapper for Fit2D

    ...Please use the repository at https://github.com/Niels-Bohr-Institute-XNS-StructBiophys/fit2dcorr for the new updated versions. The old version is kept at sourceforge. Keywords: fit2d SAXS azimuthal averaging batch processing absolute units (exposure time, transmission, sample thickness) error bars C++ Unix OpenMP
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    latexdiff is a Perl script, which compares two latex files and marks up significant differences between them (i.e. a diff for latex files). Various options are available for visual markup using standard latex packages such as "color.sty".
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9

    OMR Reader

    OMR Sheets Data Retrieval Software

    ...Do this for all the option groups on the form. Now save the info of these groups in a .omr file. For different forms, you can have different .omr files. These files can later be used for batch processing the scanned images of OMR sheets. Please rate...
    Downloads: 7 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • 10

    WaveSorter

    A powerful, versatile tool for offilne spike analysis and sorting

    ...It supports a wide array of binary file formats as well as ASCII text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    ASTL Automata Standard Template Library (Vincent Le Maout - Dominique Revuz) is a set of generic and efficient C++ components for automata manipulation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12

    Dvipdfm tool for SCons

    SCons tool to cooperate with dvipdfm program

    SCons is a make replacement providing a range of enhanced features such as automated dependency generation and built in compilation cache support. SCons rule sets are Python scripts so as well as the features it provides itself SCons allows you to use the full power of Python to control compilation. This is a SCons extension (tool) which enables usage of the dvipdfm program to convert dvi files to pdf.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    fluids-sh

    acquire fluid properties from the NIST Chemistry WebBook

    fluid.sh is a shell script which utilized wget to acquire fluid properties from the NIST Chemistry WebBook in a format suitable for further processing with shell scripts or e.g. xmgrace. It supports the full functionality provided by the website! The script takes the same input as command line arguments you need to enter on the web forms. It produces a ASCII text file containing the respective data points in columns headed by a well readable description. The advantage is that you do not need to "click" through three web pages and export the result - you can do it with one command in the shell! ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Seqshell

    Seqshell

    A JAVA GUI for performing the function of Tophat and Cuffdiff

    Combines the Tophat and Cuffdiff functions in one GUI interface. tophat and cuffdiff are required to be pre-installed in the system. By modifying the program, it can be used to execute any command line programs even R packages since R can also be run from commandlines. New functions: Batch processing function for Tophat. You can now execute as many mapping jobs as you want with tophat. This program will save the output into separate folders. An alert email will be sent to your email address when the job is done. (You will need to modify the source code to change the content to meet your special needs) Run-time information will be displayed in a JAVA output window.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    SciData

    SciData

    Data Management for Matlab, MathCAD, and SciLab

    Data Management for Matlab, MathCAD, and SciLab: - Automatically import/export data to/from Matlab/MathCAD/Scilab - Easily separate data and analysis - Batch processing of datasets - Data filtering - Easily manage metadata for all your data files - Built in Data Acquisition (DAQ) - Implements Microsoft Scientific Dataset so data files are stored in a standard format
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Transliterator between any Language files - Map Fonts, Create Encoding Scheme, Input Phonetic, Indian, Roman, Tamil, Hindi, English, French, German, Spanish or Any World Language Keyboard. Ex: [Phonetic Input]-[Any World Language Output] or ViceVersa.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Redundancy due to cut-paste operations in text creates bias in machine learning for NLP. This module takes a directory and produces a subset of the files in that directory (in a list) with an upper bound on similarity between two files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PTools is a set of useful tools written in Pascal. It includes: scientific calculator, archiver, text editor, remote adminitration and more. It is designed to be portable across operating systems, specially Java-based mobiles, Windows and Unixes.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    A MATLAB package for wavelet analysis of circadian rhythms with both discrete (Daubechies) and continuous (Morlet) wavelets, as well as tools for batch processing of multiple time series, all accessible through a graphical user interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This is a library to extract raw unicode text from any written documents (office documents such as PDF, Word, OpenOffice, ...). It should be useful to developpers of search engine, text processing, corpus analysis, ....
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    Ub3rMath

    Ub3rMath

    Simple math parsing library for C++

    A math parsing library for C++ with a number of powerful features to allow flexible interpretation of mathematical formula in text form.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Mavscript ermöglicht es in einem Textdokument Berechnungen durchzuführen. Die eigentliche Berechnung verarbeitet das Algebraprogramm Yacas (oder der Java-Interpreter BeanShell).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This project aims to be a dictionary manager for EDICT and CCEDICT like dictionaries, using GTK and Qt as GUIs (also looking for comandline operations).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB