Showing 128 open source projects for "batch text processing"

View related business solutions
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • $300 in Free Credit Across 150+ Cloud Services Icon
    $300 in Free Credit Across 150+ Cloud Services

    VMs, containers, AI, databases, storage | build anything. No commitment to start.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale with Google Cloud.
    Start Building Free
  • 1
    pyVideoTrans

    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    ...The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 2
    Dawarich

    Dawarich

    Self-hostable alternative to Google Timeline

    Dawarich is a command-line tool (likely Ruby-based) for transforming and analyzing Arabic text data with normalization, diacritic handling, segmentation, and morphological tokenization. Designed for text mining and NLP workflows in Arabic-language contexts.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    TorchQuantum

    TorchQuantum

    A PyTorch-based framework for Quantum Classical Simulation

    ...Researchers on quantum algorithm design, parameterized quantum circuit training, quantum optimal control, quantum machine learning, and quantum neural networks. Dynamic computation graph, automatic gradient computation, fast GPU support, batch model terrorized processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    PyMca
    Stand-alone application and Python tools for interactive and/or batch processing analysis of X-Ray Fluorescence Spectra. Graphical user interface (GUI) and batch processing capabilities provided.
    Leader badge
    Downloads: 212 This Week
    Last Update:
    See Project
  • Push Code. Get a Production URL. Done. Icon
    Push Code. Get a Production URL. Done.

    Cloud Run deploys any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try Cloud Run Free
  • 5
    go1pylib

    go1pylib

    go1pylib is a Python library designed to control the Go1 robot

    go1pylib is a Python library designed to control the Go1 robot by Unitree Robotics. It provides an easy-to-use interface for robot movement, state management, collision avoidance, battery monitoring, and MQTT communication. Ideal for research and robotics development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Bowtie, an ultrafast, memory-efficient short read aligner for short DNA sequences (reads) from next-gen sequencers. Please cite: Langmead B, et al. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10:R25.
    Leader badge
    Downloads: 438 This Week
    Last Update:
    See Project
  • 7
    This program makes PNG and/or GIF graphics from DVI files. It produces high-quality images while its internals are tuned for speed. It supports PK, VF, PostScript and TrueType fonts, color, PostScript inclusion, bitmap inclusion, and has CJK support
    Leader badge
    Downloads: 69 This Week
    Last Update:
    See Project
  • 8
    GATE
    NOTE THAT THE SOURCE CODE AND ISSUE TRACKER HAVE NOW MOVED TO GITHUB. FIND US AT https://github.com/GateNLP/ GATE (General Architecture for Text Engineering) is an architecture, framework and development environment for developing, evaluating and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    Hydrogen Bond Analysis Tool (HBAT)

    Hydrogen Bond Analysis Tool (HBAT)

    HBAT 2 is migrated from PERL to Python.

    HBAT(Hydrogen Bond Analysis Tool) is a tool to automate the analysis of hydrogen bonds present in a PDB Structure file. HBAT 2 is migrated from PERL to Python. Latest Documentation: https://hbat.abhishek-tiwari.com/ Web Server Version: http://hbat-web.abhishek-tiwari.com Latest Github Release: http://github.com/abhishektiwari/hbat Citation: Tiwari, A., & Panigrahi, S. K. (2007). HBAT: A Complete Package for Analysing Strong and Weak Hydrogen Bonds in Macromolecular Crystal...
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10

    multinotes

    Text architecture for music theory.

    ...Furthermore, dynamic interactive documents can be useful for presenting complicated interdependencies to the reader more clearly, far beyond conventional paper publication. The mulitNotes text architecture and processing pipeline is based on d2d and standard technologies (XSLT, ECMAScript. LilyPond, PostScript, etc.) and addresses these issues. An overview about the software architecture and its operation is given in: Journal of the Text Encoding Initiative, Open Issue 18/2024: "Using d2d for Writing XML --- The multiNotes Text Architecture for Musical Analysis" https://doi.org/10.4000/132ex
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Agena

    Agena

    Agena is an interpreted procedural programming language.

    Agena is an easy-to-learn procedural programming language designed for science, scripting, and many other applications. Binaries are available for Windows, Linux, Solaris, OS/2, Mac OS X, Raspberry Pi and DOS.
    Leader badge
    Downloads: 135 This Week
    Last Update:
    See Project
  • 12

    libCIGI

    C++ Library for the Common Image Generator Interface

    ...Currently versions 3.0, 3.2 and 3.3 are supported as well as support for the *draft* V4.0 CIGI standard. Additional functionality to the base packet interfaces is provided through external classes so that to the packet headers have no further dependencies. Packet processing is kept simple and a couple of simple helper functions / classes are provided in CIGIGeneric.h and some other headers to support this. Helpers are provided for version interpretation and conversion of packet parameters to text. Testing is supported using the boost test framework to develop unit tests for each of the packets. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 13
    iGPS

    iGPS

    An interactive tool for GNSS position time series analysis

    * Please check latest update on https://github.com/igps-ftk/iGPS A tool written in Interactive Data Language (IDL), for processing and analyzing daily continuous GPS position time series. It can read time series in various formats, detect outliers, remove abnormal observation spans, find jumps, extract common-mode components(CMC), etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    TXM

    TXM

    Unicode XML TEI text analysis platform

    TXM is a free and open-source cross-platform Unicode & XML based text analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 15

    modnlp-plugins

    External plugins for modnlp/teccli

    This is a general project for modnlp/teccli plugins, with focus on text visualizaton.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DataMelt

    DataMelt

    Computation and Visualization environment

    DataMelt (or "DMelt") is an environment for numeric computation, data analysis, computational statistics, and data visualization. This Java multiplatform program is integrated with several scripting languages such as Jython (Python), Groovy, JRuby, BeanShell. DMelt can be used to plot functions and data in 2D and 3D, perform statistical tests, data mining, numeric computations, function minimization, linear algebra, solving systems of linear and differential equations. Linear, non-linear...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    MITRE Annotation Toolkit

    A toolkit for managing and manipulating text annotations

    The MITRE Annotation Toolkit (MAT) is a suite of tools which can be used for automated and human tagging of annotations. Annotation is a process, used mostly by researchers in natural language processing, of enhancing documents with information about the various phrase types the documents contain. MAT supports both UI interaction and command-line interaction, and provides various levels of control over the overall annotation process. It can be customized for specific tasks (e.g.,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    SEGTeX
    LaTeX package for geophysical publications
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    HistogramsApp

    HistogramsApp

    Application that generates KDE-PDP plots from geochronological data

    HistogramsApp is a Python 3.6 application that generates (KDE and PDP) from geochronological data .HistogramsApp allows to interactively setup plot parameters such as the bandwidth and the peak detection sensibility. To cite the application please refer to: 1) https://www.tandfonline.com/doi/abs/10.1080/00206814.2021.1954556?journalCode=tigr20 Rodriguez-Corcho, A. F., Rojas-Agramonte, Y., Barrera-Gonzalez, J. A., Marroquin-Gomez, M. P., Bonilla-Correa, S., Izquierdo-Camacho, D.,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    libpostal

    libpostal

    A C library for parsing/normalizing street addresses around the world

    A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data. libpostal is a C library for parsing/normalizing street addresses around the world using statistical NLP and open data. The goal of this project is to understand location-based strings in every language, everywhere. Addresses and the locations they represent are essential for any application dealing with maps (place search, transportation, on-demand/delivery services,...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 22
    pgfplots - A TeX package to draw normal and/or logarithmic plots directly in TeX in two and three dimensions with a user-friendly interface and pgfplotstable - a TeX package to round and format numerical tables. Examples in manuals and/or on web site. ATTENTION: As of February 2020, the development has been moved to https://github.com/pgf-tikz/pgfplots ! Please refer to the new page for downloads and change requests! This page is kept in read-only mode.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 23
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    Live Transcribe Speech Engine provides on-device speech recognition components that power real-time transcription for accessibility and everyday voice-first experiences. Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Leseratte is a Java parser for German written language. Currently, it contains a German lexicon (based on the Wiktionary), inflexion rules, a grammar and a parser. (Semantics component planned.) Usable as a Java library, also provides a graphical UI.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    KSUCCA Corpus

    A 50 million tokens corpus of Classical Arabic.

    King Saud University Corpus of Classical Arabic (KSUCCA) is a pioneering 50 million tokens annotated corpus of Classical Arabic texts from the period of pre-Islamic era until the fourth Hijri century (equivalent to the period from the seventh until early eleventh century CE), which is the period of pure classical Arabic. The main aim of this corpus is to be used for studying the distributional lexical semantics of The Quran words. However, it can be used for other research purposes, such...
    Downloads: 5 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB