Showing 69 open source projects for "batch text processing"

  • 1
    Best-of Python

    A ranked list of awesome Python open-source libraries

    ...Correctly generate plurals, ordinals, indefinite articles; convert numbers. Libraries for loading, collecting, and extracting data from a variety of data sources and formats. Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
    Downloads: 6 This Week
  • 2
    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 0 This Week
  • 3
    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including harmonica tabs) between chordpro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/. Here are just a few of ChordSmith's many features:
    Downloads: 60 This Week
  • 4
    biblatex
    Biblatex is a LaTeX package which provides full-featured bibliographic facilities
    Downloads: 76 This Week
  • 5
    Advanced Trigonometry Calculator

    Precision Trigonometry: Advanced Calculator for Complex Math

    Advanced Trigonometry Calculator is equipped with a user-friendly interface that allows for easy input of problems and instant computation. Professionals such as engineers who need to perform advanced trigonometric calculations in their work will find this tool extremely useful. ATC Online Alpha: https://advantrigoncalc.sourceforge.io/atc/ More info by clicking below: https://advantrigoncalc.sourceforge.io/ Advanced Trigonometry Calculator was only and always only developed by...
    Downloads: 29 This Week
  • 6
    LPub3D

    LDraw™ editor for LEGO® style digital building instructions.

    LPub3D is an Open Source WYSIWYG editing application for creating LEGO® style digital building instructions. LPub3D is developed and maintained by Trevor SANDY. It uses the LDraw™ parts library, the most comprehensive library of digital Open Source LEGO® bricks available (www.ldraw.org/), and reads the LDraw LDR and MPD model file formats. LPub3D is available for free under the GNU Public License v3 and runs on Windows, Linux and macOS operating systems. LPub3D is also...
    Downloads: 34 This Week
  • 7

    DWDS/Dialing Concordance

    a collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists
    Downloads: 7 This Week
  • 8
    Data Science at the Command Line

    Data science at the command line

    ...To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux. You’ll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you’re comfortable processing data with Python or R, you’ll learn how to greatly improve your data science workflow by leveraging the command line’s power.
    Downloads: 0 This Week
  • 9
    Program for generating HTML documents with embedded mathematical expressions, processed with Maxima (maxima.sourceforge.net).
    Downloads: 0 This Week
  • 10
    Command-Line Text Processing

    From finding text to search and replace

    Command-line-text-processing is a curated educational repository providing many examples and tutorials on how to use various command-line tools for processing text: searching, replacing, sorting, transforming, filtering, etc. It covers tools like grep, sed, awk, perl, Ruby one-liners, file attribute commands, sorting, tail/head/less/cat, and many more.
    Downloads: 0 This Week
  • 11
    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX application; runs with Oracle Java Runtime Environment version 8, which includes JavaFX. NLP extensions: - Divide sentences into subclauses: segmentation. - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: PDFs. Outputs the subject, predicate and object of sentences in PDF and plain text files. Provides a comfortable GUI. Automatic...
    Downloads: 0 This Week
  • 12
    lottie vectors

    Create, display and process 2D vectors in a 3D window.

    Lottie Vectors is an application for Matlab that allows you to do some pretty neat things with vectors. More exactly, displaying them in ways that hopefully will allow you to explore and better understand your vector data. The basic idea is simple: take a vector defined in one of a few different types of data formats and map it on the screen. Add another vector and you start to form a 'route'. Each route or position vector can be accompanied with a 'force' vector. This can be used to...
    Downloads: 1 This Week
  • 13
    Distributed Proofreaders
    Project has moved to https://github.com/DistributedProofreaders/dproofreaders. Distributed Proofreaders is a web application intended to ease the process of converting public domain books and other printed materials into e-texts. The main site is at http://www.pgdp.net. By breaking the work into individual pages, many proofreaders can work on the same book at the same time. This significantly speeds up the proofreading and e-text creation process. When a proofer elects to...
    Downloads: 0 This Week
  • 14
    Benkyou Studio

    Benkyou Studio is a Language study toolkit.

    Benkyou Studio is intended to be a one-stop integrated solution for working and learning with languages. For the learner, it has flashcards, a multiple-choice quiz which remembers and adjusts to the words you are struggling with, and speech synthesis that helps you hear the words as you study; you can even export the wordlist to sound files for your portable music player. For the professional, it has Unicode lookup and converters, a character map viewer and exporter, a text file converter for...
    Downloads: 0 This Week
  • 15

    WebDjVuTextEd

    Edit the OCR text layer of DjVu documents in a web browser

    WebDjVuTextEd lets you edit the text layer of OCR'ed DjVu documents in a web browser. You can modify the structure (paragraphs, lines, words...), create, delete and edit text nodes, modify their container box with the mouse, and run a spellchecker. The program does not read DjVu files directly; it requires exported XML text data and images. When used without a web server, you can open and save local files, but cannot take advantage of auto-save and spell checking. Note that current SVN...
    Downloads: 0 This Week
  • 16

    NotesTyper

    Convert your text typing into music

    The NotesTyper system converts text typing into music. You will need a computer keyboard, a microphone (a notebook mic is OK) and the Chrome or Firefox browser. NotesTyper has multiple operating modes and settings, which let you produce different music from the same text. Computer keyboards do not capture keypress velocity, but NotesTyper overcomes this restriction by processing the microphone input level as you type.
    Downloads: 0 This Week
  • 17
    Virastyar

    Virastyar is a spell checker for low-resource languages

    Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages. Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering,...
    Downloads: 402 This Week
  • 18
    JGloss

    Add readings and translations to Japanese text

    JGloss lets you import Japanese text documents and add reading and translation annotations for words, both automatically during import, and manually. It is written in Java.
    Downloads: 1 This Week
  • 19

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
  • 20
    MatrixUser

    A Multi-functional GUI-based Program for Image Processing and Analysis

    The MatrixUser project is moving to GitHub; the latest version can be obtained from https://leoliuf.github.io/MatrixUser/. Most medical images (e.g. CT, MRI, PET) comprise multiple frames which represent slices, phases, timing, etc. from the same imaging object. Those images can be saved as multidimensional matrices in Matlab thanks to Matlab's powerful support of multidimensional data representation. However, within Matlab, most image manipulation functions are limited or...
    Downloads: 0 This Week
  • 21

    TransLength

    Text translation proportional length checker

    Small tool to support the translation of text (application user interfaces, resource bundles, ...) to other languages. Checks that the length of the translated text does not exceed the length of the original text with respect to the proportions of a given font.
    Downloads: 0 This Week
  • 22
    Klen-library
    Open Source program for creating electronic books and electronic manuals. Contains professional tools.
    Downloads: 1 This Week
  • 23
    FALCON - Text Search Java Project

    JSON based text search Java Project

    What is it? "Falcon Search" is a Java API and tool for searching inside documents. It was originally started to search the content of PDF files under the project "HAWK Search". Searching with this tool is query-based, not word-based as in most document search tools or document readers. It also handles jumbled words within a query and spelling mistakes. Commonly used techniques in this project are Natural Language...
    Downloads: 0 This Week
  • 24
    DrPython is a highly customizable cross-platform IDE to aid programming in Python. It was developed with teaching in mind, and has a clean, simple interface. It is written in Python, using wxPython for the GUI.
    Downloads: 5 This Week
  • 25
    Inhaler

    speed reading tool

    Inhaler is a speed reading tool programmed in Scala using Swing. It features variable reading speed and font size. It is licensed under GPL.
    Downloads: 0 This Week