Showing 20 open source projects for "text batch processing tools"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    ChordSmith

    ChordSmith

    Chordpro editor that can display, transpose and print song sheets.

    ChordSmith is a chordpro editor that can display, transpose and print song sheets containing chords and lyrics. It can also edit and convert song sheet formats (including Harmonica tabs) between chordpro format (chords in square brackets in line with lyrics) and two-line format (chords above lyrics). You can find many free sources of song sheets in both formats on the Internet. More information at https://chordsmith.sourceforge.io/ Here are just a few of ChordSmith's many features:
    Downloads: 47 This Week
    Last Update:
    See Project
  • 3
    LPub3D

    LPub3D

    LDraw™ editor for LEGO® style digital building instructions.

    LPub3D is an Open Source WYSIWYG editing application for creating LEGO® style digital building instructions. LPub3D is developed and maintained by Trevor SANDY. It uses the LDraw™ parts library, the most comprehensive library of digital Open Source LEGO® bricks available (www.ldraw.org/ ) and reads the LDraw LDR and MPD model file formats. LPub3D is available for free under the GNU Public License v3 and runs on Windows, Linux and macOS Operating Systems. LPub3D is also...
    Leader badge
    Downloads: 27 This Week
    Last Update:
    See Project
  • 4

    DWDS/Dialing Concordance

    a collection of indexing and search tools for corpus linguists

    DWDS/Dialing Concordance (DDC) - a collection of index and search tools for corpus linguists
    Downloads: 1 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    Data Science at the Command Line

    Data Science at the Command Line

    Data science at the command line

    Command Line by Jeroen Janssens, published by O’Reilly Media in October 2021. Obtain, scrub, explore, and model data with Unix Power Tools. This repository contains the full text, data, and scripts used in the second edition of the book Data Science at the Command Line by Jeroen Janssens. This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You’ll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    subs2srs

    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Leader badge
    Downloads: 30 This Week
    Last Update:
    See Project
  • 7

    Musaheb

    An Arabic collocation extraction tool

    “Musaheb”, an Arabic collocation extraction tool that has been designed and implemented to overcome the limitations of existing collocation extraction tools. “Musaheb” is able to extract n-gram collocations up to 5-gram, in addition to extracting the collocates of the nodes (the word-types we are looking for its collocates) within a window size of zero to 15 words. Moreover, it provides eight collocation statistics to calculate the strength of the collocation, and permits the input of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Japanese Text Analysis Tool

    Japanese Text Analysis Tool

    Generate frequency and readability reports from Japanese texts.

    cb's Japanese Text Analysis Tool allows users to analyze Japanese text files and generate 4 kinds of reports: 1) Word Frequency Report, 2) Kanji Frequency Report, 3) Formula-based Readability Report, 4) User-based Readability Report. Portable and does not require installation.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 9

    isbntools

    A command line tool to extract, transform and get metadata for ISBNs

    As of 2015-06-02, this project is no longer under active development.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Compliant and Reliable File Transfers Backed by Top Security Certifications Icon
    Compliant and Reliable File Transfers Backed by Top Security Certifications

    Cerberus FTP Server delivers SOC 2 Type II certified security and FIPS 140-2 validated encryption.

    Stop relying on non-certified, legacy file transfer tools that creak under the weight of modern security demands. Get full audit trails, advanced access controls and more supported by an award-winning team of experts. Start your free 25-day trial today.
    Start Free Trial
  • 10
    Klen-library
    Open Source program for creating electronic books and electronic manuals. Contains professional tools.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    FALCON - Text Search Java Project

    FALCON - Text Search Java Project

    JSON based text search Java Project

    ----------------- - What is it? - ----------------- The "Falcon Search" is a JAVA API and tool to search inside the documents. It was originally started to search the content in pdf files under the project "HAWK Search". Searching with this tool is query-based not word-based as in most of the document search tools OR document readers. It also takes care of jumbling of words within query and spelling mistakes. Commonly used techniques in this project are Natural Language...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Rephrase is a simple string replacement application. The default package comes with a wordiness rule file, a 1337 rule file, and an English to French rule file. It can be used on the command line as part of a set of other tools as well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Japanese Language Learning Tools for Hiragana, Katakana and Kanji. Kana Text Editor. Vocabulary Games and Sentence builder.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The Python scripts for the conversion from the Chinese Pinyin transcription(ISO 7098) to International Phonetic Alphabet(IPA), comprised of a core module for developers and a flexible GUI application for the common end-users on Modern Chinese phonetics.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    The objective of the OpenBerg Project is to develop Open-Source, Open-Standards-based, Multi-Platform tools for eBook authors, editors and users. We are currently working on OpenBerg Lector, an e-Book reader, and OpenBerg Rector, an e-Book compiler.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Collection of tools for input, reading, processing, and typesetting Taiwanese language. Includes SCIM and quail input methods, Firefox dictionary plugin, plus scripts for LaTeX and HTML generation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Tools for Vietnamese Language
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    It's a tools generating some graphics interfaces for applications in Java language. It's to gain a lot of time while building some windows. The Swings classes are very difficult to use! (especially the Layouts) We describe the windows content in XML!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ZhDict provides command-line tools to aid English speakers in reading and understanding Chinese texts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    open-tamil

    Tamil Tools, Tamil Library for Python 2, 3

    Open-Tamil is a full featured Tamil text processing library in Python. It works fully in Python 2, 3. Published via pip - python package index. See: https://pypi.python.org/pypi/Open-Tamil/0.67
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB