Showing 18 open source projects for "batch text processing"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery turns your data warehouse into an AI platform. No new languages required.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    Stanza

    Stanza

    Stanford NLP Python library for many human languages

    Stanza is a collection of accurate and efficient tools for the linguistic analysis of many human languages. Starting from raw text to syntactic analysis and entity recognition, Stanza brings state-of-the-art NLP models to languages of your choosing. Stanza is a Python natural language analysis package. It contains tools, which can be used in a pipeline, to convert a string containing human language text into lists of sentences and words, to generate base forms of those words, their parts of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    TexToSpeech

    TexToSpeech

    TexToSpeech genera interpretaciones de texto a voz

    La presente herramienta genera interpretaciones de texto a voz utilizando las voces instaladas en tu ordenador por defecto. Puedes exportar el resultado del texto a voz generado en archivos .mp3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    mbFXWords

    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    Version 1.04. Applies and builds upon Apache OpenNLP. For English, French and German files. JavaFX Application, runs with Oracle Java Runtime Environment version 8 that is including JavaFX. NLP extensions: - Divide sentences in subclauses: segmentation. - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: pdf's. Gives out subject, predicate and object of sentences of pdf and plain text files. Provides comfortable GUI. Automatic...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Uosk

    Uosk

    Unicode On-Screen Keyboard: insert text snippets into any Windows app

    Uosk (formerly Charmappino) is a virtual keyboard that opens any ASCII, ANSI, UTF-8 and UTF-16 text file and converts words (e.g. separated by spaces) into buttons. Clicking one button the text snippet is pasted into any text editor (Notepad, WordPad, MS Office, OpenOffice...). You can edit snippets directly in Uosk, and save them as a new txt file. The snippet can also be copied to clipboard. It's basically a virtual keyboard with ability to set any amount of characters in one...
    Downloads: 2 This Week
    Last Update:
    See Project
  • $300 in Free Credit Across 150+ Cloud Services Icon
    $300 in Free Credit Across 150+ Cloud Services

    VMs, containers, AI, databases, storage | build anything. No commitment to start.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale with Google Cloud.
    Start Building Free
  • 5
    subs2srs

    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Leader badge
    Downloads: 60 This Week
    Last Update:
    See Project
  • 6

    Musaheb

    An Arabic collocation extraction tool

    “Musaheb”, an Arabic collocation extraction tool that has been designed and implemented to overcome the limitations of existing collocation extraction tools. “Musaheb” is able to extract n-gram collocations up to 5-gram, in addition to extracting the collocates of the nodes (the word-types we are looking for its collocates) within a window size of zero to 15 words. Moreover, it provides eight collocation statistics to calculate the strength of the collocation, and permits the input of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Text to Morse code translator

    Text to Morse code translator

    This program translates text into morse code.

    This program is used to translate text into morse code. After you type in the text you will see the morse code being displayed and you will hear the suitable beep.(Audio and graphical response).The space inbetween words is marked by a "/" (slash). FOR ANY HELP CONTACT ME USING MAIL
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Benkyou Studio

    Benkyou Studio

    Benkyou Studio is a Language study toolkit.

    Benkyou Studio is intended to be a One-stop integrated solution for working and learning with languages, For the Learner, it has flashcards,A multi choice Quiz which remembers and adjusts to the words you are struggling with, Speech Synthesis helps you hear the words as you study, you can even export the wordlist to sound files for your portable music player For the Professional, it has unicode lookup and converters, a character map viewer and exporter, a text file converter for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    LunaIDE is a powerfull IDE for Lua programming language and also have support for XML files. Languages: Spanish, English and Portuguese. Note: Lua is licenced under the MIT License and LunaIDE is licensed under the GNU/GPL v3. The creator of this software do not have any kind of relation with PUC-Rio.
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Japanese Text Analysis Tool

    Japanese Text Analysis Tool

    Generate frequency and readability reports from Japanese texts.

    cb's Japanese Text Analysis Tool allows users to analyze Japanese text files and generate 4 kinds of reports: 1) Word Frequency Report, 2) Kanji Frequency Report, 3) Formula-based Readability Report, 4) User-based Readability Report. Portable and does not require installation.
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • 11
    JNovel Formatter

    JNovel Formatter

    Convert Japanese novels to nicely formatted HTML files.

    Convert Japanese novels to nicely formatted HTML files. Supports Aozora formatting. Allows embedding of bookmark anchors. Portable and does not require installation. Supports batch processing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    JGloss

    JGloss

    Add readings and translations to Japanese text

    JGloss lets you import Japanese text documents and add reading and translation annotations for words, both automatically during import, and manually. It is written in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    TransLength

    Text translation proportional length checker

    Small tool to support the translation of text (application user interfaces, resource bundles, ...) to other languages. Checks that the length of the translated text does not exceed the length of the original text with respect to the proportions of a given font.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    MangaED is a universal program for manga translators. It consists of text processor, image viewer, dictionary and kanji finder
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15

    Tengwar Writer

    Tengwar Writer provides keyboard layouts to write directly in Tengwar

    Tengwar Writer is NOT a transcripter! It changes the keyboard layout for writing in Tengwar in a easy way. This means, if you type "T" you will get the Tengwar character for T and not something other. You can make your own modes for other languages or just your prefered keyboard layout. With the visual keyboard you can see what Tengwar character is on which key. THIS IS ALPHA!!!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    LinqYedict

    LinqYedict

    Translate Chinese to English

    Translate Chinese to English using CEDICT (cantonese dictionary). Demonstrate the speed of C# and Linq. Copy the chinese text from any browser/application to Windows clipboard and see the translation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17
    Downloadable and open source Chinese-Spanish vocabulary inspired by the CEDICT and EDICT dictionaries. It is distributed in a plain Unicode text file that can be easily ported to other formats or used by different applications.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18

    open-tamil

    Tamil Tools, Tamil Library for Python 2, 3

    Open-Tamil is a full featured Tamil text processing library in Python. It works fully in Python 2, 3. Published via pip - python package index. See: https://pypi.python.org/pypi/Open-Tamil/0.67
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next
MongoDB Logo MongoDB