Search Results for "text batch processing tools" - Page 15

Showing 364 open source projects for "text batch processing tools"

View related business solutions
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    XML DTD and related tools for documenting Tcl packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    SchemaDoc is a XML-based markup language for documenting XML schemas. The work products include both the vocabulary and a set of tools for combining it with the schema source (e.g. a DTD) to produce documentation in HTML, XML DocBook, LaTeX, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Guber, short for Gutenberg renamer, renames text files provided by the Gutenberg project into the format "Author, Title" by automatically extracting the relavent information from the text file. Guber can do single files or Batch processing of a directo
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    COOK is an embedded language which can be used as a macro preprocessor and for similar text processing. The concept is similar to PHP, but is oriented towards batch-mode processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 5
    This project will compile a hungarian wordlist for use with spell-checkers like aspell. Additionally it will develop generic tools useful to compile and maintain wordlists for any language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Collection of tools to convert data from Windows Treepad files to Unix Yank format and vice versa. The following tools are available: hjt2yank, yank2hjt, and hjtyank windows frontend.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ZhDict provides command-line tools to aid English speakers in reading and understanding Chinese texts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    PyKit includes some small applications written in python, such as batch renaming, text processing and so on.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    An experimental set of tools for text analysis and dictionary construction. One goal is to improve text-input e.g. on devices with touchscreens using dictionary-based symbolic on-screen keyboards.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10

    open-tamil

    Tamil Tools, Tamil Library for Python 2, 3

    Open-Tamil is a full featured Tamil text processing library in Python. It works fully in Python 2, 3. Published via pip - python package index. See: https://pypi.python.org/pypi/Open-Tamil/0.67
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    AuthorWeb is an organization platform for writers of all kind which includes tools for creating and organizing Characters, Sets, Plots, Scenes, and other information for the craft. Written in Java, the executable jar can be run on any OS platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    This project will provide tools for user to convert existing web sites, blogs and documents with non-standard Myanmar font data to Unicode 5.1 compatible data. (Zawgyi to Unicode 5.1, WinMyanmar system to Unicode 5.1 etc.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    mms-300m-1130-forced-aligner

    mms-300m-1130-forced-aligner

    CTC-based forced aligner for audio-text in 158 languages

    mms-300m-1130-forced-aligner is a multilingual forced alignment model based on Meta’s MMS-300M wav2vec2 checkpoint, adapted for Hugging Face’s Transformers library. It supports forced alignment between audio and corresponding text across 158 languages, offering broad multilingual coverage. The model enables accurate word- or phoneme-level timestamping using Connectionist Temporal Classification (CTC) emissions. Unlike other tools, it provides significant memory efficiency compared to the TorchAudio forced alignment API. Users can integrate it easily through the Python package ctc-forced-aligner, and it supports GPU acceleration via PyTorch. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Bio_ClinicalBERT

    Bio_ClinicalBERT

    ClinicalBERT model trained on MIMIC notes for clinical NLP tasks

    Bio_ClinicalBERT is a domain-specific language model tailored for clinical natural language processing (NLP), extending BioBERT with additional training on clinical notes. It was initialized from BioBERT-Base v1.0 and further pre-trained on all clinical notes from the MIMIC-III database (~880M words), which includes ICU patient records. The training focused on improving performance in tasks like named entity recognition and natural language inference within the healthcare domain. Notes were...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB