Search Results for "text batch processing tools" - Page 8

198 projects for "text batch processing tools" with 1 filter applied:

  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    xtopdf: Tools to convert other formats (x) to PDF; x as in math. - solve for x :-) Currently x == {.txt, .DBF}. Others to follow. Benefits: all those of PDF (better cross-platform viewing/printing, read-only, etc.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    tgen generates a Web site from a collection of input files of a variety of types, using a set of registered HTML autogenerators. Cvs-Brancher allows scheduling of web deployments. vwebedit provides web-based editing of cvs repositories.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Project to create a unified FAQ XML format with all applicable software to convert it to various formats, such as multiple forms of HTML, TeX, PDF, text files, etc. Useful for most of "FAQ keepers" on various forums and discussion lists.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    It's a tools generating some graphics interfaces for applications in Java language. It's to gain a lot of time while building some windows. The Swings classes are very difficult to use! (especially the Layouts) We describe the windows content in XML!
    Downloads: 0 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    The DocConversion project provides a distributed document conversion solution with a well defined API which makes use of existing convstion tools and/or a centralized conversion server. This is part of the PRONIR research at http://www.pronir.nl
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    OpenOCR will be a commercial quality ocr engine with tools for pre- and post-processing of images and resulting text.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    xSiteable is a fully relational website compiler written entirely in XSLT, using topic maps (using XTM directly) as the backbone information technology, bundled with the fast Sablotron XSLT parser, a GUI admin tool and other nifty features. Watch this sp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    QTide is a QT-based Textditor with some nice features: Simple installationscript, Tabbed editor window to allow simultaneous editing of multiple files, Allows integration of own tools/programs, extendable syntaxhighlighting.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Code to process human readable input is often highly stylized and repetitive. This project extracts the common elements found in such code and makes them available in a concise form as C tables and subroutines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Tools for extracting and transforming XML-like mark-up, embedded in source code comments, into proper external entities or well-formed XML files. Can be used for JavaDoc-like "literate programming", or embedding other build-related or CM metadata.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    XML DTD and related tools for documenting Tcl packages.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    SchemaDoc is a XML-based markup language for documenting XML schemas. The work products include both the vocabulary and a set of tools for combining it with the schema source (e.g. a DTD) to produce documentation in HTML, XML DocBook, LaTeX, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Guber, short for Gutenberg renamer, renames text files provided by the Gutenberg project into the format "Author, Title" by automatically extracting the relavent information from the text file. Guber can do single files or Batch processing of a directo
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    COOK is an embedded language which can be used as a macro preprocessor and for similar text processing. The concept is similar to PHP, but is oriented towards batch-mode processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    This project will compile a hungarian wordlist for use with spell-checkers like aspell. Additionally it will develop generic tools useful to compile and maintain wordlists for any language.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Collection of tools to convert data from Windows Treepad files to Unix Yank format and vice versa. The following tools are available: hjt2yank, yank2hjt, and hjtyank windows frontend.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ZhDict provides command-line tools to aid English speakers in reading and understanding Chinese texts.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    PyKit includes some small applications written in python, such as batch renaming, text processing and so on.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    An experimental set of tools for text analysis and dictionary construction. One goal is to improve text-input e.g. on devices with touchscreens using dictionary-based symbolic on-screen keyboards.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    AuthorWeb is an organization platform for writers of all kind which includes tools for creating and organizing Characters, Sets, Plots, Scenes, and other information for the craft. Written in Java, the executable jar can be run on any OS platform.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This project will provide tools for user to convert existing web sites, blogs and documents with non-standard Myanmar font data to Unicode 5.1 compatible data. (Zawgyi to Unicode 5.1, WinMyanmar system to Unicode 5.1 etc.)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    mms-300m-1130-forced-aligner

    mms-300m-1130-forced-aligner

    CTC-based forced aligner for audio-text in 158 languages

    mms-300m-1130-forced-aligner is a multilingual forced alignment model based on Meta’s MMS-300M wav2vec2 checkpoint, adapted for Hugging Face’s Transformers library. It supports forced alignment between audio and corresponding text across 158 languages, offering broad multilingual coverage. The model enables accurate word- or phoneme-level timestamping using Connectionist Temporal Classification (CTC) emissions. Unlike other tools, it provides significant memory efficiency compared to the TorchAudio forced alignment API. Users can integrate it easily through the Python package ctc-forced-aligner, and it supports GPU acceleration via PyTorch. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Bio_ClinicalBERT

    Bio_ClinicalBERT

    ClinicalBERT model trained on MIMIC notes for clinical NLP tasks

    Bio_ClinicalBERT is a domain-specific language model tailored for clinical natural language processing (NLP), extending BioBERT with additional training on clinical notes. It was initialized from BioBERT-Base v1.0 and further pre-trained on all clinical notes from the MIMIC-III database (~880M words), which includes ICU patient records. The training focused on improving performance in tasks like named entity recognition and natural language inference within the healthcare domain. Notes were...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB