Search Results for "batch text processing" - Page 2

Showing 109 open source projects for "batch text processing"

View related business solutions
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end. Migrate from on-prem or other clouds with free migration tools.
    Try Free
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI Studio. Switch between models without switching platforms.
    Start Free
  • 1

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    Processing of smallRNA-Seq data to gather biologically relevant information requires application of multiple statistical and bioinformatics tools from different sources, each focusing on a specific step of the analysis pipeline. The analytical workflow can be challenging for the continuous interventions by the operator, a critical factor when large numbers of datasets need to be analyzed at once.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    This project is devoted to the development of natural language processing tools and resources for the Lingala language, which is spoken by tens of millions of people in central Africa.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    seppdflatex

    seppdflatex

    Build a large LaTeX book with separate linked chapters

    seppdflatex is a Perl script which automates a lot of the tasks needed to compile PDF documents from LaTeX source for a multi-volume book, or a book with many huge chapters which you may not want as a single document, but which should all be unified by cross-references and external hyperlinks, so a PDF reader will open a link to an external chapter PDF file. The Table of Contents and LOF and LOT are made for all chapters, and the TOC, LOF, LOT are all hyperlinked to the correct external PDF...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    SEO & SEM - Marketing Text Writer

    SEO & SEM - Marketing Text Writer

    Open Source SEO & SEM Text Creation Tools for free Article Writer

    Open Source Tool for Search Engine Optimization (SEO & SEM) used for automatic content processing. These SEO Content Genrators and Article Writers based on Text Writer: https://www.artikelschreiber.com/en/ https://www.unaique.net/en/ https://www.unaique.com/ https://www.artikelschreiben.com/ https://www.buzzerstar.com/ https://googleduplicatecontentsolver.sourceforge.io/ https://inkassos.github.io/inkasso/ https://www.artikelschreiber.com/opensource/ https://www.sebastianenger.com/ https://www.artikelschreiber.com/marketing/review/ https://muckrack.com/markus-muller https://linktr.ee/textgenerator Code Contains: - Perl Source code, language databases and more
    Downloads: 0 This Week
    Last Update:
    See Project
  • Cut Cloud Costs with Google Compute Engine Icon
    Cut Cloud Costs with Google Compute Engine

    Save up to 91% with Spot VMs and get automatic sustained-use discounts. One free VM per month, plus $300 in credits.

    Save on compute costs with Compute Engine. Reduce your batch jobs and workload bill 60-91% with Spot VMs. Compute Engine's committed use offers customers up to 70% savings through sustained use discounts. Plus, you get one free e2-micro VM monthly and $300 credit to start.
    Try Compute Engine
  • 5
    docx2txt

    docx2txt

    Perl based utility to extract formatted text content from MS Docx file

    Docx2txt is a Perl based command-line utility to convert (even corrupted) Microsoft docx documents to reasonably formatted text files, along with appropriate character conversions. Apart from Perl it also requires a command line unzipping program like unzip/7z/pkzipc/wzunzip.
    Leader badge
    Downloads: 83 This Week
    Last Update:
    See Project
  • 6

    Autshumato Text Anonymiser

    Text anonymiser for the Autshumato project.

    A tool for the anonymisation of text corpora which entails the identification of entities that may convey confidential information and replacing those entities with with randomly selected entities of the same type.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    ...There is also a module for CRM. In March 2012, Lx-Office was renamed into "kivitendo"  to rule out misunderstandings  by equating name part "office"  to "office package" for spreadsheets and text processing. The new name comes from the Swahili, translated it means "practically". To preserve the project history, the sourceforge area will keep the previous product name for the time being.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Antiexcel is a Microsoft-Excel file reader. It shows the excel file on a text console or an x-console. It can be used as an attachment viewer for any text-based email client ( ex. mutt)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    An Gramadóir is a grammar checking engine that is designed for the rapid development of grammar checkers for minority languages and other languages with limited computational resources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build on Google Cloud with $300 in Free Credit Icon
    Build on Google Cloud with $300 in Free Credit

    New to Google Cloud? Get $300 in free credit to explore Compute Engine, BigQuery, Cloud Run, Vertex AI, and 150+ other products.

    Start your next project with $300 in free Google Cloud credit. Spin up VMs, run containers, query exabytes in BigQuery, or build AI apps with Vertex AI and Gemini. Once your credits are used, keep building with 20+ products with free monthly usage, including Compute Engine, Cloud Storage, GKE, and Cloud Run functions. Sign up to start building right away.
    Start Free Trial
  • 10
    Guiguts is a Perl/Tk text editor designed for editing and formatting public domain material for inclusion at Project Gutenberg (www.gutenberg.org). Features are provided for editing text files produced by Distributed Proofreaders (www.pgdp.net). For help or to contact the developers, see http://www.pgdp.net/phpBB2/viewtopic.php?t=46944
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    latexdiff is a Perl script, which compares two latex files and marks up significant differences between them (i.e. a diff for latex files). Various options are available for visual markup using standard latex packages such as "color.sty".
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    almtools

    almtools

    Collection of Open Source tools for HP ALM

    Collection of Open Source and free tools to be used for HP ALM administration, customization, and also end-user usage. This is a community effort. Feel free to use, share and contribute back!
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    DataStreamProcessor

    TPL (transfer,parse,load) tool for batch files.

    This is an enterprise-strength system for batch file processing, e.g. transfer, parse and load data using batch (text) files within and without the enterprise. The system is controlled through parameters and doesn't require any programming, code generation or code deployment. This is a heavy duty back end system with no GUI. Nonetheless it's very easy to use, easier than most GUI-based ETLs, even easier to install.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    A set of Unix command line tools for quick and convenient batch processing of tabular text files (a.k.a., tab-delimited, csv, or flat file format) with a header line. Provides delimiter and compression detection, column reference by name. * tblmap: per-line ("map") computation: derive columns through an expression, delete, reorder, filter rows. * tblred: compute ("reduce") aggregations (e.g., sum, average) over groups defined by key columns
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    otl
    otl is a text processor for generating markup from plain text. Much of both the input and output formats can be customized. otl supports structures such as nested ordered lists, headers and footers, and tables.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    This is a colleciton of various tools for preparing and processing hypertext (but not HTML), including a text editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Maintain your resume like the lazy slob you are! Write a simple text file, produce simple HTML. Don't screw around with XML or any other form of overkill.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Build texts and documents bottom-up from building blocks using a language similar (and compatible with) that of GNU Make, in such a way that variants (e.g. translations) can be generated by overwriting parts of the text with minimal redundancy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Apostilator is a XML based front end to LaTeX that simplifies the process of writing all kinds of documents from letters to books, reports and articles. It's easy to customize and create templates defining a style for all your documents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Why is it quicker to express simple equations in plaintext that it is to write the equivalent LaTeX? Easylatex is a preprocessor to make writing LaTeX much quicker. Project activity mode (http://bayleshanks.com/pamv1 ): sporadic
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    fig2ps is a perl script designed to convert Xfig files to postscript or PDF files, processing text using LaTeX.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    LaTeX2HTML-gui is a graphical user interface for LaTeX2HTML. You can set about 100 parameters of LaTeX2HTML in graphic dialogs instead of typing a config file. LaTeX2HTML offers the descriptions of all parameters without searching the documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Since Azeri Turkish is written in different alphabets throughout the world, This project aims to convert texts between some mainly used alphabets. specifcally, conversion between arbaic and latin alphabets is intended.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB