Showing 136 open source projects for "batch text processing"

View related business solutions
  • Cut Your Data Warehouse Bill by 54% Icon
    Cut Your Data Warehouse Bill by 54%

    Migrate from Snowflake, Redshift, or Databricks with free tools. No SQL rewrites.

    BigQuery delivers 54% lower TCO with serverless scale and flexible pricing. Free migration tools handle the SQL translation automatically.
    Try Free
  • Host LLMs in Production With On-Demand GPUs Icon
    Host LLMs in Production With On-Demand GPUs

    NVIDIA L4 GPUs. 5-second cold starts. Scale to zero when idle.

    Deploy your model, get an endpoint, pay only for compute time. No GPU provisioning or infrastructure management required.
    Try Free
  • 1
    PDF Clown

    PDF Clown

    General-Purpose PDF Library for Java and .NET

    PDF Clown is a general-purpose Java and .NET library for manipulating PDF files through multiple abstraction layers, rigorously adhering to PDF 1.7 specification (ISO 32000-1). This project aims to provide a universal access to PDF files (creation, reading, editing, rendering...) through an accurate and elegant object-oriented API. * Features: http://pdfclown.org/overview/features/ * Overview: http://pdfclown.org/overview/architecture/ * Website: http://pdfclown.org/ * Blog:...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 3

    sitecheck

    Modular web site spider for web developers.

    More than just a link checker, sitecheck is a website spider (also known as a crawler) which can assist with SEO by testing an entire site plus both inbound links from search engines and outbound links to other sites for the following issues: looping redirects (HTTP 301/302), broken links (HTTP 404), server errors (HTTP 500), spelling mistakes, low readability scores (using the Flesch Reading Ease test), missing/empty/duplicate meta tags, duplicate content, slow page speed, W3C validation...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    Darkbot

    The IRC's Talking Robot

    [ Please read https://sourceforge.net/p/darkbot/news/2014/01/darkbots-revitalization/ ] Darkbot is a portable IRC chat robot written in the C language that can be taught responses to user inquiries, and even have conversations with them. Darkbot was originally created by Jason Hamilton as an aid for help channels on Intenet Relay Chat.
    Leader badge
    Downloads: 4 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    FCKeditor

    FCKeditor

    FCKeditor (retired)

    FCKeditor is the previous version of CKEditor and has been discontinued after version 2. The new CKEditor is redesigned from the ground up, offering more WYSIWYG text editing features, enhanced security and better integration. Don’t force yourself with retro FCKeditor. Switch to the new, cool CKEditor at ckeditor.com
    Downloads: 14 This Week
    Last Update:
    See Project
  • 6
    SEO & SEM - Marketing Text Writer

    SEO & SEM - Marketing Text Writer

    Open Source SEO & SEM Text Creation Tools for free Article Writer

    Open Source Tool for Search Engine Optimization (SEO & SEM) used for automatic content processing. These SEO Content Genrators and Article Writers based on Text Writer: https://www.artikelschreiber.com/en/ https://www.unaique.net/en/ https://www.unaique.com/ https://www.artikelschreiben.com/ https://www.buzzerstar.com/ https://googleduplicatecontentsolver.sourceforge.io/ https://inkassos.github.io/inkasso/ https://www.artikelschreiber.com/opensource/ https://www.sebastianenger.com/ https://www.artikelschreiber.com/marketing/review/ https://muckrack.com/markus-muller https://linktr.ee/textgenerator Code Contains: - Perl Source code, language databases and more
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    A tool to help finding the corresponding interwikis the when translating a wikipedia article from a given language to another one.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    JODReports is a solution for generating dynamic documents and reports in Java based on the OpenDocument format (ODF). Templates can be easily composed with a word processor such as OpenOffice.org Writer. Data sources include POJOs and XML.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 9
    Xelem is a compact Java-library to read and write Excel files of type SpreadsheetML. It can produce sophisticated, intricate and complex spreadsheets from within any Java program. And, since the release of xelem.2.0, it can read xml-spreadsheets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From Idea to Deployed AI App Fast Icon
    Go From Idea to Deployed AI App Fast

    One platform to build, fine-tune, and deploy. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10
    WYSIWYG .NET

    WYSIWYG .NET

    WYSIWYG html editor for .NET (C#, VB.NET)

    WYSIWYG .NET editor is an HTML editor that attempt to display the web page as it will show on the browser. It's a visual editor, and you don’t manipulate the code directly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    baco

    Script language with interpreter, for copy/backup operations

    This program reads the script file where the you have declared a set of procedures about copy / backup operations, and executes the procedures in batch or interactive mode. Read baco-readme.txt for a detailed explaination. baco works with the Python3 interpreter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    SuggestSE

    Autosuggest for jQuery

    SuggestSE is text auto-suggest replacement instead using default browser suggestion. Its work under jQuery library. Data source can be taken from server, local array, string, even from element HTML. Still customizable.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    OmniHelp is a cross-platform, browser-independent, tri-pane help viewer built in pure JavaScript and CSS with HTML 4. Some functions (such as help embedding) may in the future be in Java, C, or C++; CSH is fully supported. All code is under the LGPL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A general purpose framework for quickly build a working website in a typical PHP/MySQL environment. It is targeted at developers, experienced web administrators and anyone unafraid of hacking and using a text editor.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 15
    This project aims to build a suite of Natural Language Processing tools. Modules will include corpus indexing and access tools, a part-of-speech tagger, tokenisers, text classification software, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Plugins for Firefox and Google Chrome that automates usage of „Typograf“ service hosted at http://www.artlebedev.ru/tools/typograf/. Plugin takes text from any text area in Firefox and processes it according to typographic rules (e.g. inserts typ
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    This project page is not used go to: http://www.tinymce.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    phpCollect aims to quickly and easily graph, record, and display statistical host data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    A neat and powerful text transformation and processing package for Java applications. Flexible enough that you can use only the bits you need.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A stand-alone editor using Mediawiki markup language to generate HTML code. You can create and preview pages written using Mediawiki markup (i.e. Wikipedia pages) while off-line.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    The xslt2 script semAuth (semantic authoring) translates a freemind mindmap into an xhtml website and an RDF ontology
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    creole/c is a Wiki Creole parser and a HTML converter. It implements Wiki Creole 1.0 and almost all of its additions. The parser is written in C++ and has a simple event driven plain C API. The converter is a stand-alone console application.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    BBLib - The fast and easy library written on PHP. The library makes fast parsing and replacement BB-Tags. It is really easy to customise and add own tags. Supports simple-closing and autolinks. Smiles cooming soon...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    nanoWIME is a simple, flexible, easy-to-use javascript based WikiMarkup editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    PDML is an informal markup language written in PHP that is similar to HTML. It allows for the creation of complex PDF documents and can also be used in conjunction with PHP, to define templates which can generate dynamic PDF documents.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB