Showing 176 open source projects for "data processing"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Simple Java delimited and fixed width file parser. Handles CSV, Excel CSV, Tab, Pipe delimiters, just to name a few. Maps column positions in the file to user friendly names via XML. See "FlatPack Feature List" under News for complete feature list.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    The DITA Open Toolkit is an implementation of the OASIS DITA XML Specification. The Toolkit transforms DITA content into many deliverable formats. See https://www.dita-ot.org/ for documentation and links to downloads. The source code and issue trackers have been moved to https://github.com/dita-ot/dita-ot
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4

    python-textgrid

    Python module that handles ASCII-style text-based tables.

    This module contains only a single new-style class, that is called textgrid. It's a string-based solution that allows you to present data converted to strings in ASCII-Style tables. Actually, the class doesn't support different dimensions for each row and column. Textgrid is an ASCII-Style table, easyly writable to a text file. It supports various operations, like writing in its cells, adding or deleting rows and columns, changing the measures of cells, and more. Textgrids allow to simply...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    writeup
    Programming language for converting source documents into HTML or XML. Writeup is a combination of a markup language (similar to markdown) and a macro pre-processing language that enables a formal production system to be set up for documents.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6

    Detexter

    Detexter is an app designed to extract text from PDF files.

    Detexter lets you extract text from multiple PDF files. Detexter uses the PDFBox library for its text extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Consilium Sentence Suggestions Tools

    Consilium Sentence Suggestions Tools

    Consilium – User Defined sentence Suggestion Tool.

    There are many tools available in market which will provide spell correction or grammer correction while making documents, but very few tools are available which are providing sentence completion according to previously entered text. But this all are providing sentence complition suggestion for sentences which are oftenly or regularly used by all people in same manner. But in reality style of writing changes person to person. While our aim is to provide a sentence suggestion tool which...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    abnTeX

    abnTeX está em novo endereço: http://www.abntex.net.br

    ATENÇÃO: abnTeX está em novo endereço: http://www.abntex.net.br abnTeX is a set of LaTeX macros that follow the rules of ABNT (Brazilian Association of Technical Standards). ### abnTeX é um conjunto de macros LaTeX que segue as regras da ABNT (Associação Brasileira de Normas Técnicas). O projeto foi totalmente reconstruído baseado nas novas regras da ABNT e utilizando uma nova técnica para produção da classe, mais informações em: Portal do projeto: http://www.abntex.net.br Grupo de...
    Leader badge
    Downloads: 141 This Week
    Last Update:
    See Project
  • 9
    TextBlob

    TextBlob

    TextBlob is a Python library for processing textual data

    Simple, Pythonic, text processing, Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. It provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both. Supports word inflection (pluralization and singularization) and lemmatization,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    jPod is a rich PDF manipulation and rendering framework. A complete rendering library based on jPod is available here at "jPodRenderer". To see jPod & jPodRenderer at work, have a look at www.cabaret-solutions.com
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    A python script that uses wxwidgets. View or edit delimited data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    A utility to read/write from: pipes, sockets, standard handles, files, block devices, NT Native API objects, file mapping/section objects, comm ports, etc. Supports various text/data conversions, multiple inputs/outputs, and various IO types/options.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A simple text process library, aims to assist parsing all kinds of text including plain text, XML, HTML, etc., which means it can be used as a simple XML parser or a HTML parser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    JODReports is a solution for generating dynamic documents and reports in Java based on the OpenDocument format (ODF). Templates can be easily composed with a word processor such as OpenOffice.org Writer. Data sources include POJOs and XML.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 15
    hext

    hext

    Extensible binary and text editor.

    Heavily extensible (via plugins) editor of text and binary data from files and memory.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    KefirBB

    The Java text processor.

    Project was moved to https://github.com/kefirfromperm/kefirbb
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    The DocBook Publishing Utilities tools, which make creation and publishing of DocBook easier. The tools are: Maven plug-in to Transform HTML into XML (use after docbkx); Eclipse DocBook table editor; Eclipse wizards for initial DocBook files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    NRtfTree library is a set of classes written entirely in C# which may be used to manage (read and write) RTF documents in your own applications. A java port of the library can be found in http://www.sgoliver.net/blog/?page_id=92
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    XSL Transformation (XSLT) stylesheets. They are used to convert documents from Adobe's XML Data Package (XDP) XML vocabulary into documents in the W3C's XSL Formatting Objects (XSL-FO) vocabulary.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    gPDFText is a text editor for GTK+ that opens PDF documents for ebook readers, converts the text contents into plain ASCII text, restores the original paragraphs and removes unwanted line breaks to allow easier zooming on the reader.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    DocFrac is a document converter that can convert between RTF, HTML and ASCII text. This includes RTF to HTML and HTML to RTF. Supports text formatting (e.g. bold); tables; and most European languages. Available for Windows; Linux; ActiveX and DLL.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22

    EncTool

    Command line tool to detect and convert files encoding.

    Command line tool to detect and convert files encoding. Works with files or directories. Can be used to add or remove UTF-8 BOM. Multi-platform. EncTool requires Java 1.5 or highter.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    majix

    RTF to XML Converter

    With MajiX you can automatically transform RTF files (Microsoft Word files) in XML. MajiX is Java compliant. You can convert Headings, lists (numbered or not), tables, bold, italics, underline and some more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    FigTeX manages images and their easy inclusion in LaTeX documents. Similar to BibTex, the image information is stored in an external file and is imported into the document as needed. It comes with a comfortable GUI for managing the image library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    DPRK pull is a script that pulls the English language North Korean news articles from the KCNA website and puts them into one file for reading by a Text to Speech program.
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB