Showing 316 open source projects for "data processing"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 1
    SICP JavaScript Edition

    SICP JavaScript Edition

    XML sources of SICP and SICP JS, and support

    This repository contains processing scripts and sources for the textbook SICP JS: Structure and Interpretation of Computer Programs, JavaScript Edition (SICP JS). See Preface for background.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    HTML parser in Delphi

    A Delphi class with functions to read and dissect a HTML file

    THTMLdom is a (Delphi) class with functions to read a HTML source file and dissect it into a tree of THTMLelement. The attributes of the HTML tags are stored in the elements. Functions are provided to select elements on the basis of the attribute values or tag names. The structure of the tree can be shown and it can be rendered as plain text. The source is plain Delphi pascal, requiring a version that supports Tdictionary. There is no dependency on 3rd party units. The file to be parsed...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 3
    CSSBox

    CSSBox

    Pure Java HTML / CSS rendering engine

    CSSBox is an (X)HTML/CSS rendering engine written in pure Java. Its primary purpose is to provide a complete information about the rendered page suitable for further processing. However, it also allows displaying the rendered document.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    jtc

    jtc

    JSON processing utility

    jtc stand for: JSON transformational chains (used to be JSON test console). jtc offers a powerful way to select one or multiple elements from a source JSON and apply various actions on the selected elements at once (wrap selected elements into a new JSON, filter in/out, sort elements, update elements, insert new elements, remove, copy, move, compare, transform, swap around and many other operations).
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    Flat file extractor can be used for reading and parsing different flat file structures and printing them in different formats. ffe is a command line tool developed in GNU/Linux environment and it is distributed under GPL. Project moved to https://github.com/igitur/ffe
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    FREE CSV & Text (TXT) File Splitter

    FREE CSV & Text (TXT) File Splitter

    Ability to work with large csv files and cut them down to run in Excel

    This CSV and TXT file splitter firstly allows you to work with large data files. We have tested this on the largest CSV file we could find. It had over 25.6 Million rows. Microsoft Excel can only load 1 Million rows. So the challenege was on! Splitting a file in this case by 990,000 rows made sure that it was more than capable to load into excel. Splitting 25.6Million rows took aroud 50 minutes. Has been tested reading 25Million rows which took around 15 seconds to read. Its...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    JavaSeis
    JavaSeis.org: Java based software for processing and storing petroleum industry seismic data.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    disser

    disser

    LaTeX classes and templates for typesetting dissertations in Russian

    LaTeX classes, Makefiles and document templates for typesetting dissertations in Russian
    Downloads: 4 This Week
    Last Update:
    See Project
  • 9
    LaTeX Reference Card Creator

    LaTeX Reference Card Creator

    A Makefile based build system for creating LaTeX reference cards

    LaTeX Reference Card Creator is a Makefile based build system for creating reference cards. LaTeX Reference Card Creator compiles content into PDF, DjVu, TEX DVI, HTML and PostScript output formats. A three column reference card will be created. Features include batch image format conversions, spell checking, broken link checking, automatic backups and .zip and .tar.gz distribution building. LaTeX Reference Card Creator provides many LaTeX examples which can be used to make a reference card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    XSLT syntax highlighting

    XSLT syntax highlighting

    Java based XSLT Processor extension for syntax highlighting

    Please note that project moved to GitHub: https://github.com/xmlark/xslthl This is an implementation of syntax highlighting as an extension module for XSLT processors (Xalan, Saxon), so if you have e.g. article about programming written in DocBook, code examples can be automatically syntax highlighted during the XSLT processing phase.
    Leader badge
    Downloads: 191 This Week
    Last Update:
    See Project
  • 11
    Budou

    Budou

    Budou is an auto organizer tool for beautiful line breaking in CJK

    Budou is a Python library developed by Google to improve web typography for CJK (Chinese, Japanese, Korean) languages by producing semantically meaningful line breaks. Unlike English, CJK scripts lack spaces or hyphenation cues, often resulting in awkward or unreadable text wrapping on web pages. Budou addresses this issue by segmenting sentences into logical lexical chunks and wrapping each chunk in non-breaking HTML <span> tags. These spans can be styled with CSS to ensure smooth, visually...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Fen2eps is a small console program that converts FEN (Forsyth Edwards Notation) strings to EPS (Encapsulated Postscript) files containing the chess board diagram.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ModularAdmin

    ModularAdmin

    Free Dashboard Theme Built On Bootstrap 4 | HTML Version

    ModularAdmin is an open source dashboard theme built in a modular way. That makes it easy to scale, modify and maintain. We use SASS as CSS preprocessor language. Main variables are defined in the src/_variables.scss folder. For making life easier we broke down styles into components, and on build we're just merging all .scss files together and processing it to the dist/css/app.css file. There are also different theme variations located in src/_themes/ folder, where you can change the main...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    XSH is a powerfull command-line XML editing tool/programming language in the manner of Unix shell interpreters and line-oriented text editors like ed which can be used either interactively or for batch-mode XML processing.
    Leader badge
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is the tool to concatenate PDF files. It can concatenate, extract, encrypt, decrypt, configure PDF files, convert image files to PDF. GUI version and CUI version are both available. iText.NET is iText porting on .NET Framework by J#. This library allows you to generate PDF, (X)HTML, XML, RTF files on Microsoft.NET Framework including ASP.NET.
    Leader badge
    Downloads: 44 This Week
    Last Update:
    See Project
  • 16
    BHM File Format
    A generic hierarchical file format. BHM can be seen as the binary equivalent of XML. It allows flexible hierarchies of any type of data, in a platform-independent way. BHM however trades XML's readability for compactness and speed of processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    iText®, a JAVA PDF library

    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Leader badge
    Downloads: 185 This Week
    Last Update:
    See Project
  • 18
    PDF-Shuffler
    PDF-Shuffler is a small python-gtk application, which helps the user to merge or split pdf documents and rotate, crop and rearrange their pages using an interactive and intuitive graphical interface. It is a frontend for python-pyPdf.
    Leader badge
    Downloads: 56 This Week
    Last Update:
    See Project
  • 19
    unfluff

    unfluff

    Automatically extract body content (and other cool stuff) from HTML

    unfluff is a Node.js library designed to automatically extract the main content from an HTML document — stripping away navigation bars, ads, footers and other boilerplate to leave you with the “body content”, metadata (title, author, date) and other useful fields. It’s a tool very much aimed at content-analysis, web scraping, building datasets, or repurposing article text for downstream processing (like machine-learning or summarization). The API is simple: you feed in raw HTML and it...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    pdf-bot

    pdf-bot

    A Node queue API for generating PDFs using headless Chrome

    pdf-bot is a Node.js microservice designed to automate the generation of PDF documents from web pages using headless Chrome. The project provides a queue-based API that allows developers to submit URLs for PDF generation, which are then processed asynchronously by the service. Once a document is generated, the system can notify external applications through webhooks, enabling integration with other backend systems or automation pipelines. The service is particularly useful for generating...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Create beautiful song books for your church or fellowship using this LaTeX package and related tools.
    Leader badge
    Downloads: 27 This Week
    Last Update:
    See Project
  • 22
    FireTeX: LaTeX Editor and Compiler

    FireTeX: LaTeX Editor and Compiler

    Edit Your files LaTeX and tex

    FireTeX, web based LaTeX editor complete, is a powerful, intuitive and stocked with useful functions for exporting the results in three useful formats. An editor with LaTeX compiler, highlight code, advanced search / replace and filesystem API HTML5. ======== Android app available on Play Store > https://play.google.com/store/apps/details?id=com.ulmdesign.ulmtex ======== Update 30.06.2017 Windows 7 and later and macOS 10.9 and later are supported. == Browser Extensions == Add-on...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    XML - JMS/JSON

    XML - JMS/JSON

    Converter utility between XML text and Javascript object / JSON text

    Convert XML text to Javascript object / JSON text (and vice versa). Can parse: elements, attributes, texts, comments, CData, DOCTYPE, XML declarations, and Processing Instructions. Usually, the output of XML attributes are stored in @attr, _atrr, $attr, or $ in order to avoid conflicting with the name of sub-elements. This library stores them in attributes, but most importantly, you can change this to whatever you like.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    jsondata

    jsondata

    Modular JSON by trees and branches, pointers and patches

    The 'jsondata' package provides for the modular in-memory processing of JSON data by trees, branches, pointers, and patches. The main interface classes are: - JSONData - Core for RFC7159 based data structures. Provides modular data components. - JSONDataSerializer - Core for RFC7159 based data persistence. Provides modular data serialization. - JSONPointer - RFC6901 for addressing by pointer paths.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    survol

    RDF-based framework monitoring business systems activity

    A Python agent and a web interface aiming to help the analysis and investigation of a legacy application. A set of machines, processes, databases, programs etc ... all communicating with each other, manipulating your data, and whose software architecture has become, with time, complicated, difficult to understand, and undocumented. Data are aggregated with an RDF inference engine, creating a global vision of the business information processing.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB