Showing 31 open source projects for "jpk data processing"

View related business solutions
  • Run Any Workload on Compute Engine VMs Icon
    Run Any Workload on Compute Engine VMs

    From dev environments to AI training, choose preset or custom VMs with 1–96 vCPUs and industry-leading 99.95% uptime SLA.

    Compute Engine delivers high-performance virtual machines for web apps, databases, containers, and AI workloads. Choose from general-purpose, compute-optimized, or GPU/TPU-accelerated machine types—or build custom VMs to match your exact specs. With live migration and automatic failover, your workloads stay online. New customers get $300 in free credits.
    Try Compute Engine
  • 99.99% Uptime for MySQL and PostgreSQL on Google Cloud Icon
    99.99% Uptime for MySQL and PostgreSQL on Google Cloud

    Enterprise Plus edition delivers sub-second maintenance downtime and 2x read/write performance. Built for critical apps.

    Cloud SQL Enterprise Plus gives you a 99.99% availability SLA with near-zero downtime maintenance—typically under 10 seconds. Get 2x better read/write performance, intelligent data caching, and 35 days of point-in-time recovery. Supports MySQL, PostgreSQL, and SQL Server with built-in vector search for gen AI apps. New customers get $300 in free credit.
    Try Cloud SQL Free
  • 1
    Logstash

    Logstash

    Centralize, transform and stash your data

    Logstash is a server-side data processing pipeline that dynamically ingests data from numerous sources, transforms it, and ships it to your favorite “stash” regardless of format or complexity. It supports and ingests data of all shapes, sizes and sources, dynamically transforms and prepares this data, and transports it to the output of your choice. Logstash is extensible, with over 200 plugins available to let you create and configure your pipeline how you choose.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 2
    Flink CDC

    Flink CDC

    Flink CDC is a streaming data integration tool

    Apache Flink CDC is a distributed data integration tool that captures data changes in real-time from various databases. It leverages Change Data Capture (CDC) technology to stream data changes into Apache Flink, enabling real-time analytics and data processing. Flink CDC simplifies data pipeline development with its declarative YAML configurations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    eXist-db

    eXist-db

    eXist Native XML Database and Application Platform

    eXist-db is an open-source, native XML database and application platform that provides a powerful environment for storing, querying, and managing XML documents. It is designed for complex data management needs, offering XQuery, XSLT, and RESTful web services for interacting with structured data.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 4
    KCloud‑Platform‑IoT

    KCloud‑Platform‑IoT

    KCloud-Platform-IoT

    KCloud-Platform-IoT is a comprehensive open-source IoT management platform built with Spring Cloud and Vue.js. It supports device registration, data collection, rule-based processing, and dashboard visualization. Designed for scalability and modularity, the platform is ideal for managing large IoT fleets in industrial or smart city environments.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Deploy Apps in Seconds with Cloud Run Icon
    Deploy Apps in Seconds with Cloud Run

    Host and run your applications without the need to manage infrastructure. Scales up from and down to zero automatically.

    Cloud Run is the fastest way to deploy containerized apps. Push your code in Go, Python, Node.js, Java, or any language and Cloud Run builds and deploys it automatically. Get fast autoscaling, pay only when your code runs, and skip the infrastructure headaches. Two million requests free per month. And new customers get $300 in free credit.
    Try Cloud Run Free
  • 5
    Addax

    Addax

    Addax is a versatile open-source ETL tool

    Addax is a data integration and ETL (Extract, Transform, Load) tool designed for high-performance data migration tasks. It simplifies the process of moving data between different systems and formats.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    PolarDB-X

    PolarDB-X

    PolarDB-X is a cloud native distributed SQL Database

    PolarDB-X is a cloud-native distributed SQL database designed to handle high concurrency, massive storage, and complex querying scenarios. It features a shared-nothing architecture that decouples computing from storage, providing scalability and flexibility for various applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink

    This project provides a set of source connectors for Apache Flink® directly ingesting changes coming from different databases using Change Data Capture(CDC). CDC Connectors for Apache Flink® is a set of source connectors for Apache Flink®, ingesting changes from different databases using change data capture (CDC). CDC Connectors for Apache Flink® integrates Debezium as the engine to capture data changes. So it can fully leverage the ability of Debezium. Supports reading database snapshot and continues to read transaction logs with exactly-once processing even failures happen. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite, which converts XML Schema 1.1 (hierarchical data model) to PostgreSQL DDL (relational data model) and supports XML data migration into PostgreSQL based on the XML Schema without defects on information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search utilizing the relational data model. File conversion from XML to CSV, TSV, or JSON is possible as well as mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9

    GETL

    ETL engine based on Groovy

    P.S. Dear friends. Repository migration to https://github.com/ascrus/getl . You can download jar file from this site or maven. GETL - based package in Groovy, which automates the work of loading and transforming data. His name is an acronym for «Groovy ETL». GETL is a set of libraries of pre-built classes and objects that can be used to solve problems unpacking, transform and load data into programs written in Groovy, or Java, as well as from any software that supports the work with...
    Downloads: 2 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    eXist-db

    eXist-db

    eXist-db is a feature rich Open Source native XML database

    eXist-db is a native XML database featuring efficient, index-based XQuery processing, extensions for keyword search, XUpdate support, XSLT support, XForms support, REST and tight integration with existing XML development tools. Moved to Github - https://www.github.com/exist-db/exist
    Downloads: 6 This Week
    Last Update:
    See Project
  • 12
    Heroic

    Heroic

    The Heroic Time Series Database

    Heroic is a scalable time-series database developed by Spotify, designed for real-time analytics and monitoring of large-scale systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ksqlDB

    ksqlDB

    The database purpose-built for stream processing applications

    Build applications that respond immediately to events. Craft materialized views over streams. Receive real-time push updates, or pull current state on demand. Seamlessly leverage your existing Apache Kafka® infrastructure to deploy stream-processing workloads and bring powerful new capabilities to your applications. Use a familiar, lightweight syntax to pack a powerful punch. Capture, process, and serve queries using only SQL. No other languages or services are required. ksqlDB enables you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    DataSink

    Take a JDBC ResultSet and stream it in one of the supported formats

    DataSink takes a JDBC ResultSet and streams it in in a format of your choice. You can as well zip the stream and send it over the network, if you want. DataSink currently implements the following table formats: DBF (the xBase file format), XHTML, and genericode. You can use it as an Ant task or directly from Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Mondrian is an OLAP (online analytical processing) engine written in Java. It reads from JDBC data sources, aggregates data in a memory cache, and implements the MDX language and the olap4j and XML/A APIs.
    Leader badge
    Downloads: 44 This Week
    Last Update:
    See Project
  • 16
    SQLMate

    SQLMate

    Rapidly generate a DAO for SQLite

    Complete source code, usage example, & a code-generated test case are included in the .jar file. ( See main.java for the usage / code generation example )
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    XMLPipeDB is a suite of tools for building relational databases from XML sources with minimal manual processing of the data. While the applicability is general, our motivation was to facilitate the management of biological data from different sources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    DBvolution

    DBvolution

    Java library to remove Object/Relation Impedance

    Now stored at https://github.com/gregorydgraham/DBvolution Available from Maven Central: nz.co.gregs.dbvolution Makes relational data easy and fast to use with semantic annotations and encapsulated datatypes. Simple declaration of a DB table: @DBTableName("mytable") public class MyTable extends DBRow { @DBColumn("numeric_code") @DBForeignKey(Code.class) private DBNumber numericCode = new DBNumber(); @DBColumn("uid_mytable") @DBPrimaryKey @DBAutoIncrement private DBNumber uidMyTable = new DBNumber(); } Easy Gets: List<MyTable> myRows = myDB.get(new MyTable()); MyTable row = myDB.getDBTable(new MyTable()).getByPrimaryKey(61176904); Natural and Outer Joins and Criteria: CarCo carCo = new CarCo(); carCo.name.isLiterally("GM"); List<DBQueryRow> results = myDB.get(carCo, new Marque()); dbQuery.addOptional(new CarModel()); Easy debugging: query.getSQLForQuery(); Simple processing: db.print(query.getAllRows()); List<Marque> marques = query.getAll
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    ProC 3.0

    smart Workflow Engine

    ProC 3.0 is a scientific workflow engine to build, manage and execute workflows (pipelines) in heterogeneous environments, supporting GRID and other means of parallel processing. It includes a data management component (DMC) to transparently access databases for storage of results and automatically adds metadata to track the processing of data products, so that at every time a full processing history is available. The software was developed and used within the ESA Planck satellite mission.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Tockit tries to build a framework for Conceptual Knowledge Processing (CKP) and Formal Concept Analysis (FCA) in Java, using a component-based approach, XML formats and a three-tier architecture.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    This project deals with developing a distributed infrastructure to handle and process sensor data including their provenance data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    TriggerMan

    TriggerMan

    An Audit Table and Trigger Management Tool for MySQL

    .... • After creating an audit table, it can create triggers on the original table which control how and when data is written to the audit table; for example, Before Update or After Insert. • Apply processing to multiple objects in a single operation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23

    Large Text File converter

    Java Based Heavy-duty utilitity to process large delimited text files

    TextZilla is a Multithreaded Java utility which can process huge size delimited text files to extract, convert, encode, decode, encrypt/decrypt text data from source and write it in desired output file or files. It provides fully extensible framework based on which Java classes can be created, for example it currently has MD5 conversion capability, based on same design classes for 3DES ,AES or any other Algo can be created. Another strength of this tool is in its configurability, it's...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24

    DBIO

    OLTP Database I/O Simulation Tool

    DBIO is a tool to simulate Online Transaction Processing Database, which can be used to model production envirment for I/O test. A standard configuration file contained in the software simulates TPCC-Like I/O workload, which is modeled from DB2 Database running TPCC-Like workload. DBIO can also do data consistency check after the I/O workload test.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    MXQuery is a low-footprint implementation of XQuery 1.0, XQuery Update 1.0, XQuery Fulltext 1.0 and XQuery Scripting 1.0 as well as a subset of XQuery 1.1 (windowing, try/catch). It provides extensions to do data stream processing/CEP and SOAP/REST
    Downloads: 1 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB