Showing 92 open source projects for "data processing"

  • 1
    CDC Connectors for Apache Flink

    CDC Connectors for Apache Flink® is a set of source connectors for Apache Flink® that ingest changes from different databases using change data capture (CDC). The connectors integrate Debezium as the engine for capturing data changes, so they can fully leverage Debezium's capabilities. They support reading a database snapshot and then continuing to read transaction logs with exactly-once processing, even when failures occur. ...
    Downloads: 3 This Week
  • 2
    MongoDB PHP Library

    The Official MongoDB PHP library

    ...Built on top of the underlying MongoDB PHP extension, the library handles serialization, connection pooling, and error handling in a way that feels natural in idiomatic PHP. It supports rich query expressions, bulk writes, change streams, transactions, and GridFS, making it suitable for everything from simple content apps to complex data processing services. The project also includes helpers for working with BSON types such as ObjectId, UTC datetime, and decimals, which helps bridge the gap between native PHP types and MongoDB’s storage model.
    Downloads: 9 This Week
  • 3
    ArangoDB-Community/pyArango

    Python Driver for ArangoDB with built-in validation

    PyArango is a Python driver for ArangoDB, a multi-model NoSQL database. It provides a Pythonic way to interact with ArangoDB, allowing developers to manage collections, execute AQL queries, and integrate ArangoDB's document, graph, and key-value storage models into Python applications.
    Downloads: 7 This Week
  • 4
    An innovative Open Source CEP (Complex Event Processing) engine. It implements the event stream processing as a library embeddable in C++ and Perl. You can think of the Complex Event Processing engine as an in-memory database driven by triggers, or a data-flow machine, or a spreadsheet on steroids (and without the GUI part).
    Downloads: 0 This Week
  • 5

    xsd2pgschema

    Relational database replication tool based on XML Schema

    xsd2pgschema is a Java application suite that converts XML Schema 1.1 (a hierarchical data model) to PostgreSQL DDL (a relational data model) and supports migrating XML data into PostgreSQL based on the XML Schema without loss of information content. It also supports full-text indexing via either Apache Lucene or Sphinx Search using the relational data model. File conversion from XML to CSV, TSV, or JSON is possible, as is mapping XML Schema to JSON Schema. Obtained PostgreSQL...
    Downloads: 5 This Week
  • 6
    CursusDB

    CursusDB is an open-source distributed in-memory database

    CursusDB is a time-series database built for high-performance analytics and data processing, optimized for handling large volumes of sequential data efficiently.
    Downloads: 8 This Week
  • 7
    LSJ-Server

    LSJ-Server-Terra (Java Multipurpose Server connected with MariaDB)

    ...LSJ-Server (Terra) is a high-performance, lightweight socket server built with Pure Java and MariaDB. It offers a robust backend for IT maintenance and MMORPG/RTS games without heavy framework overhead. [Features] Pure Java & MariaDB: Fast data processing & persistence. Real-time Monitoring: GUI for live logs and user management. Optimized Networking: Low-latency TCP/IP support. [How to Use] Config: Set MariaDB and Port (default: 10815). Control: Use 'Start' and 'Stop' buttons. Manage: Monitor logs/users and use the bottom command field. https://github.com/s1213123/LSJ-Server/
    Downloads: 0 This Week
  • 8
    RedisGraph

    A graph database as a Redis module

    A high-performance graph database module for Redis that enables fast graph processing and analytics using a query engine based on Cypher.
    Downloads: 8 This Week
  • 9

    GETL

    ETL engine based on Groovy

    Note: the repository has migrated to https://github.com/ascrus/getl . You can download the jar file from this site or from Maven. GETL is a Groovy-based package that automates loading and transforming data. Its name is an acronym for "Groovy ETL". GETL is a set of libraries of pre-built classes and objects that can be used to extract, transform, and load data in programs written in Groovy or Java, as well as from any software that supports the work with...
    Downloads: 2 This Week
  • 10
    Cetus

    Cetus is a high performance middleware that provides routing

    ...Cetus comes in two versions: read-write splitting and sharding (table sharding is a special form of database sharding). A lock-free multi-process design improves operating efficiency. Supports transparent backend connection pooling. Supports SQL read-write splitting. Supports database sharding. Supports distributed transaction processing. Supports batch insert operations. Supports conditional DISTINCT operations. Enhanced SQL route parsing and injection.
    Downloads: 0 This Week
  • 11
    FeatureBase

    A crazy fast analytical database, built on bitmaps

    FeatureBase is an open source, in-memory MLAP engine providing SQL support, real-time updates, and analytical processing for your growing data. A binary tree index improves the performance and efficiency of analytical queries by reducing I/O operations. Simple or complex, FeatureBase knocks it out in milliseconds. On-the-fly updates and deletes. Operate instantly on your freshest data without the need for preaggregation. Built on bitmaps, FeatureBase offers up to a 5-10X reduction in storage footprint and a 90% reduction in hardware footprint. ...
    Downloads: 0 This Week
  • 12
    Sqlite Index Blaster

    Create huge Sqlite indexes at breakneck speeds

    SQLite Blaster is an advanced SQLite extension that enhances database performance by enabling multi-threading, data compression, and memory optimizations. It is designed for applications that require fast local storage with improved query efficiency.
    Downloads: 0 This Week
  • 13
    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you...
    Downloads: 2 This Week
  • 14
    SnappyData

    Memory optimized analytics database, based on Apache Spark

    ...SnappyData delivers high throughput, low latency, and high concurrency for a unified analytics workload. By fusing an in-memory hybrid database inside Apache Spark, it provides analytic query processing, mutability/transactions, access to virtually all big data sources and stream processing all in one unified cluster. One common use case for SnappyData is to provide analytics at interactive speeds over large volumes of data with minimal or no pre-processing of the dataset. For instance, there is no need to often pre-aggregate/reduce or generate cubes over your large data sets for ad-hoc visual analytics. ...
    Downloads: 7 This Week
  • 15
    eXist-db

    eXist-db is a feature rich Open Source native XML database

    eXist-db is a native XML database featuring efficient, index-based XQuery processing, extensions for keyword search, XUpdate support, XSLT support, XForms support, REST, and tight integration with existing XML development tools. Moved to GitHub: https://www.github.com/exist-db/exist
    Downloads: 23 This Week
  • 16
    Datatables.AspNet

    Microsoft AspNet bindings and automatic parsing for jQuery DataTables

    Formerly known as DataTables.Mvc, this project started with small objectives around 2014, aiming to provide intermediate and experienced developers a tool to avoid the boring process of handling DataTables parameters. More than a year later after a full rewrite, we are now proud to support Asp.net MVC, WebApi, and Asp.Net Core (full .NET Core support). Unit-testing is a priority to avoid breaking your app and every stable release should provide better and wider test cases. Datatables.AspNet...
    Downloads: 1 This Week
  • 17
    Demo Scene

    Scripts and samples to support Confluent Demos, Talks, and Blogs

    Demo Scene is a collection of resources and examples provided by Confluent Inc. to demonstrate the capabilities of Apache Kafka and its ecosystem. It includes various demos showcasing real-time data streaming, processing, and integration patterns.
    Downloads: 0 This Week
  • 18
    Heroic

    The Heroic Time Series Database

    Heroic is a scalable time-series database developed by Spotify, designed for real-time analytics and monitoring of large-scale systems.
    Downloads: 0 This Week
  • 19
    ksqlDB

    The database purpose-built for stream processing applications

    Build applications that respond immediately to events. Craft materialized views over streams. Receive real-time push updates, or pull current state on demand. Seamlessly leverage your existing Apache Kafka® infrastructure to deploy stream-processing workloads and bring powerful new capabilities to your applications. Use a familiar, lightweight syntax to pack a powerful punch. Capture, process, and serve queries using only SQL. No other languages or services are required. ksqlDB enables you...
    Downloads: 0 This Week
  • 20
    PipelineDB

    High-performance time-series aggregation for PostgreSQL

    PipelineDB is a PostgreSQL extension for continuous aggregation and stream processing. It allows users to define continuous queries that automatically process incoming data streams, storing results in materialized views. Designed for real-time analytics, PipelineDB extends PostgreSQL with stream-oriented features while maintaining compatibility with standard SQL and tooling.
    Downloads: 13 This Week
  • 21
    Cosmos DB Spark

    Apache Spark Connector for Azure Cosmos DB

    ...The connector allows you to easily read from and write to Azure Cosmos DB via Apache Spark DataFrames in Python and Scala. It also allows you to easily create a lambda architecture for batch processing, stream processing, and a serving layer while being globally replicated and minimizing the latency involved in working with big data.
    Downloads: 0 This Week
  • 22

    DataSink

    Take a JDBC ResultSet and stream it in one of the supported formats

    DataSink takes a JDBC ResultSet and streams it in a format of your choice. You can also zip the stream and send it over the network if you want. DataSink currently implements the following table formats: DBF (the xBase file format), XHTML, and genericode. You can use it as an Ant task or directly from Java.
    Downloads: 0 This Week
  • 23
    Mondrian is an OLAP (online analytical processing) engine written in Java. It reads from JDBC data sources, aggregates data in a memory cache, and implements the MDX language and the olap4j and XML/A APIs.
    Downloads: 48 This Week
  • 24
    AvanceDB

    An in-memory database based on the CouchDB REST API

    AvanceDB is a high-performance, in-memory database designed to accelerate SQL-based applications. It uses advanced caching techniques to reduce database latency and improve query execution speed, making it ideal for real-time analytics and transactional workloads.
    Downloads: 0 This Week
  • 25
    SQLMate

    Rapidly generate a DAO for SQLite

    Complete source code, a usage example, and a code-generated test case are included in the .jar file. (See main.java for the usage and code generation example.)
    Downloads: 4 This Week