Showing 274 open source projects for "data processing"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    Web based cataloging and dedupe application. Highly optimized for processing journal articles. Reads MarcXML and dedupes records using the field 773 combined with a fuzzy search on the title. Written for bibnet.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Amadeus

    Amadeus

    Harmonious distributed data analysis in Rust

    Amadeus is a high-performance, distributed data processing framework written in Rust, designed to offer an ergonomic and safe alternative to tools like Apache Spark. It provides both streaming and batch capabilities, allowing users to work with real-time and historical data at scale. Thanks to Rust’s memory safety and zero-cost abstractions, Amadeus delivers performance gains while reducing the complexity and bugs common in large-scale data pipelines. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3

    Zarja

    Agent-based simulation toolkit

    It is an open source scientific library that is focused on multi-agent simulations on graphs. It prepares basic toolkit for agents, models, timers, graphs, nodes and edges. It is also accompanied with tools for data processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    MLDataUtils.jl

    MLDataUtils.jl

    Utility package for generating, loading, and processing ML datasets

    This package is designed to be the end-user facing front-end to all the data related functionality that is spread out across the JuliaML ecosystem. Most of the following sub-categories are covered by a single back-end package that is specialized on that specific problem. Consequently, if one of the following topics is of special interest to you, make sure to check out the corresponding documentation of that package.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    FLOGO

    FLOGO

    Simplify building efficient & modern serverless functions and apps

    Project Flogo is an ultra-light, Go-based open source ecosystem for building event-driven apps. Event-driven, you say? Yup, the notion of triggers and actions are leveraged to process incoming events. An action, a common interface, exposes key capabilities such as application integration, stream processing, etc. All capabilities within the Flogo Ecosystem have a few things in common, they all process events (in a manner suitable for the specific purpose) and they all implement the action...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    spatial-framework-for-hadoop

    spatial-framework-for-hadoop

    The Spatial Framework for Hadoop allows developers

    The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis. For tools, samples, and tutorials that use this framework, head over to GIS Tools for Hadoop. At the root level of this repository, you can build a single jar with everything in the framework using Apache Ant. Alternatively, you can build a jar at the root level of each framework component.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    SZT-bigdata

    SZT-bigdata

    SZT‑bigdata is an open source project

    SZT‑bigdata is an open-source project analyzing real Shenzhen metro (subway) card usage data using big‑data frameworks like Spark, Hadoop, Hive, Kafka, Flink, ClickHouse, HBase, and Elasticsearch. Aimed at exploring transit passenger flow patterns and system optimization using a variety of Scala-based technologies.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Luminance HDR

    Luminance HDR

    Complete solution for HDR photography

    Luminance HDR is a complete suite for HDR imaging workflow. It provides a wide range of functionalities, during both the fusion stage and the tonemapping stage. Its graphical user interface, based on Qt5, runs on a variety of platforms, such as Microsoft Windows, Mac OS X 10.9 and later and several Unix flavors (Linux, FreeBSD and others). Input images can be supplied in multiple formats, from JPEG to RAW files. In the same way, output can be saved in many different formats as well, from...
    Leader badge
    Downloads: 262 This Week
    Last Update:
    See Project
  • 9
    ArrayFire.jl

    ArrayFire.jl

    Julia wrapper for the ArrayFire library

    ArrayFire is a library for GPU and accelerated computing. ArrayFire.jl wraps the ArrayFire library for Julia, and provides a Julia interface. Install ArrayFire library: either download a binary from the official site, or you can build from source.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    geometry-api-java

    geometry-api-java

    The Esri Geometry API for Java enables developers to write apps

    The Esri Geometry API for Java can be used to enable spatial data processing in 3rd-party data-processing solutions. Developers of custom MapReduce-based applications for Hadoop can use this API for spatial processing of data in the Hadoop system. The API is also used by the Hive UDF’s and could be used by developers building geometry functions for 3rd-party applications such as Cassandra, HBase, Storm and many other Java-based “big data” applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Forecasting Best Practices

    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    ...Rather than creating implementations from scratch, we draw from existing state-of-the-art libraries and build additional utilities around processing and featuring the data, optimizing and evaluating models, and scaling up to the cloud. The examples and best practices are provided as Python Jupyter notebooks and R markdown files and a library of utility functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Data Science at the Command Line

    Data Science at the Command Line

    Data science at the command line

    ...To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools, useful whether you work with Windows, macOS, or Linux. You’ll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you’re comfortable processing data with Python or R, you’ll learn how to greatly improve your data science workflow by leveraging the command line’s power.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    Spectral Python

    A python module for hyperspectral image processing

    Spectral Python (SPy) is a python package for reading, viewing, manipulating, and classifying hyperspectral image (HSI) data. SPy includes functions for clustering, dimensionality reduction, supervised classification, and more.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    SCiDA Pro

    SCiDA Pro

    Program for solar cell production data analysis

    The purpose of the SCiDA Pro program is to help with processing solar cell production data. It has the following features: - Able to handle large data sets in a fast way (e.g. plotting 100k cell data takes a few seconds) - Easy data filtering - Easy generation of a data summary report - Extensive data plotting features - Cross-platform (Windows/Linux/MacOS) - Supports multiple languages
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    ksqlDB

    ksqlDB

    The database purpose-built for stream processing applications

    Build applications that respond immediately to events. Craft materialized views over streams. Receive real-time push updates, or pull current state on demand. Seamlessly leverage your existing Apache Kafka® infrastructure to deploy stream-processing workloads and bring powerful new capabilities to your applications. Use a familiar, lightweight syntax to pack a powerful punch. Capture, process, and serve queries using only SQL. No other languages or services are required. ksqlDB enables you...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    DSPatch

    DSPatch

    The Refreshingly Simple C++ Dataflow Framework

    Webite: http://flowbasedprogramming.com DSPatch, pronounced "dispatch", is a powerful C++ dataflow framework. DSPatch is not limited to any particular domain or data type, from reactive programming to stream processing, DSPatch's generic, object-oriented API allows you to create virtually any dataflow system imaginable. *See also:* DSPatcher ( https://github.com/MarcusTomlinson/DSPatcher ): A cross-platform graphical tool for building DSPatch circuits. DSPatchables ( https://github.com/MarcusTomlinson/DSPatchables ): A DSPatch component repository.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    JuliaFEM.jl

    JuliaFEM.jl

    The JuliaFEM software library is a framework

    The JuliaFEM software library is a framework that allows for the distributed processing of large Finite Element Models across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. The JuliaFEM software library is a framework that allows for the distributed processing of large Finite Element Models across clusters of computers using simple programming models. It is designed...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    Inviwo

    Inviwo

    Interactive Visualization Workshop

    Inviwo is a modern, open-source visualization framework designed for interactive visual data analysis and scientific computing. Built in C++ with a modular, extensible architecture, Inviwo combines a visual editor (for creating data pipelines) with a powerful runtime engine that supports real-time rendering, interaction, and GPU-accelerated processing. It’s widely used in scientific domains for building and sharing visualizations of complex data such as medical imaging, simulations, and machine learning models. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    nonechucks

    nonechucks

    Deal with bad samples in your dataset dynamically

    ...Or what if your dataset is a folder full of scanned PDFs that you have to OCRize, and then run a language detector on the resulting text, because you want only the ones that are in English? Or maybe you have an AlternateIndexSampler, and you want to be able to move to dataset[6] after dataset[4] fails while attempting to load! PyTorch's data processing module expects you to rid your dataset of any unwanted or invalid samples before you feed them into its pipeline, and provides no easy way to define a "fallback policy" in case such samples are encountered during dataset iteration.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    OpenFace

    A state-of-the-art facial behavior analysis toolkit

    OpenFace is an advanced facial behavior analysis toolkit intended for computer vision and machine learning researchers, those in the affective computing community, and those who are simply interested in creating interactive applications based on facial behavior analysis. The OpenFace toolkit is capable of performing several complex facial analysis tasks, including facial landmark detection, eye-gaze estimation, head pose estimation and facial action unit recognition. OpenFace is able to...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 21
    BDS

    BDS

    Blockchain data parsing and persisting results

    JD Cloud Blockchain Data Service (BDS) is a real-time data aggregating, analyzing, and visualization service for chain-like unstructured data from all kinds of 3rd party Blockchains. Splitter is the key module of Blockchain Data Service (BDS) and provides data analysis capability. Splitter is responsible for consuming blockchain data from message queue (kafka) and inserting data into persistent data storage services (relational database, data warehouse, etc.) for further processing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    TSA_CRAFT

    Automatic command line tool for TSA data analysis

    DSF is a high-throughput platform of TSA assay to screen various conditions that affect protein stability. To facilitate TSA data analysis, we developed an automatic tool "TSA-CRAFT". TSA-CRAFT was developed by integrating PERL script and Gnuplot. PERL scripts manage the entire workflow and data processing procedures of TSA data analysis. On the other hand, Gnuplot takes over the curve fitting and result presentation works. All analysed results are coded in an html file that can be easily displayed by a web browser.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23

    An introduction to Data Analysis in R

    A guide for learning the basic tools on data anaylisis with R

    An Introduction to Data Analysis in R [Book] A guide for learning the basic tools on data anaylisis: process, visualize and learn from your data using R programming. This repository holds the necessary data sets for the book "An introduction to Data Analysis in R", to be published by Springer series Use R!. The book can be purchased in XXX. The book is meant as an introductory guide to manipulate data sets in the Big Data paradigm. One of the main goals of this book is to take the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    AQUATONE

    AQUATONE

    A tool for domain flyovers

    Aquatone is a tool for visual inspection of websites across a large amount of hosts and is convenient for quickly gaining an overview of HTTP-based attack surface. Aquatone is designed to be as easy to use as possible and to integrate with your existing toolset with no or minimal glue. Aquatone is started by piping output of a command into the tool. It doesn't really care how the piped data looks as URLs, domains, and IP addresses will be extracted with regular expression pattern matching....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    LaueTools

    LaueTools

    open source python packages for X-ray MicroLaue Diffraction analysis

    LaueTools is an open-source project for white beam Laue x-ray microdiffraction data analysis including tools in image processing, peaks searching & indexing, crystal structure solving (orientation & strain) and data & grain mapping visualisation. Python 3 Code and new features are now at: https://gitlab.esrf.fr/micha/lauetools
    Downloads: 1 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB