Showing 44 open source projects for "dataflow"

View related business solutions
  • Our Free Plans just got better! | Auth0 by Okta Icon
    Our Free Plans just got better! | Auth0 by Okta

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your secuirty. Auth0 now, thank yourself later.
    Try free now
  • Bright Data - All in One Platform for Proxies and Web Scraping Icon
    Bright Data - All in One Platform for Proxies and Web Scraping

    Say goodbye to blocks, restrictions, and CAPTCHAs

    Bright Data offers the highest quality proxies with automated session management, IP rotation, and advanced web unlocking technology. Enjoy reliable, fast performance with easy integration, a user-friendly dashboard, and enterprise-grade scaling. Powered by ethically-sourced residential IPs for seamless web scraping.
    Get Started
  • 1
    TensorFlow

    TensorFlow

    TensorFlow is an open source library for machine learning

    ... expresses its computations as dataflow graphs, with each node in the graph representing an operation. Nodes take tensors—multidimensional arrays—as input and produce tensors as output. The framework allows for these algorithms to be run in C++ for better performance, while the multiple levels of APIs let the user determine how high or low they wish the level of abstraction to be in the models produced. Tensorflow can also be used for research and production with TensorFlow Extended.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 2
    ElasticJob

    ElasticJob

    Distributed scheduled job framework

    ElasticJob is a distributed scheduling solution consisting of two separate projects, ElasticJob-Lite and ElasticJob-Cloud. ElasticJob-Lite is a lightweight, decentralized solution that provides distributed task sharding services. ElasticJob-Cloud uses Mesos to manage and isolate resources. It uses a unified job API for each project. Developers only need code one time and can deploy at will. Support job sharding and high availability in distributed system. Scale out for throughput and...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 3
    XGBoost

    XGBoost

    Scalable and Flexible Gradient Boosting

    XGBoost is an optimized distributed gradient boosting library, designed to be scalable, flexible, portable and highly efficient. It supports regression, classification, ranking and user defined objectives, and runs on all major operating systems and cloud platforms. XGBoost works by implementing machine learning algorithms under the Gradient Boosting framework. It also offers parallel tree boosting (GBDT, GBRT or GBM) that can quickly and accurately solve many data science problems....
    Downloads: 3 This Week
    Last Update:
    See Project
  • 4
    Tributary

    Tributary

    Streaming reactive and dataflow graphs in Python

    Tributary is a library for constructing dataflow graphs in Python. Unlike many other DAG libraries in Python (airflow, luigi, prefect, dagster, dask, kedro, etc), tributary is not designed with data/etl pipelines or scheduling in mind. Instead, tributary is more similar to libraries like mdf, loman, pyungo, streamz, or pyfunctional, in that it is designed to be used as the implementation for a data model. One such example is the greeks library, which leverages tributary to build data models...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Empower decisions with IBM SPSS Statistics. Icon
    Empower decisions with IBM SPSS Statistics.

    For companies that need a powerful data platform

    IBM SPSS Statistics software is used by a variety of customers to solve industry-specific business issues to drive quality decision-making. Advanced statistical procedures and visualization can provide a robust, user friendly and an integrated platform to understand your data and solve complex business and research problems
    Learn More
  • 5
    Arroyo

    Arroyo

    Distributed stream processing engine in Rust

    Arroyo is a distributed stream processing engine written in Rust, designed to efficiently perform stateful computations on streams of data. Unlike traditional batch processing, streaming engines can operate on both bounded and unbounded sources, emitting results as soon as they are available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Hamilton DAGWorks

    Hamilton DAGWorks

    Helps scientists define testable, modular, self-documenting dataflow

    Hamilton is a lightweight Python library for directed acyclic graphs (DAGs) of data transformations. Your DAG is portable; it runs anywhere Python runs, whether it's a script, notebook, Airflow pipeline, FastAPI server, etc. Your DAG is expressive; Hamilton has extensive features to define and modify the execution of a DAG (e.g., data validation, experiment tracking, remote execution). To create a DAG, write regular Python functions that specify their dependencies with their parameters. As...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    WALA

    WALA

    Libraries for Analysis, with frontends for Java, Android, and JS

    The T. J. Watson Libraries for Analysis (WALA) provide static analysis capabilities for Java bytecode and related languages and for JavaScript. The system is licensed under the Eclipse Public License, which has been approved by the OSI (Open Source Initiative) as a fully certified open-source license. The initial WALA infrastructure was independently developed as part of the DOMO research project at the IBM T.J. Watson Research Center. In 2006, IBM donated the software to the community. The...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Apache Beam

    Apache Beam

    Unified programming model for Batch and Streaming

    Apache Beam is an open source, unified programming model to define both batch and streaming data-parallel processing pipelines, as well as certain language-specific SDKs for constructing pipelines and Runners. These pipelines are executed on one of Beam’s supported distributed processing back-ends, which include Apache Apex, Apache Flink, Apache Spark, and Google Cloud Dataflow. Beam is especially useful for Embarrassingly Parallel data processing tasks, and caters to the different needs...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Bytewax

    Bytewax

    Python Stream Processing

    ... processing engine that uses a dataflow computational model to provide parallelizable stream processing and event processing capabilities similar to Flink, Spark, and Kafka Streams. You can use Bytewax for a variety of workloads from moving data à la Kafka Connect style all the way to advanced online machine learning workloads. Bytewax is not limited to streaming applications but excels anywhere that data can be distributed at the input and output.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Manage Properties Better For Free Icon
    Manage Properties Better For Free

    For small to mid-sized landlords and property managers

    Innago is a free and easy-to-use property management solution. Whether you have 1 unit or 1000, student housing, or commercial properties, Innago is built for you. Our software is designed to save you time and money, so you can spend more time doing the things that matter most.
    Learn More
  • 10
    OctoSQL

    OctoSQL

    Join, analyse and transform data from multiple databases

    OctoSQL is predominantly a CLI tool which lets you query a plethora of databases and file formats using SQL through a unified interface, even do JOINs between them. (Ever needed to join a JSON file with a PostgreSQL table? OctoSQL can help you with that.) OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL. At the same time it's an easily extensible full-blown dataflow engine, and you can use it to add a SQL interface...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    BMDFM

    BMDFM

    Binary Modular DataFlow Machine (BMDFM)

    Binary Modular DataFlow Machine (BMDFM) is a software package that enables running an application in parallel on shared memory symmetric multiprocessing (SMP) computers using the multiple processors to speed up the execution of single applications. BMDFM automatically identifies and exploits parallelism due to the static and mainly dynamic scheduling of the dataflow instruction sequences derived from the formerly sequential program. The BMDFM dynamic scheduling subsystem performs a symmetric...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    DataKit

    DataKit

    Connect processes into powerful data pipelines

    Connect processes into powerful data pipelines with a simple git-like filesystem interface. DataKit is a tool to orchestrate applications using a Git-like dataflow. It revisits the UNIX pipeline concept, with a modern twist: streams of tree-structured data instead of raw text. DataKit allows you to define complex build pipelines over version-controlled data. DataKit is currently used as the coordination layer for HyperKit, the hypervisor component of Docker for Mac and Windows
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DSPatch

    DSPatch

    The Refreshingly Simple C++ Dataflow Framework

    Webite: http://flowbasedprogramming.com DSPatch, pronounced "dispatch", is a powerful C++ dataflow framework. DSPatch is not limited to any particular domain or data type, from reactive programming to stream processing, DSPatch's generic, object-oriented API allows you to create virtually any dataflow system imaginable. *See also:* DSPatcher ( https://github.com/MarcusTomlinson/DSPatcher ): A cross-platform graphical tool for building DSPatch circuits. DSPatchables ( https...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Python Taint

    Python Taint

    Static Analysis Tool for Detecting Security Vulnerabilities in Python

    Static analysis of Python web applications based on theoretical foundations (Control flow graphs, fixed point, dataflow analysis) Detect command injection, SSRF, SQL injection, XSS, directory traveral etc. A lot of customization is possible. For functions from builtins or libraries, e.g. url_for or os.path.join, use the -m option to specify whether or not they return tainted values given tainted inputs, by default this file is used.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Easy Machine Learning

    Easy Machine Learning

    Easy Machine Learning is a general-purpose dataflow-based system

    ... platform Easy Machine Learning presents a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real-world tasks. In the system, a learning task is formulated as a directed acyclic graph (DAG) in which each node represents an operation (e.g. a machine learning algorithm), and each edge represents the flow of the data from one node to its descendants.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Repatch

    Repatch

    Dispatch reducers

    Repatch is just a simplified Redux, that let you create actions more briefly by dispatching reducers directly. Redux has verbose action management. Most of redux projects do not need sctrict action administration. Action types, action creators and the reducer's action handlers are mutually assigned to each other. Repatch's purpose is to create actions briefly. The simplest way to keep the immutable action-controlled dataflow and define actions briefly is by dispatching pure functions...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Crevaya
    General purpose dataflow programming language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    autoCode4

    Synthesizing embedded control software from formal specifications

    autoCode4 is an engine that synthesizes controllers from formal specifications described under a subset of linear temporal logic (LTL). Importantly, it synthesizes synchronous dataflow controllers (in Lustre or in Ptolemy II form) and maintains requirement-to-code traceability. Such feature is mandated in developing safety-critical systems and are considered essential for specification validation or integrating manual implementation such as legacy code. The LTL specification captures...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ICon (Input Configurator) is a Java-based system for managing input devices and building advanced interaction techniques using a dataflow model. It contains a runtime engine, a rich library of input devices, several processing modules and a visual editor.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Magmi

    Magmi

    Magento Mass Importer

    This project is an attempt to provide a product importer for magento that can handle several thousands of products at a reasonable pace compared to magento dataflow. it may evolve to a dataflow alternative as new features are implemented. New : Wiki available at : http://wiki.magmi.org
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    DSPatcher

    DSPatcher

    Cross-Platform Graphical Tool for DSPatch

    Webite: http://flowbasedprogramming.com DSPatcher is a cross-platform graphical tool for building DSPatch circuits. The DSPatcher project is made up of 2 parts: * QtPatcher - A cross-platform circuit drawing library written in Qt C++. * DSPatcher - A graphical tool based on QtPatcher for building DSPatch circuits. *See also:* DSPatch ( https://github.com/MarcusTomlinson/DSPatch ): A powerful C++ dataflow framework. DSPatchables ( https://github.com/MarcusTomlinson...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22

    Sharpulator

    A multithreaded, test-driven electronics emulator for .net 4.0.

    Sharpulator is a multithreaded, test-driven electronics emulator written in C# 4.0. It's initial focus will be on emulating gaming consoles, with the GameBoy as a starting point. It will be written using modern software development methodologies, with the TPL Dataflow library as a backbone. It will simulate hardware functionality at a low level without sacrificing performance. The end goal for this project is to allow the ability to simulate hardware by simply dropping common components...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Marsyas (Music Analysis, Retrieval and Synthesis for Audio Signals) is a framework for developing systems for audio processing. It provides an general architecture for connecting audio, soundfiles, signal processing blocks and machine learning. Source code at SF is outdated! Marsyas is now hosted at GitHub: https://github.com/marsyas/marsyas Downloads are now provided at Bintray: https://bintray.com/marsyas
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    TransLucid is a multidimensional declarative language. Used for Cartesian programming, TransLucid is a generalization of the Lucid dataflow language.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Open RVC-CAL Compiler
    The Open RVC-CAL Compiler (Orcc) is a compiler under BSD license for the RVC-CAL dataflow programming language. Please consult the project homepage at http://orcc.sourceforge.net for more information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next