Showing 279 open source projects for "apache"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Automate contact and company data extraction Icon
    Automate contact and company data extraction

    Build lead generation pipelines that pull emails, phone numbers, and company details from directories, maps, social platforms. Full API access.

    Generate leads at scale without building or maintaining scrapers. Use 10,000+ ready-made tools that handle authentication, pagination, and anti-bot protection. Pull data from business directories, social profiles, and public sources, then export to your CRM or database via API. Schedule recurring extractions, enrich existing datasets, and integrate with your workflows.
    Explore Apify Store
  • 1
    Amadeus

    Amadeus

    Harmonious distributed data analysis in Rust

    Amadeus is a high-performance, distributed data processing framework written in Rust, designed to offer an ergonomic and safe alternative to tools like Apache Spark. It provides both streaming and batch capabilities, allowing users to work with real-time and historical data at scale. Thanks to Rust’s memory safety and zero-cost abstractions, Amadeus delivers performance gains while reducing the complexity and bugs common in large-scale data pipelines. It emphasizes developer productivity through a fluent, expressive API and makes it easier to build composable and reliable data transformation pipelines without sacrificing speed or safety.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Kale

    Kale

    Kubeflow’s superfood for Data Scientists

    KALE (Kubeflow Automated pipeLines Engine) is a project that aims at simplifying the Data Science experience of deploying Kubeflow Pipelines workflows. Kubeflow is a great platform for orchestrating complex workflows on top Kubernetes and Kubeflow Pipeline provides the mean to create reusable components that can be executed as part of workflows. The self-service nature of Kubeflow make it extremely appealing for Data Science use, at it provides an easy access to advanced distributed jobs...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Web based cataloging and dedupe application. Highly optimized for processing journal articles. Reads MarcXML and dedupes records using the field 773 combined with a fuzzy search on the title. Written for bibnet.org
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Talend Spatial Module (aka Spatial Data Integrator or SDI) is an ETL tool for geospatial. Based on Talend Open Studio, input, output and transform geocomponents are available. IO components read/write GIS formats(eg.PostGIS, GeoRSS). Transformers all
    Downloads: 6 This Week
    Last Update:
    See Project
  • Total Network Visibility for Network Engineers and IT Managers Icon
    Total Network Visibility for Network Engineers and IT Managers

    Network monitoring and troubleshooting is hard. TotalView makes it easy.

    This means every device on your network, and every interface on every device is automatically analyzed for performance, errors, QoS, and configuration.
    Learn More
  • 5
    jnotebook_reader

    jnotebook_reader

    An awesome viewer to browse and render Jupyter Notebooks

    An awesome viewer to browse and render Jupyter Notebooks from local, Amazon S3, Google Cloud Storage, or MinIO. Browse and render Jupyter Notebooks from local, Amazon S3, Google Cloud Storage, or MinIO. Register and access multiple directories(or buckets) at the same time. Show and allow to config ToC(Table of Contents) included in your Jupyter Notebook. Hide/show all codes in your Jupyter Notebook interactively. Generate a permanent link about your Jupyter Notebook. Download your Jupyter...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    spatial-framework-for-hadoop

    spatial-framework-for-hadoop

    The Spatial Framework for Hadoop allows developers

    ...For tools, samples, and tutorials that use this framework, head over to GIS Tools for Hadoop. At the root level of this repository, you can build a single jar with everything in the framework using Apache Ant. Alternatively, you can build a jar at the root level of each framework component. Custom MapReduce jobs that use the Esri Geometry API require that the developer has authored the job, (referencing the com.esri.geometry.* classes), and deployed the job Jar file to the Hadoop system, prior to the ArcGIS user submitting the workflow file.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Davinci

    Davinci

    Davinci is a DVsaaS (Data Visualization as a Service) Platform

    Davinci is oriented towards product managers, business people, data engineers, data analysts, data scientists, etc. It aims to provide a one-stop data visualization solution, which could be both independently used as a public cloud/private cloud and integrated into third-party systems as plugin. A simple configuration on Davinci UI can meet multiple visualization requirements. It also supports other visualization features like advanced interaction, industry analysis, pattern searching,...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    OpenFrames

    OpenFrames

    Real-time interactive 3D graphics API for scientific simulations

    OpenFrames has moved its primary development repository to GitHub! Everything else will follow. Get it at https://github.com/ravidavi/OpenFrames/wiki OpenFrames is an Application Programming Interface (API) that allows developers to provides the ability to add interactive 3D graphics to any scientific simulation. A simulation developer can use OpenFrames to specify what they want to visualize, without having to know any details of computer graphics programming. OpenFrames is currently...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    ...This project is also known as "Aggregate Profiler" Resful API for this project is getting built as (Beta Version) https://sourceforge.net/projects/restful-api-for-osdq/ apache spark based data quality is getting built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 2 This Week
    Last Update:
    See Project
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
    Learn More
  • 10
    Amazon SageMaker Examples

    Amazon SageMaker Examples

    Jupyter notebooks that demonstrate how to build models using SageMaker

    Welcome to Amazon SageMaker. This projects highlights example Jupyter notebooks for a variety of machine learning use cases that you can run in SageMaker. If you’re new to SageMaker we recommend starting with more feature-rich SageMaker Studio. It uses the familiar JupyterLab interface and has seamless integration with a variety of deep learning and data science environments and scalable compute resources for training, inference, and other ML operations. Studio offers teams and companies...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    geometry-api-java

    geometry-api-java

    The Esri Geometry API for Java enables developers to write apps

    The Esri Geometry API for Java can be used to enable spatial data processing in 3rd-party data-processing solutions. Developers of custom MapReduce-based applications for Hadoop can use this API for spatial processing of data in the Hadoop system. The API is also used by the Hive UDF’s and could be used by developers building geometry functions for 3rd-party applications such as Cassandra, HBase, Storm and many other Java-based “big data” applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ClimateMachine.jl

    ClimateMachine.jl

    Earth System Model that automatically learns from data

    The ClimateMachine is a software package that models the evolution of the Earth system over weeks to centuries. The ClimateMachine solves three-dimensional partial differential equations for the distributions of water, momentum, energy, and tracers such as carbon in the atmosphere, oceans, and on land. The ClimateMachine will harness a wide range of Earth observations and data generated computationally to predict the evolution of Earth’s climate and features such as droughts, rainfall...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    WhyLogs Java Library

    WhyLogs Java Library

    Profile and monitor your ML data pipeline end-to-end

    This is a Java implementation of WhyLogs, with support for Apache Spark integration for large scale datasets. Understanding the properties of data as it moves through applications is essential to keeping your ML/AI pipeline stable and improving your user experience, whether your pipeline is built for production or experimentation. WhyLogs is an open source statistical logging library that allows data science and ML teams to effortlessly profile ML/AI pipelines and applications, producing log files that can be used for monitoring, alerts, analytics, and error analysis. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Optimus

    Optimus

    Agile Data Preparation Workflows made easy with Pandas

    Easily write code to clean, transform, explore and visualize data using Python. Process using a simple API, making it easy to use for newcomers. More than 100 functions to handle strings, process dates, urls and emails. Easily plot data from any size. Out-of-box functions to explore and fix data quality. Use the same code to process your data in your laptop or in a remote cluster of GPUs.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    StellarGraph

    StellarGraph

    Machine Learning on Graphs

    StellarGraph is a Python library for machine learning on graphs and networks. The StellarGraph library offers state-of-the-art algorithms for graph machine learning, making it easy to discover patterns and answer questions about graph-structured data. It can solve many machine learning tasks. Graph-structured data represent entities as nodes (or vertices) and relationships between them as edges (or links), and can include data associated with either as attributes. For example, a graph can...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SageMaker Containers

    SageMaker Containers

    Create SageMaker-compatible Docker containers

    Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and reliable training process. The SageMaker Training Toolkit can be easily added to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Crystalsim -  XRD hkl simulation

    Crystalsim - XRD hkl simulation

    X-ray diffraction (XRD) analysis for hkl simulation of any crystal.

    Crystalsim is a simple freeware program with a neat graphical user interface for X-ray diffraction (XRD) data analysis . It can simulates all possible {hkl} planes data for the selected crystal. Crystallographic Information File (.cif) can also be used. Analyze both powder diffraction and single crystal data . Indexed at International Union of Crystallography (IUCR). Crystalline lattice parameters such as ‘a’, ‘b’, ‘c’ as well as interfacial angles such as alpha, beta,...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18

    Custom Apache Big data Distribution

    A Custom Apache Distribution including Spark and Hadoop, for Windows.

    This Distribution has been customized to work out of the box. So, just download it, and unzip it. Set the Path variables for bin folders, HADOOP_HOME, SPARK_HOME, and JAVA_HOME. That's it..! use Hadoop and Spark natively on Windows.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    ksqlDB

    ksqlDB

    The database purpose-built for stream processing applications

    Build applications that respond immediately to events. Craft materialized views over streams. Receive real-time push updates, or pull current state on demand. Seamlessly leverage your existing Apache Kafka® infrastructure to deploy stream-processing workloads and bring powerful new capabilities to your applications. Use a familiar, lightweight syntax to pack a powerful punch. Capture, process, and serve queries using only SQL. No other languages or services are required. ksqlDB enables you to build event streaming applications leveraging your familiarity with relational databases. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Semiotic

    Semiotic

    A data visualization framework combining React & D3

    Semiotic is a data visualization framework combining React & D3. It provides three types of frames XYFrame, OrdinalFrame, NetworkFrame, to deploy a wide variety of charts. XY data i.e. line charts and scatterplots. Categorical data i.e. bar charts, violin plots, parallel coordinates. Topological and network data i.e. flow diagrams, network visualization, and hierarchical views. A guide for creating a line chart, timeseries, difference line, and line percents using XYFrame along with hover...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    inMap

    inMap

    Rich layers, better user experience, big data geographic visualization

    inMap is a big data visualization library based on Baidu Map. It focuses on the display of scatter, heat map, grid, and aggregation in the direction of big data. It is committed to making big data visualization easy to use.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Beigesoft Enterprise Information System

    Beigesoft Enterprise Information System

    JEE based Accounting web-application.

    Beigesoft™ EIS is free accounting software. This is JEE-WEB-based software that works anywhere - on MS Windows, Mac, Linux and Android. This is the first version of the successor to Beige Accounting All In One. The main goal was successfully achieved, it works noticeably faster on weak devices such as a smartphone than its predecessor. It works as: * Standalone A-Jetty web-app for standard Java (Ms Windows, Mac, Nix...). * Standalone A-Jetty web-app for Android (use "Beigesoft...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    BDS

    BDS

    Blockchain data parsing and persisting results

    JD Cloud Blockchain Data Service (BDS) is a real-time data aggregating, analyzing, and visualization service for chain-like unstructured data from all kinds of 3rd party Blockchains. Splitter is the key module of Blockchain Data Service (BDS) and provides data analysis capability. Splitter is responsible for consuming blockchain data from message queue (kafka) and inserting data into persistent data storage services (relational database, data warehouse, etc.) for further processing. Before...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    This is going to be a Java Swing Application that will be used as a quick reference for Java API names. It will feature incremental search and filtering so that searching efficiency is improved. It will be very simple and a good way to statistically analyze java api's.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Facets

    Facets

    Visualizations for machine learning datasets

    The power of machine learning comes from its ability to learn patterns from large amounts of data. Understanding your data is critical to building a powerful machine learning system. Facets contains two robust visualizations to aid in understanding and analyzing machine learning datasets. Get a sense of the shape of each feature of your dataset using Facets Overview, or explore individual observations using Facets Dive. Explore Facets Overview and Facets Dive on the UCI Census Income...
    Downloads: 0 This Week
    Last Update:
    See Project