Showing 248 open source projects for "apache"

View related business solutions
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • Dun and Bradstreet Connect simplifies the complex burden of data management Icon
    Dun and Bradstreet Connect simplifies the complex burden of data management

    Our self-service data management platform enables your organization to gain a complete and accurate view of your accounts and contacts.

    The amount, speed, and types of data created in today’s world can be overwhelming. With D&B Connect, you can instantly benchmark, enrich, and monitor your data against the Dun & Bradstreet Data Cloud to help ensure your systems of record have trusted data to fuel growth.
    Learn More
  • 1
    Apache Airflow

    Apache Airflow

    Programmatically author, schedule, and monitor workflows

    Apache Airflow is a community-created platform for programmatically authoring, scheduling, and monitoring workflows. It’s much easier to do all these things when workloads are defined as code. They become more versionable, testable, maintainable and collaborative. With Airflow you can author workflows as directed acyclic graphs (DAGs) of tasks.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 2
    Superset

    Superset

    Apache Superset is a data visualization and data exploration platform

    Apache Superset is a modern data exploration and visualization platform. Superset is fast, lightweight, intuitive, and loaded with options that make it easy for users of all skill sets to explore and visualize their data, from simple line charts to highly detailed geospatial charts. Quickly and easily integrate and explore your data, using either our simple no-code viz builder or state-of-the-art SQL IDE.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 3
    tika-python

    tika-python

    Python binding to the Apache Tika™ REST services

    ...This is the only way to run python-tika without internet access. Without this set, the default is to check the tika version and pull latest every time from Apache.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    Caddy

    Caddy

    Powerful, enterprise-ready, open source web server w/ automatic HTTPS

    Caddy is a powerful, extensible, enterprise-ready server platform that uses TLS by default. Everything you would require in your infrastructure, from TLS certificate renewals and OCSP stapling, to reverse proxying and ingress, Caddy simplifies it all. Its modular architecture lets you do more with just a single static binary that compiles for any platform. Caddy is the only web server that uses HTTPS automatically and by default. It automatically renews TLS certificates, staples OCSP...
    Downloads: 23 This Week
    Last Update:
    See Project
  • Create and run cloud-based virtual machines. Icon
    Create and run cloud-based virtual machines.

    Secure and customizable compute service that lets you create and run virtual machines.

    Computing infrastructure in predefined or custom machine sizes to accelerate your cloud transformation. General purpose (E2, N1, N2, N2D) machines provide a good balance of price and performance. Compute optimized (C2) machines offer high-end vCPU performance for compute-intensive workloads. Memory optimized (M2) machines offer the highest memory and are great for in-memory databases. Accelerator optimized (A2) machines are based on the A100 GPU, for very demanding applications.
    Try for free
  • 5
    Dolphin Scheduler

    Dolphin Scheduler

    A distributed and extensible workflow scheduler platform

    Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available `out of the box`. Dedicated to solving the complex task dependencies in data processing, making the scheduler system out of the box for data processing.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    wasm-pack

    wasm-pack

    Your favorite rust -> wasm workflow tool!

    This tool seeks to be a one-stop-shop for building and working with rust-generated WebAssembly that you would like to interop with JavaScript, in the browser or with Node.js. wasm-pack helps you build rust-generated WebAssembly packages that you could publish to the npm registry, or otherwise use alongside any javascript packages in workflows that you already use, such as webpack. This project is a part of the rust-wasm group. You can find more info by visiting that repo! Currently,...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 7
    akello

    akello

    Healthcare infrastructure for absolutely everyone

    Akello is an open-source platform that simplifies healthcare service integration using microservices. It offers flexibility, scalability, and strong security for developers building healthcare infrastructures.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    SageMaker Spark Container

    SageMaker Spark Container

    Docker image used to run data processing workloads

    Apache Spark™ is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Prefect

    Prefect

    Prefect is a workflow orchestration framework

    Prefect is an open-source modern workflow orchestration tool for scheduling, monitoring, and managing data workflows and tasks. It enables Python-native pipeline definitions with robust retries, caching, observability, and a powerful UI—ideal for data engineering and ETL processes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Easy-to-use Business Software for the Waste Management Software Industry Icon
    Easy-to-use Business Software for the Waste Management Software Industry

    Increase efficiency, expedite accounts receivables, optimize routes, acquire new customers, & more!

    DOP Software’s mission is to streamline waste and recycling business’ processes by providing them with dynamic, comprehensive software and services that increase productivity and quality of performance.
    Learn More
  • 10
    Arkflow

    Arkflow

    High performance Rust stream processing engine

    Arkflow is a Rust-based framework for building reactive, event-driven data pipelines. Inspired by tools like Airflow and Dagster, it focuses on strong typing, modularity, and performance. Arkflow is ideal for developers who want a fast, extensible way to orchestrate workflows and data transformations in Rust.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    nextflow

    nextflow

    A DSL for data-driven computational pipelines

    Nextflow is a powerful workflow orchestration tool designed for the development and execution of data-intensive pipelines, especially in the field of bioinformatics. It enables reproducible and portable workflows that can run seamlessly across different environments including local, HPC clusters, and cloud platforms. With support for containers and software package managers, Nextflow makes it easy to define complex data processes using a concise DSL and manage dependencies efficiently.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Salt

    Salt

    Automate the management and configuration of infrastructures at scale

    Salt Bootstrap is a shell script that detects the target platform and selects the best installation method. (Supported Platforms) Your Salt master can manage itself, so a Salt minion is installed along with the Salt master. Before implementing Salt to manage any data center infrastructure it is important to understand a few things about Salt platform support including: What platforms will the Salt Master run on? What systems and infrastructure can be managed by a Salt Minion? Salt runs on...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Luigi

    Luigi

    Python module that helps you build complex pipelines of batch jobs

    Luigi is a Python (3.6, 3.7, 3.8, 3.9 tested) package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more. The purpose of Luigi is to address all the plumbing typically associated with long-running batch processes. You want to chain many tasks, automate them, and failures will happen. These tasks can be anything, but are typically long running things like Hadoop...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    Gardener

    Gardener

    Kubernetes-native system managing the full lifecycle of Kubernetes

    Kubernetes-native system managing the full lifecycle of conformant Kubernetes clusters as a service on Alicloud, AWS, Azure, GCP, OpenStack, EquinixMetal, vSphere, MetalStack, and Kubevirt with minimal TCO. Kubernetes is a cloud-native enabler built around the principles of a resilient, manageable, observable, highly automated, loosely coupled system. Gardener is a standard Kubernetes extension and adheres to the same concepts by design. The Gardener project is committed to fostering an open...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 15
    AWS Load Balancer Controller

    AWS Load Balancer Controller

    A Kubernetes controller for Elastic Load Balancers

    AWS Load Balancer Controller is a controller to help manage Elastic Load Balancers for a Kubernetes cluster. It satisfies Kubernetes Ingress resources by provisioning Application Load Balancers. It satisfies Kubernetes Service resources by provisioning Network Load Balancers. This project was formerly known as "AWS ALB Ingress Controller", we rebranded it to be "AWS Load Balancer Controller". AWS ALB Ingress Controller was originated by Ticketmaster and CoreOS as part of Ticketmaster's move...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 16
    Seata

    Seata

    An easy-to-use, high-performance distributed transaction solution

    Seata, or Simple Extensible Autonomous Transaction Architecture is an open source distributed transaction solution that brings high performance and easy-to-use distributed transaction services under a microservices architecture. It has 3 basic components: a Transaction Coordinator for maintaining the status of global and branch transactions; a Transaction Manager that defines the scope of global transaction; and a Resource Manager that manages resources being worked on by branch...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    TorchIO

    TorchIO

    Medical imaging toolkit for deep learning

    TorchIO is an open-source Python library for efficient loading, preprocessing, augmentation and patch-based sampling of 3D medical images in deep learning, following the design of PyTorch. It includes multiple intensity and spatial transforms for data augmentation and preprocessing. These transforms include typical computer vision operations such as random affine transformations and also domain-specific ones such as simulation of intensity artifacts due to MRI magnetic field inhomogeneity...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Infracost

    Infracost

    Cloud cost estimates for Terraform in pull requests

    Infracost scans for Terraform code changes and checks over 3 million prices to create a simple, understandable cost estimate before any resources are launched. Infracost integrates into CI/CD so everyone knows the cost impact of changes without leaving the workflow. Infracost integrates with Open Policy Agent, Sentinel, and Conftest, enabling DevOps teams to set best practices as policies. Infracost automatically creates detailed, shareable cost estimates which can be sent to clients and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Dolibarr ERP - CRM

    Dolibarr ERP - CRM

    Open source ERP and CRM web software for business

    Dolibarr ERP - CRM is an easy to use ERP and CRM open source software package (run with a web php server or as standalone software) for businesses, foundations or freelancers (prospect, invoicing, inventory, warehouse, order, shipment, POS, members for foundations, bank accounts...). Dolibarr is also available with auto-installers for users with no technical knowledge to install Dolibarr and all its prerequisites (Apache, Mysql, PHP) with just one package. Available platforms for such packages are: Windows, Debian, Ubuntu, Mint, Redhat, Fedora, OpenSuse, Mandriva, Mageia. Other platform can use the generic distribution. This is a modular product, than can be enhanced with tons of external modules (to provide you features not available by default) from https://www.dolistore.com Download Dolibarr ERP - CRM here or sign up for 15-day free trial of the cloud-based application with free updates and support at one of the Cloud providers https://saas.dolibarr.org/
    Leader badge
    Downloads: 5,086 This Week
    Last Update:
    See Project
  • 20
    Conductor

    Conductor

    Conductor is a microservices orchestration engine

    We built Conductor to help us orchestrate microservices-based process flows at Netflix with the following features, a distributed server ecosystem, which stores workflow state information efficiently. Allow creation of process/business flows in which each individual task can be implemented by the same/different microservices. Simple interface to connect workers, which execute the tasks in workflows. Full operational control over workflows with the ability to pause, resume, restart, retry and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    C# ECG Toolkit

    C# ECG Toolkit

    ECG Toolkit support for: SCP-ECG, DICOM, HL7 aECG, ISHNE & MUSE-XML

    C# ECG Toolkit is an open source software toolkit to convert, view and print electrocardiograms. The toolkit is developed using C# .NET Framework 2.0 and later (code also supports netstandard2.0). Support for ECG formats: SCP-ECG, DICOM, HL7 aECG, ISHNE, MUSE-XML and OmronECG.
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 22
    ResCarta

    ResCarta

    Archive your personal history

    ResCarta Toolkit offers an open source solution to creating, storing, viewing, and searching digital collections. Applications in the toolkit let users create and edit metadata, convert data to open standard ResCarta format, index and host collections.
    Leader badge
    Downloads: 7 This Week
    Last Update:
    See Project
  • 23

    FasRH

    Software de Gestão de Recursos Humanos

    Software de Gestão de Recursos Humanos versão beta integrado com excel e ponto Biométricos
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    WKFsuite

    WKFsuite

    WKFsuite - Quick and easy leave and vacation management

    # ITALIANO ## Funzionalità Principali - Gestione permessi e ferie aziendali - Database locale - dati restano in azienda - Accesso da PC e smartphone via WiFi aziendale - Multi-utente: Admin, Manager, Dipendente ## Versione FREE - Sistema completo gestione permessi - Dashboard e calendario permessi - Richieste con approvazione/rifiuto - Export PDF ## Versione PRO (€20 una tantum) - Email automatiche approvazioni/rifiuti - Grafici analytics avanzati - Supporto prioritario :...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Harvy Open Trouble Ticketing System

    Harvy Open Trouble Ticketing System

    Harvy is an enterprise-class, telco friendly trouble ticketing system

    Harvy is the younger brother of Kuwaiba Open Network Inventory. It aims to be an enterprise-class trouble ticketing system, focused on providing support for complex work flows and integration with other OSS (Operations Support Systems) and BSS (Business Support System) in the context of eTOM and Frameworx/NGOSS directives. Although it was born as a solution for small and mid-sized telecommunications operators, it can also be used in environments with standard networking infrastructure, such...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next