Showing 128 open source projects for "apache"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Let your crypto work for you

    Put idle assets to work with competitive interest rates, borrow without selling, and trade with precision. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    SageMaker Training Toolkit

    SageMaker Training Toolkit

    Train machine learning models within Docker containers

    Train machine learning models within a Docker container using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    NVIDIA Merlin

    NVIDIA Merlin

    Library providing end-to-end GPU-accelerated recommender systems

    NVIDIA Merlin is an open-source library that accelerates recommender systems on NVIDIA GPUs. The library enables data scientists, machine learning engineers, and researchers to build high-performing recommenders at scale. Merlin includes tools to address common feature engineering, training, and inference challenges. Each stage of the Merlin pipeline is optimized to support hundreds of terabytes of data, which is all accessible through easy-to-use APIs. For more information, see NVIDIA...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Luigi

    Luigi

    Python module that helps you build complex pipelines of batch jobs

    Luigi is a Python (3.6, 3.7, 3.8, 3.9 tested) package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more. The purpose of Luigi is to address all the plumbing typically associated with long-running batch processes. You want to chain many tasks, automate them, and failures will happen. These tasks can be anything, but are typically long running things like Hadoop...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    airda

    airda

    airda(Air Data Agent

    airda(Air Data Agent) is a multi-smart body for data analysis, capable of understanding data development and data analysis needs, understanding data, generating data-oriented queries, data visualization, machine learning and other tasks of SQL and Python codes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure Icon
    Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure

    Native application identity and user-based security for your Azure cloud

    Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.
    Get a free trial
  • 5
    Fondant

    Fondant

    Production-ready data processing made easy and shareable

    Fondant is a modular, pipeline-based framework designed to simplify the preparation of large-scale datasets for training machine learning models, especially foundation models. It offers an end-to-end system for ingesting raw data, applying transformations, filtering, and formatting outputs—all while remaining scalable and traceable. Fondant is designed with reproducibility in mind and supports containerized steps using Docker, making it easy to share and reuse data processing components....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Encord Active

    Encord Active

    The toolkit to test, validate, and evaluate your models and surface

    Encord Active is an open-source toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling to supercharge model performance. Encord Active has been designed as a all-in-one open source toolkit for improving your data quality and model performance. Use the intuitive UI to explore your data or access all the functionalities programmatically. Discover errors, outliers, and edge-cases within your data - all in one open source...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    akello

    akello

    Healthcare infrastructure for absolutely everyone

    Akello is an open-source platform that simplifies healthcare service integration using microservices. It offers flexibility, scalability, and strong security for developers building healthcare infrastructures.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    GMAT

    GMAT

    General Mission Analysis Tool

    The General Mission Analysis Tool (GMAT) is an open-source tool for space mission design and navigation. GMAT is developed by a team of NASA, private industry, and public and private contributors. The GMAT development team is pleased to announce the release of GMAT version R2026a. For a complete list of new features, compatibility changes, and bug fixes, see the R2026a Release Notes in the Users Guide.
    Leader badge
    Downloads: 1,061 This Week
    Last Update:
    See Project
  • 9
    KeyParaStocX

    KeyParaStocX

    Set styles to words and create a Table of Contents in a click

    KeyParaStocX (Keyword-based Paragraph Styling and Table of Contents eXtension) is a LibreOffice/Apache OpenOffice/OpenOffice.org extension that searches for the configured keywords in a text, changes their style and builds a Table of Contents for them, up to 7 levels. The keywords and their target styles can be configured by the users and used for every document they open. The extension integrates into Writer options and is independent of the operating system (should work on all). ...
    Downloads: 199 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    SnacTrac

    SnacTrac

    The cash-based snack bar manager!

    SnacTrac is a tool designed to simplify managing a cash-based snack bar. After creating an account with admin approval, customers can use SnacTrac as a self-service purchasing kiosk. The system automatically tracks customer tabs and inventory, maintains purchase history, and provides an admin control panel with configurable user debt limits and more!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Visphotgraph Software

    Visphotgraph Software

    Intelligent Precision for Vibration Detection

    Visphotgraph is a compact, automated IoT-based seismograph that accurately detects and records ground vibrations in real time. With cloud connectivity and instant data updates, it provides quick insights and reliable alerts to researchers, institutions, industries, and safety teams.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Forex Assistant

    Forex Assistant

    Manage accounts, analyze trades, track strategies & stats , News & Cal

    Forex Assistant is a comprehensive Windows desktop application designed to support forex traders in every aspect of their trading journey. From essential trading calculators and market insights to powerful account and trade management, it brings everything you need into one intuitive platform. 🔑 Main Features 🧮 Calculator Tools Profit/Loss Calculator Currency Converter Position Size Calculator Pips Calculator Margin Calculator 🌍 Market Insights Live Market News Economic...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    gravitino

    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    PipeRider

    PipeRider

    Code review for data in dbt

    PipeRider automatically compares your data to highlight the difference in impacted downstream dbt models so you can merge your Pull Requests with confidence. PipeRider can profile your dbt models and obtain information such as basic data composition, quantiles, histograms, text length, top categories, and more. PipeRider can integrate with dbt metrics and present the time-series data of metrics in the report. PipeRider generates a static HTML report each time it runs, which can be viewed...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    odd-collector

    odd-collector

    Open-source metadata collector based on ODD Specification

    ODD Collector is a lightweight service that gathers metadata from all your data sources. Push-client is a provider which sends information directly to the central repository of the Platform. ODDRN (Open Data Discovery Resource Name) is a unique resource name that identifies entities such as data sources, data entities, dataset fields etc. It is used to build lineage and update metadata.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    SageMaker Inference Toolkit

    SageMaker Inference Toolkit

    Serve machine learning models within a Docker container

    Serve machine learning models within a Docker container using Amazon SageMaker. Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. Once you have a trained model, you can include it in a Docker container that runs your inference code. A container provides an effectively isolated environment, ensuring a consistent runtime regardless of where the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    odd-collector-gcp

    odd-collector-gcp

    Open-source GCP metadata collector based on ODD Specification

    ODD Collector GCP is a lightweight service which gathers metadata from all your Google Cloud Platform data sources.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    text-dedup

    text-dedup

    All-in-one text de-duplication

    text-dedup is a Python library that enables efficient deduplication of large text corpora by using MinHash and other probabilistic techniques to detect near-duplicate content. This is especially useful for NLP tasks where duplicated training data can skew model performance. text-dedup scales to billions of documents and offers tools for chunking, hashing, and comparing text efficiently with low memory usage. It supports Jaccard similarity thresholding, parallel execution, and flexible...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Tributary

    Tributary

    Streaming reactive and dataflow graphs in Python

    Tributary is a library for constructing dataflow graphs in Python. Unlike many other DAG libraries in Python (airflow, luigi, prefect, dagster, dask, kedro, etc), tributary is not designed with data/etl pipelines or scheduling in mind. Instead, tributary is more similar to libraries like mdf, loman, pyungo, streamz, or pyfunctional, in that it is designed to be used as the implementation for a data model. One such example is the greeks library, which leverages tributary to build data models...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    TradeMaster

    TradeMaster

    TradeMaster is an open-source platform for quantitative trading

    TradeMaster is a first-of-its-kind, best-in-class open-source platform for quantitative trading (QT) empowered by reinforcement learning (RL), which covers the full pipeline for the design, implementation, evaluation and deployment of RL-based algorithms. TradeMaster is composed of 6 key modules: 1) multi-modality market data of different financial assets at multiple granularities; 2) whole data preprocessing pipeline; 3) a series of high-fidelity data-driven market simulators for mainstream...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Visdom

    Visdom

    A tool for creating, organizing, and sharing data visualizations

    A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy. Visdom aims to facilitate visualization of (remote) data with an emphasis on supporting scientific experimentation. Broadcast visualizations of plots, images, and text for yourself and your collaborators. Organize your visualization space programmatically or through the UI to create dashboards for live data, inspect results of experiments, or debug experimental code. Visdom has...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Bloxs

    Bloxs

    Build dashboards in Jupyter Notebook with numeric and chart boxes

    Bloxs is a simple Python package that helps you display information in an attractive way (formed in blocks). Perfect for building dashboards, reports and apps in the notebook.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    umi-project

    umi-project

    Just bring Linux for a better world

    U M I, pronounce "ou" "ème" "aie" to an approach of "you & I" expression, is meant to be a derivative of Ubuntu, a Linux distribution. U M I is a system that wants generalist, simple and tailored to your needs. M I perhaps as "Maths Infos", "Mission Impossible", "Micro Imagination", "Museum Incarnation", ..., "Mandela Ideologie", ...,"Magne Isapèt" :), ... ; but in reality M I for "Me Inside", inside Linux, inside Debian, inside Ubuntu. This project designates all logistics associated...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    AWS Step Functions Data Science SDK

    AWS Step Functions Data Science SDK

    For building machine learning (ML) workflows and pipelines on AWS

    The AWS Step Functions Data Science SDK is an open-source library that allows data scientists to easily create workflows that process and publish machine learning models using Amazon SageMaker and AWS Step Functions. You can create machine learning workflows in Python that orchestrate AWS infrastructure at scale, without having to provision and integrate the AWS services separately. The best way to quickly review how the AWS Step Functions Data Science SDK works is to review the related...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    BPYTOP

    BPYTOP

    Linux/OSX/FreeBSD resource monitor

    BPYTOP is a feature-rich, terminal-based resource monitor written in Python 3 that provides a highly visual overview of system performance. It displays real-time usage and statistics for CPU, memory, disks, network, and processes, with colorful graphs and widgets that update at configurable intervals. Users can drill into a process list, sort by various metrics, view tree hierarchies, and quickly spot heavy resource consumers. The tool is highly configurable through both an in-app options...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB