Showing 20 open source projects for "apache"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Context for your AI agents Icon
    Context for your AI agents

    Crawl websites, sync to vector databases, and power RAG applications. Pre-built integrations for LLM pipelines and AI assistants.

    Build data pipelines that feed your AI models and agents without managing infrastructure. Crawl any website, transform content, and push directly to your preferred vector store. Use 10,000+ tools for RAG applications, AI assistants, and real-time knowledge bases. Monitor site changes, trigger workflows on new data, and keep your AIs fed with fresh, structured information. Cloud-native, API-first, and free to start until you need to scale.
    Try for free
  • 1
    Apache Doris

    Apache Doris

    MPP-based interactive SQL data warehousing for reporting and analysis

    Apache Doris is a modern MPP analytical database product. It can provide sub-second queries and efficient real-time data analysis. With it's distributed architecture, up to 10PB level datasets will be well supported and easy to operate. Apache Doris can meet various data analysis demands, including history data reports, real-time data analysis, interactive data analysis, and exploratory data analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Greenplum Database

    Greenplum Database

    Massive parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 3
    DataChain

    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    Datachain enables multimodal API calls and local AI inferences to run in parallel over many samples as chained operations. The resulting datasets can be saved, versioned, and sent directly to PyTorch and TensorFlow for training. Datachain can persist features of Python objects returned by AI models, and enables vectorized analytical operations over them. The typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. Datachain...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    gravitino

    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Free and Open Source HR Software Icon
    Free and Open Source HR Software

    OrangeHRM provides a world-class HRIS experience and offers everything you and your team need to be that HR hero you know that you are.

    Give your HR team the tools they need to streamline administrative tasks, support employees, and make informed decisions with the OrangeHRM free and open source HR software.
    Learn More
  • 5
    Aestel

    Aestel

    Applications for data management

    "Information is data in action", and, consequently, having good quality data is essential. The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data. The reporting application provides a query builder and spreadsheet template designer. Both applications work with any relational data...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Snowplow Analytics

    Snowplow Analytics

    Enterprise-strength marketing and product analytics platform

    Snowplow is ideal for data teams who want to manage the collection and warehousing of data across all their platforms and products.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    TensorBase

    TensorBase

    TensorBase is a new big data warehousing with modern efforts

    TensorBase hopes the open source not become a copy game. TensorBase has a clear-cut opposition to fork communities, repeat wheels, or hack traffic for so-called reputations (like Github stars). After thoughts, we decided to temporarily leave the general data warehousing field. For people who want to learn how a database system can be built up, or how to apply modern Rust to the high-performance field, or embed a lightweight data analysis system into your own big one. You can still try, ask...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    ...This project is also known as "Aggregate Profiler" Resful API for this project is getting built as (Beta Version) https://sourceforge.net/projects/restful-api-for-osdq/ apache spark based data quality is getting built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    apache spark data pipeline osDQ

    apache spark data pipeline osDQ

    osDQ dedicated to create apache spark based data pipeline using JSON

    This is an offshoot project of open source data quality (osDQ) project https://sourceforge.net/projects/dataquality/ This sub project will create apache spark based data pipeline where JSON based metadata (file) will be used to run data processing , data pipeline , data quality and data preparation and data modeling features for big data. This uses java API of apache spark. It can run in local mode also. Get json example at https://github.com/arrahtech/osdq-spark How to run Unzip the zip file Windows : java -cp ....
    Downloads: 1 This Week
    Last Update:
    See Project
  • Say goodbye to broken revenue funnels and poor customer experiences Icon
    Say goodbye to broken revenue funnels and poor customer experiences

    Connect and coordinate your data, signals, tools, and people at every step of the customer journey.

    LeanData is a Demand Management solution that supports all go-to-market strategies such as account-based sales development, geo-based territories, and more. LeanData features a visual, intuitive workflow native to Salesforce that enables users to view their entire lead flow in one interface. LeanData allows users to access the drag-and-drop feature to route their leads. LeanData also features an algorithms match that uses multiple fields in Salesforce.
    Learn More
  • 10

    CrNiCKL

    CrNiCKL (chronicle) is a Java database for time series

    CrNiCKL (pronounced "chronicle") is a data manager written in Java handling large sets of heterogeneous time series. A simple schema system allows to confiture value types and time domains. CrNiCKL runs on top of SQL or NoSQL databases. Drivers for JDBC and MongoDB are available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    OGSA-DAI is a product that allows data resources, such as file collections, relational or XML databases, to be accessed, integrated and federated across the Internet.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    ServiceNow Data Mart Loader

    ServiceNow Data Mart Loader

    ServiceNow Data Mart Loader (a.k.a. ServiceNow Data Pump)

    The ServiceNow Data Mart Loader (a.k.a. ServiceNow DataPump) is a Java application which uses ServiceNow’s Direct Web Services (SOAP) API to extract meta-data and data from your Service-now ITSM instance. The application automatically creates and maintains tables in an Oracle or MySQL database. Please view the Wiki Quick Start Guide for instructions. NOTE: This project has been rehosted on github (see https://github.com/gflewis/sndml). The updated version has been tested with Geneva...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Scriptella is an open source ETL (Extract-Transform-Load) and script execution tool written in Java. Its primary focus is simplicity.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    ChronoDB

    ChronoDB is a data manager for generic time series.

    The project has been renamed CrNiCKL (chronicle) and moved to http://crnickl.sourceforge.net. ChronoDB is a data manager written in Java. It supports time series of any type. With its simple and powerful schema subsystem it takes charge of very large heterogeneous data sets. The software consist of an API and a generic implementation layer running on top of an SQL or a NoSQL system. Applications can mix objects from multiple ChronoDB databases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    REST API for XML and HTML schema-less query and recomposition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Longname:Operational data business express---- ODBExpress is a report suit for business intelligence, it includes reporting, analysis (OLAP),etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    An open source data version repository model based on true INSERT only logic. Contains high performance support for data versioning and history. Enables point-in-time query and CDC sourcing including mapping each change to a transaction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Spire is a print stream converter/manipulator. It can transform print streams from Metacode to Postscript, Postscript to Metacode. PCL support will be added soon. Spire is also capable of sorting documents (think postal sortation) and added barcodes.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Openminer, as a data mining engine, is developed on java for analysis of dataset with the methods of data mining. By making use of openminer, we could discovery the knowledge which interests us but hides in the raw data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Modest collection of OMBPlus scripts for Oracle Warehouse Builder. Utilities for creating repositories and targets, staging source tables, generating surrogate keys, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next