Showing 27 open source projects for "apache"

  • 1
    Apache Doris

    MPP-based interactive SQL data warehousing for reporting and analysis

    Apache Doris is a modern MPP analytical database product. It provides sub-second queries and efficient real-time data analysis. With its distributed architecture, datasets up to the 10 PB level are well supported and easy to operate. Apache Doris can meet various data analysis demands, including historical data reports, real-time data analysis, interactive data analysis, and exploratory data analysis.
    Downloads: 0 This Week
  • 2
    Greenplum Database

    Massive parallel data platform for analytics, machine learning and AI

    Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. With its unique cost-based query optimizer designed for large-scale data workloads, Greenplum scales interactive and batch-mode analytics to large datasets in the petabytes without degrading query performance and throughput. Based on PostgreSQL, Greenplum provides you with more control over the software you deploy, reducing vendor...
    Downloads: 21 This Week
  • 3
    DataChain

    AI-data warehouse to enrich, transform and analyze unstructured data

    Datachain enables multimodal API calls and local AI inferences to run in parallel over many samples as chained operations. The resulting datasets can be saved, versioned, and sent directly to PyTorch and TensorFlow for training. Datachain can persist features of Python objects returned by AI models, and enables vectorized analytical operations over them. The typical use cases are data curation, LLM analytics and validation, image segmentation, pose detection, and GenAI alignment. Datachain...
    Downloads: 2 This Week
  • 4
    gravitino

    Unified metadata lake for data & AI assets.

    Apache Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages metadata directly in different sources, types, and regions, providing users with unified metadata access for data and AI assets.
    Downloads: 0 This Week
  • 5
    Aestel

    Applications for data management

    "Information is data in action", and, consequently, having good quality data is essential. The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data. The reporting application provides a query builder and spreadsheet template designer. Both applications work with any relational data...
    Downloads: 0 This Week
  • 6
    Snowplow Analytics

    Enterprise-strength marketing and product analytics platform

    Snowplow is ideal for data teams who want to manage the collection and warehousing of data across all their platforms and products.
    Downloads: 1 This Week
  • 7
    TensorBase

    TensorBase is a new big data warehouse built with modern efforts

    TensorBase hopes that open source does not become a copy game. TensorBase is clearly opposed to forking communities, reinventing wheels, or hacking traffic for so-called reputation (like GitHub stars). After some thought, we decided to temporarily leave the general data warehousing field. People who want to learn how a database system can be built, how to apply modern Rust to the high-performance field, or how to embed a lightweight data analysis system into a larger one can still try, ask...
    Downloads: 0 This Week
  • 8
    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    ...This project is also known as "Aggregate Profiler". A RESTful API for this project is being built (beta version) at https://sourceforge.net/projects/restful-api-for-osdq/ and Apache Spark based data quality is being built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 2 This Week
  • 9
    Beigesoft Enterprise Information System

    JEE-based accounting web application.

    Beigesoft™ EIS is free accounting software. It is JEE web-based software that works anywhere: on MS Windows, Mac, Linux and Android. This is the first version of the successor to Beige Accounting All In One. The main goal was achieved: it works noticeably faster than its predecessor on weak devices such as smartphones. It works as: * Standalone A-Jetty web-app for standard Java (MS Windows, Mac, Nix...). * Standalone A-Jetty web-app for Android (use "Beigesoft...
    Downloads: 0 This Week
  • 10
    apache spark data pipeline osDQ

    osDQ subproject dedicated to creating Apache Spark based data pipelines using JSON

    This is an offshoot of the open source data quality (osDQ) project (https://sourceforge.net/projects/dataquality/). This subproject will create an Apache Spark based data pipeline in which JSON-based metadata (a file) is used to run data processing, data pipeline, data quality, data preparation, and data modeling features for big data. It uses the Java API of Apache Spark and can also run in local mode. Get a JSON example at https://github.com/arrahtech/osdq-spark How to run: unzip the zip file. Windows: java -cp ....
    Downloads: 1 This Week
  • 11

    CrNiCKL

    CrNiCKL (chronicle) is a Java database for time series

    CrNiCKL (pronounced "chronicle") is a data manager written in Java that handles large sets of heterogeneous time series. A simple schema system allows configuring value types and time domains. CrNiCKL runs on top of SQL or NoSQL databases. Drivers for JDBC and MongoDB are available.
    Downloads: 0 This Week
  • 12
    OpenSearchServer Extractor

    A RESTful/JSON web service for text and metadata extraction

    An open source RESTful web service for text and metadata extraction and analysis. oss-text-extractor supports various binary formats: word processor (doc, docx, odt, rtf), spreadsheet (xls, xlsx, ods), presentation (ppt, pptx, odp), publishing (pdf, pub), web (rss, html/xhtml), media (audio, images), and others (vsd, text).
    Downloads: 0 This Week
  • 13

    Data Migrator for Oracle

    Migrate/Copy your data between Oracle database and 13 major DBs.

    Command line data Copy/Migration tool for Oracle. Supports Oracle 7.3, Oracle 8i, Oracle 9i, Oracle 10G, Oracle 11G and 13 major databases. 1. Exadata 2. Sybase ASE 3. Informix Innovator C 4. Sybase SQL Anywhere 5. DB2 UDB 6. CSV 7. SQLServer 8. MariaDB 9. Sybase IQ 10. PostgreSQL 11. MySQL 12. Informix IDS 13. TimesTen
    Downloads: 0 This Week
  • 14

    Data Migration Tools for RDBMS

    DataMigrator for 14 major databases

    Touch and go Windows command line data migration tool for 14 databases: 1. Sybase ASE 2. Informix Innovator C 3. Sybase SQL Anywhere 4. DB2 UDB 5. SQLServer 6. MariaDB 7. Sybase IQ 8. PostgreSQL 9. MySQL 10. Informix IDS 11. TimesTen 12. Oracle 13. SQLite 14. Exadata
    Downloads: 0 This Week
  • 15

    Data Spooler For Oracle #SaveUkraine

    Simplified turbo spooler for Oracle.

    #SaveUkraine #StopRussia #FreeUkraine #StopPutin #CrimeaIsUkraine #UnitedForUkraine #RussiaInvadedUkraine Exports/Spools scalar data on disk for a given Oracle table. Turbo mode spools 5x faster.
    Downloads: 0 This Week
  • 16

    PDF*Merger for Windows

    Merge/concatenate PDF files into one PDF file

    Merge your PDF files for upload to a reporting engine or for other needs. Command line, win32. Written in Python and compiled with PyInstaller.
    Downloads: 0 This Week
  • 17

    CSV*Loader for Oracle

    Simplified CSV turbo loader to Oracle

    Tired of writing control files? No problem! CSV*Loader will generate a control file for SQL*Loader. Too slow? No problem! CSV*Loader turbo mode may load data into your Oracle database 10x faster than your good old Perl::DBI script.
    Downloads: 0 This Week
  • 18
    OGSA-DAI is a product that allows data resources, such as file collections, relational or XML databases, to be accessed, integrated and federated across the Internet.
    Downloads: 0 This Week
  • 19
    ServiceNow Data Mart Loader

    ServiceNow Data Mart Loader (a.k.a. ServiceNow Data Pump)

    The ServiceNow Data Mart Loader (a.k.a. ServiceNow DataPump) is a Java application which uses ServiceNow’s Direct Web Services (SOAP) API to extract meta-data and data from your Service-now ITSM instance. The application automatically creates and maintains tables in an Oracle or MySQL database. Please view the Wiki Quick Start Guide for instructions. NOTE: This project has been rehosted on github (see https://github.com/gflewis/sndml). The updated version has been tested with Geneva...
    Downloads: 0 This Week
  • 20
    Scriptella is an open source ETL (Extract-Transform-Load) and script execution tool written in Java. Its primary focus is simplicity.
    Downloads: 0 This Week
  • 21

    ChronoDB

    ChronoDB is a data manager for generic time series.

    The project has been renamed CrNiCKL (chronicle) and moved to http://crnickl.sourceforge.net. ChronoDB is a data manager written in Java. It supports time series of any type. With its simple and powerful schema subsystem it takes charge of very large heterogeneous data sets. The software consists of an API and a generic implementation layer running on top of an SQL or a NoSQL system. Applications can mix objects from multiple ChronoDB databases.
    Downloads: 0 This Week
  • 22
    Long name: Operational Data Business Express. ODBExpress is a reporting suite for business intelligence; it includes reporting, analysis (OLAP), etc.
    Downloads: 0 This Week
  • 23
    An open source data version repository model based on true INSERT only logic. Contains high performance support for data versioning and history. Enables point-in-time query and CDC sourcing including mapping each change to a transaction.
    Downloads: 0 This Week
  • 24
    Spire is a print stream converter/manipulator. It can transform print streams from Metacode to PostScript and from PostScript to Metacode. PCL support will be added soon. Spire is also capable of sorting documents (think postal sortation) and adding barcodes.
    Downloads: 0 This Week
  • 25
    Openminer is a data mining engine developed in Java for analyzing datasets with data mining methods. Using Openminer, one can discover knowledge that is of interest but hidden in the raw data.
    Downloads: 0 This Week