Showing 6127 open source projects for "data"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    OpenDataLoader PDF

    OpenDataLoader PDF

    PDF Parser for AI-ready data. Automate PDF accessibility

    OpenDataLoader PDF is an open-source document processing system designed to convert complex PDF files into structured, AI-ready formats such as Markdown, JSON, and HTML while preserving layout, hierarchy, and semantic meaning. It focuses on enabling downstream use cases like retrieval-augmented generation (RAG), knowledge extraction, and document intelligence pipelines by maintaining accurate reading order and spatial metadata through bounding boxes. The tool combines deterministic parsing...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2
    fess

    fess

    Open source enterprise search server for websites, files, and data

    Fess is an open source enterprise search server designed to provide powerful full-text search capabilities across multiple data sources. It enables organizations to quickly deploy a scalable search environment without requiring deep knowledge of underlying search technologies. Fess is built on top of OpenSearch and offers an integrated solution for crawling, indexing, and searching documents from websites, file systems, and various data stores. Fess includes a built-in crawler that can collect content from sources such as databases, CSV files, and shared storage, making it suitable for centralized knowledge discovery. ...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 3
    HugeGraph

    HugeGraph

    A graph database that supports more than 100+ billion data

    ...HugeGraph supports fast import performance in the case of more than 10 billion Vertices and Edges Graph, millisecond-level OLTP query capability, and can be integrated into big data platforms like Hadoop or Spark for OLAP analysis. The main scenarios of HugeGraph include correlation search, fraud detection, and knowledge graph. Not only supports Gremlin graph query language and RESTful API but also provides commonly used graph algorithm APIs. To help users easily implement various queries and analyses, HugeGraph has a full range of accessory tools, such as supporting distributed storage, data replication, scaling horizontally, and supports many built-in backends of storage engines.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 4
    EnvFile

    EnvFile

    EnvFile 3.x is a plugin for JetBrains IDEs

    Env File is a plugin for JetBrains IDEs that allows you to set environment variables for your run configurations from one or multiple files. Not all run configurations available in IDEA-based IDEs are implemented similarly. Some of them differ significantly. In certain cases (so far, only Gradle has been confirmed) the implementation exposes interfaces to integrate the EnvFile UI but doesn't provide interfaces for it to actually make its work. Luckily, it was possible to make few assumptions...
    Downloads: 11 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Wren Engine

    Wren Engine

    The Semantic Engine for Model Context Protocol(MCP)

    Wren Engine is a semantic engine designed to empower Model Context Protocol (MCP) clients and AI agents by providing accurate, contextual, and governed access to business data. It serves as a bridge between large language models (LLMs) and enterprise systems, facilitating seamless integration and interaction. ​
    Downloads: 3 This Week
    Last Update:
    See Project
  • 6
    jsonista

    jsonista

    Clojure library for fast JSON encoding and decoding

    Clojure library for fast JSON encoding and decoding.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 7
    Apache Hudi

    Apache Hudi

    Upserts, Deletes And Incremental Processing on Big Data

    Apache Hudi (pronounced Hoodie) stands for Hadoop Upserts Deletes and Incrementals. Hudi manages the storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage). Apache Hudi is a transactional data lake platform that brings database and data warehouse capabilities to the data lake. Hudi reimagines slow old-school batch data processing with a powerful new incremental processing framework for low latency minute-level analytics. Hudi provides efficient upserts, by mapping a given hoodie key (record key + partition path) consistently to a file id, via an indexing mechanism. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Alluxio

    Alluxio

    Open Source Data Orchestration for the Cloud

    Alluxio is the world’s first open source data orchestration technology for analytics and AI for the cloud. It bridges the gap between computation frameworks and storage systems, bringing data from the storage tier closer to the data driven applications. This enables applications to connect to numerous storage systems through a common interface. It makes data local, more accessible and as elastic as compute.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    OrientDB

    OrientDB

    DBMS supporting graph, document, full-text and geospatial models

    OrientDB is an Open Source Multi-Model NoSQL DBMS with the support of Native Graphs, Documents, Full-Text search, Reactivity, Geo-Spatial and Object Oriented concepts. It's written in Java and it's amazingly fast. No expensive run-time JOINs, connections are managed as persistent pointers between records. You can traverse thousands of records in no time. Supports schema-less, schema-full and schema-mixed modes. Has a strong security profiling system based on user, roles and predicate...
    Downloads: 5 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Odigos

    Odigos

    Distributed tracing without code changes

    ...Historically, compiled languages like Go have been difficult to instrument without code changes. Odigos solves this problem by uniquely leveraging eBPF. Odigos currently supports all the popular managed and open source destinations. By producing data in the OpenTelemetry format, Odigos can be used with any observability tool that supports OTLP. Odigos automatically scales OpenTelemetry collectors based on observability data volume. Manage and configure collectors via a convenient web UI. Installing Odigos takes less than 5 minutes, and requires no code changes.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 11
    ShardingSphere

    ShardingSphere

    Distributed database ecosphere

    Apache ShardingSphere is an open-source ecosystem consisted of a set of distributed database solutions, including 3 independent products, JDBC, Proxy & Sidecar (Planning). They all provide functions of data scale out, distributed transaction and distributed governance, applicable in a variety of situations such as Java isomorphism, heterogeneous language and cloud native. Apache ShardingSphere aiming at reasonably making full use of the computation and storage capacity of existed database in distributed system, rather than a totally new database. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    MCreator

    MCreator

    Make Minecraft Java Edition mods, Minecraft Bedrock Edition Add-Ons

    MCreator is an open-source integrated development environment designed to simplify the process of creating Minecraft mods, add-ons, resource packs, and data packs. The software provides a visual programming environment where users can design mod features using graphical tools instead of writing code manually. Through wizards, block-based programming interfaces, and an integrated editor, MCreator generates the underlying Java or JSON code required for the mod automatically. This approach allows beginners and students to experiment with mod development without needing extensive programming experience. ...
    Downloads: 30 This Week
    Last Update:
    See Project
  • 13
    JSON-java

    JSON-java

    A reference implementation of a JSON package in Java

    JSON is a light-weight language-independent data interchange format. The JSON-Java package is a reference implementation that demonstrates how to parse JSON documents into Java objects and how to generate new JSON documents from the Java classes.
    Downloads: 50 This Week
    Last Update:
    See Project
  • 14
    Chat2DB

    Chat2DB

    AI-driven database tool and SQL client

    ...Just enter the names of the tables and columns, and we will automatically configure the type, password, and comment, saving you 90% of the time. Imports and exports data in multiple formats (CSV, XLSX, XLS, SQL) to facilitate exchange, backup, and migration. Transfers data between different databases or through cloud services, as a backup and recovery solution that guarantees the minimum loss of data and downtime during migrations.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    Cryptomator

    Cryptomator

    Multi-platform transparent client-side encryption for your cloud files

    Cryptomator is a free and open source project that offers multi-platform, transparent client side encryption of your files in the cloud. It works with just about any cloud storage service; is totally transparent so you can work with your files how you normally would; and secure with a 256-bit key length AES encryption. It’s also simple and easy to use with no need for different accounts, key management, cloud access grants or cipher configurations.
    Downloads: 33 This Week
    Last Update:
    See Project
  • 16
    Portfolio Performance

    Portfolio Performance

    Track and evaluate the performance of your investment portfolio

    ...Performance indicators like that True-Time Weighted Rate of Return or the internal rate of return ( Internal rate of return ) are calculated. Historical courses are automatically run by Yahoo Finance loaded or can be extracted from any HTML pages. Due to the open file format, all data is available as XML available or can be used as CSV be exported. Support for Rebalancing based on freely definable Asset allocations. With the help of the historical exchange rates of the European Central Bank ( ECB ) accounts and shares can be held in foreign currency.
    Downloads: 27 This Week
    Last Update:
    See Project
  • 17
    Semantic Type Detection

    Semantic Type Detection

    Metadata/data identification Java library

    Metadata/data identification Java library. Identifies Base Type (e.g. Boolean, Double, Long, String, LocalDate, LocalTime, ...) and Semantic Type information (e.g. Gender, Age, Color, Country, ...). Extensive country/language support. Extensible via user-defined plugins. Comprehensive Profiling support. Large set of built-in Semantic Types (extensible via JSON defined plugins).
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18
    Siddhi Core Libraries

    Siddhi Core Libraries

    Stream Processing and Complex Event Processing Engine

    Fully open source, cloud-native, scalable, micro streaming, and complex event processing system capable of building event-driven applications for use cases such as real-time analytics, data integration, notification management, and adaptive decision-making. Event processing logic can be written using Streaming SQL queries via graphical and source editors, to capture events from diverse data sources, process and analyze them, integrate with multiple services and data stores, and publish output to various endpoints in real time. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 19
    ElasticJob

    ElasticJob

    Distributed scheduled job framework

    ElasticJob is a distributed scheduling solution consisting of two separate projects, ElasticJob-Lite and ElasticJob-Cloud. ElasticJob-Lite is a lightweight, decentralized solution that provides distributed task sharding services. ElasticJob-Cloud uses Mesos to manage and isolate resources. It uses a unified job API for each project. Developers only need code one time and can deploy at will. Support job sharding and high availability in distributed system. Scale out for throughput and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20
    XLog

    XLog

    Android logger, pretty, powerful and flexible, log to everywhere

    Lightweight and pretty, powerful and flexible logger for Android and java, can print the log to Logcat, Console, and Files, or anywhere if you like.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    jsonschema2pojo

    jsonschema2pojo

    Generate Java types from JSON or JSON Schema for data-binding

    ...If the additionalProperties node is undefined (not present), null or empty, then a new bean property named "additionalProperties", of type Map<String,Object> is added to the generated type (with appropriate accessors). The accessors are annotated to allow Jackson to marshal/unmarshal unrecognised (additional) properties found in JSON data from/to this map.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 22
    Redisson

    Redisson

    Valkey & Redis Java client. Real-Time Data Platform

    Redisson is a Java client library for Redis that offers distributed data structures, services, and frameworks to build scalable and reliable applications. It simplifies Redis usage by providing in-memory Java objects like maps, sets, locks, queues, and semaphores that are backed by Redis. Redisson supports advanced features like distributed locking, asynchronous APIs, and integrates with frameworks like Spring and Quarkus for reactive and cloud-native development.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Hello Algorithm

    Hello Algorithm

    Animated illustrations, one-click data structure

    Animated illustrations, one-click data structure and algorithm tutorials. This project aims to create an open source, free, novice-friendly introductory tutorial on data structures and algorithms. The whole book uses animated illustrations, the content is clear and easy to understand, and the learning curve is smooth, guiding beginners to explore the knowledge map of data structures and algorithms.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    OpenSearch

    OpenSearch

    Open source distributed and RESTful search engine

    ...It offers excellent performance and can scale up and down as the needs of the application grow or shrink. Its distributed design means that you interact with OpenSearch clusters. Each cluster is a collection of one or more nodes, servers that store your data and process search requests. You can run OpenSearch locally on a laptop, its system requirements are minimal, but you can also scale a single cluster to hundreds of powerful machines in a data center.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 25
    Apache Iceberg

    Apache Iceberg

    Apache Iceberg

    Iceberg is a high-performance format for huge analytic tables. Iceberg brings the reliability and simplicity of SQL tables to big data while making it possible for engines like Spark, Trino, Flink, Presto, Hive, and Impala to safely work with the same tables, at the same time. The core Java library that tracks table snapshots and metadata is complete, but still evolving. Current work is focused on adding row-level deletes and upserts, and integration work with new engines like Flink and Hive. ...
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB