LakeSoul is a high-performance, unified table storage framework for big data lakes, supporting both streaming and batch data in a single format. Built on top of Apache Spark and leveraging Apache Arrow and Parquet, LakeSoul provides ACID transactions, schema evolution, and time travel. It is designed for large-scale data lake architectures that require consistency, efficiency, and easy integration with modern data stacks.

Features

  • ACID transactions on data lakes
  • Unified handling of batch and streaming data
  • Schema evolution and data versioning
  • Time travel queries for historical data access
  • Optimized for Apache Spark and Parquet
  • Native integration with Apache Arrow and cloud storage

Categories

Storage

License

Apache License V2.0

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Java

Related Categories

Java Storage Software

Registered

2025-06-04