What Is Delta Lake
ibm.com·5h·
Discuss: Hacker News
Flag this post

What is Delta Lake?

Delta Lake is an open source data storage format that combines Apache Parquet data files with a robust metadata log. The Delta Lake format brings key data management functions, such as ACID transactions and data versioning, to data lakes, making it the basis for many data lakehouses.

First developed by Databricks in 2016, Delta Lake is an open table format, an open source framework for tabular data that builds a metadata layer on top of existing file formats. Delta Lake specifically uses Parquet tables for data storage. Other open table formats include Apache Iceberg …

Similar Posts

Loading similar posts...