Tag: pyspark tutorial

What is Delta Lake?

Delta Lake is an open-source project that enables building a Lakehouse architecture on top of data lakes. It provides ACID transactions and scalable metadata handling, and it unifies streaming and batch data processing on top of existing data lakes such as S3, ADLS Gen1, ADLS Gen2, GCS, and HDFS.

Features of Delta Lake

ACID transactions: Readers never encounter inconsistent data, because Delta Lake provides serializable isolation. […]
