BigData

Delta lake 란

Kyle79 2020. 8. 10. 10:02

 

In computer science, ACID (atomicity, consistency, isolation, durability) is a set of properties of database transactions intended to guarantee data validity despite errors, power failures, and other mishaps. In the context of databases, a sequence of database operations that satisfies the ACID properties (which can be perceived as a single logical operation on the data) is called a transaction. For example, a transfer of funds from one bank account to another, even involving multiple changes such as debiting one account and crediting another, is a single transaction.

 

Delta Lake는 데이터 레이크에 안정성을 제공하는 오픈 소스 스토리지 계층입니다. Delta Lake는 ACID 트랜잭션을 제공하고, 확장 가능한 메타데이터를 처리하고, 스트리밍 및 일괄 처리 데이터 처리를 통합합니다. Delta Lake는 기존 데이터 레이크를 기반으로 하여 실행되며 Apache Spark API와 완벽하게 호환됩니다.

 

 

https://docs.microsoft.com/ko-kr/azure/databricks/delta/

 

Delta Lake - Azure Databricks

Azure Databricks의 Delta Lake에서 사용할 수 있는 Delta Lake 스토리지 계층과 최적화에 대해 알아봅니다.

docs.microsoft.com

 

 

https://docs.microsoft.com/ko-kr/azure/synapse-analytics/spark/apache-spark-what-is-delta-lake

 

Delta Lake란? - Azure Synapse Analytics

델타 Lake의 개요 및 Azure Synapse Analytics의 일부로 작동 하는 방법

docs.microsoft.com

 

 

https://github.com/delta-io

 

Delta Lake

An open-source storage layer that brings ACID transactions to Apache Spark™ - Delta Lake

github.com

 

 

https://github.com/aravinthsci/Spark_Delta_Lake

 

aravinthsci/Spark_Delta_Lake

Delta Lake Examples. Contribute to aravinthsci/Spark_Delta_Lake development by creating an account on GitHub.

github.com

 

 

https://github.com/databricks/delta-live-tables-notebooks

 

databricks/delta-live-tables-notebooks

Contribute to databricks/delta-live-tables-notebooks development by creating an account on GitHub.

github.com

 

 

 

https://databricks.com/discover/demos/delta-lake

 

Delta Lake on Databricks Demo

Click to watch this demo, where we cover the main features of Delta Lake on Databricks, as well as touch on some of the performance enhancements available.

databricks.com

 

 

https://github.com/delta-io/delta

 

delta-io/delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads. - delta-io/delta

github.com

 

 

 

'BigData' 카테고리의 다른 글

plotly  (0) 2020.08.14
ACID  (0) 2020.08.12
K8s Lens  (0) 2020.08.06
Cloud Data Lake  (0) 2020.07.22
Airflow Docker-compose  (0) 2020.07.01