Denny Lee, Tristen Wentling, Scott Haines, Prashanth Babu

Delta Lake: The Definitive Guide

Modern Data Lakehouse Architectures with Data Lakes. Sprache: Englisch.
kartoniert , 400 Seiten
ISBN 1098151941
EAN 9781098151942
Veröffentlicht 10. Dezember 2024
Verlag/Hersteller O'Reilly Media
80,00 inkl. MwSt.
vorbestellbar (Versand mit Deutscher Post/DHL)
Teilen
Beschreibung

Discover how Delta Lake simplifies the process of building data lakehouses and data pipelines at scale. With this practical guide, data engineers, data scientists, and data analysts will explore key data reliability challenges and learn to apply modern data engineering and management techniques. You'll also understand how ACID transactions bring reliability to data lakehouses at scale.
This book helps you: - Understand key data reliability challenges - Examine data management and engineering techniques using the modern data stack - Realize data reliability improvements using Delta Lake - Concurrently run streaming and batch jobs against your data lake - Execute update, delete, and merge commands - Use time travel to rollback and examine previous versions of your data - Build a streaming data quality pipeline following the medallion construct

Portrait

Denny Lee is a Staff Developer Advocate at Databricks. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premise and cloud environments. He also has a Masters of Biomedical Informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise Healthcare customers. His current technical focuses include Distributed Systems, Apache Spark, Deep Learning, Machine Learning, and Genomics.

Das könnte Sie auch interessieren

vorbestellbar
15,00
vorbestellbar
23,00
vorbestellbar
32,00
vorbestellbar
22,00
vorbestellbar
25,00
vorbestellbar
16,00
vorbestellbar
18,00