London Spark Meetup
Building reliable lakehouses with Delta Lake Primer and AMA
Tuesday 28 February | 6pm
FORA Soho, 33 Broadwick Street, W1F 0DQ
Join this two-part session featuring Seattle-based Denny Lee (Apache Spark™ and MLflow contributor, Delta Lake maintainer, and co-author of Learning Spark: 2nd edition and the upcoming Delta Lake: The Definitive Guide) and Simon Whiteley, Director of Engineering & Co-Owner of London-area Advancing Analytics!
Agenda
18:00
Welcome to the meetup
18:30
Session 1: Building reliable lakehouses with Delta Lake
Data lakehouses combine the best of both worlds: databases and data lakes. Databases provide relative simplicity and ACID transactional protection for your data, while data lakes provide flexibility, scalability, and support for unstructured data on cheap object stores.
In this session, we describe Delta Lake, which brings reliability by providing a transactional layer on top of data lakes. We will talk about key features of Delta Lake that enable the Lakehouse Architecture. Finally, we will talk about the work we are doing to build the ecosystem around Delta Lake including supporting multiple languages (Python, Rust, Java, etc.) as well as data processing systems (Apache Flink, Apache Pulsar, Apache Hive, PrestoDB, TrinoDB, Apache Spark™, etc.).
19:10
Session 2: Simon & Denny - Ask Us Anything!
Join us for a live session of the monthly series "Simon and Denny - Ask Us Anything!", where we will answer your data engineering questions, from building a data platform to ingestion, ETL, and analytics. With backgrounds ranging from SQL Server and BI to Apache Spark and Delta Lake, we want to show you how to build your own lakehouse.
As this session is interactive, come prepared to ask questions throughout! Be prepared for another geeky, trans-Atlantic event from two data nerds … in person!
19:45
Wrap up
Speakers
Simon Whiteley
Director of Engineering & Co-Owner
Advancing Analytics (London area)
Simon is a Databricks Beacon, Microsoft MVP and owner of Advancing Analytics. A deep techie with a focus on emerging cloud technologies and applying “big data” thinking to traditional analytics problems, Simon also has a passion for bringing it back to the high level and making sense of the bigger picture. When not tinkering with tech, Simon is a death-dodging London cyclist, a sampler of craft beers, an avid chef, and a generally nerdy person.
Denny Lee
Developer Advocate
Databricks
Denny Lee is an Apache Spark™ and MLflow contributor, Delta Lake maintainer, co-author of Learning Spark: 2nd edition and the upcoming Delta Lake: The Definitive Guide, and Sr. Staff Developer Advocate at Databricks.
He is a hands-on distributed systems and data science engineer with extensive experience developing internet-scale data platforms and predictive analytics systems. He also has a Master of Biomedical Informatics from Oregon Health & Science University and has implemented powerful data solutions for enterprise healthcare customers. His current technical focuses include distributed systems, Apache Spark, deep learning, machine learning, and genomics.
© Databricks 2024. All rights reserved. Apache, Apache Spark, Spark and the Spark logo are trademarks of the Apache Software Foundation.