Manage Big Data Pipelines in the Cloud with Databricks and StreamSets

November 13, 2019 - 10:00 AM Pacific Standard Time
Whether you are cloud-native or migrating to the cloud, enterprises are looking for speed and agility. Databricks and StreamSets have partnered to bring rapid data pipeline design and testing to critical cloud workloads. Together, they bring the power of Apache Spark™ to a broad audience with a logical and visual, UI-based pipeline development tool. This allows more users to leverage Apache Spark™ and Delta Lake with confidence, reliability and unmatched performance in the cloud.


In this webinar, we will discuss:
  1. Using a drag-and-drop interface for pipeline development to continuously ingest and stream data into Delta Lake on Databricks
  2. How Delta Lake helps make cloud data more reliable with features like ACID-compliant transactions, schema enforcement and scalable metadata handling
  3. How to migrate on prem Data Lake workloads (e.g. Hadoop) to cloud services and easily manage compute resources using Databricks’ optimized auto-scaling for compute resources


Speakers:
  • Hiral Jasani, Senior Partner Marketing Manager at Databricks
  • Nauman Fakhar, Director of ISV Solutions at Databricks
  • Rupal Shah, Director of Cloud Services at StreamSets

Register Now