Wednesday, March 18th, 2020
9:00 AM ET
Virtual Half Day Event

Thank you for your interest in the Virtual: AWS | Databricks Dev Day Workshop: Unifying Data Pipelines and Machine Learning with Apache Spark™ and Amazon SageMaker in Atlanta. Your health and safety are of utmost importance to us. Due to the current COVID-19 (coronavirus) situation, Databricks has decided to move this from an in-person event to a web-based event. We will use Zoom as the virtual meeting environment, and a Zoom link will be sent to you upon registration. We look forward to seeing you on Wednesday, March 18th, 2020, from 9:00 AM to 12:30 PM ET.

Every enterprise today wants to accelerate innovation by building Data and ML into its business. However, most companies struggle with preparing large datasets for analytics, managing the proliferation of Data and ML frameworks, and moving models from development to production.

In this workshop, we’ll cover best practices for using powerful open source technologies to simplify and scale your Data and ML efforts. We’ll discuss how to leverage Apache Spark™, the de facto data processing and analytics engine in enterprises today, for data preparation, as it unifies data at massive scale across various sources. You’ll also learn how to use Data and ML frameworks (e.g., TensorFlow, XGBoost, Scikit-Learn) to train models based on different requirements. Finally, you’ll learn how to use MLflow to track experiment runs across multiple users within a reproducible environment and to manage the deployment of models to production on Amazon SageMaker.

Join this half-day workshop to learn how Unified Data Analytics can bring Data Science, Business Analytics and Data Engineering together to accelerate your Data and ML efforts. This free workshop will give you the opportunity to:

Learn how to build highly scalable and reliable pipelines for analytics
Gain deeper insight into Apache Spark and Databricks, including the latest updates with Delta Lake
Train a model against data and learn best practices for working with ML frameworks (e.g., TensorFlow, XGBoost, Scikit-Learn)
Learn about MLflow to track experiments, share projects and deploy models in the cloud with Amazon SageMaker
Network and learn from your ML and Apache Spark peers

AGENDA AT A GLANCE

9:00-9:45 Opening Remarks - Unifying Data Science, Business Analytics and Data Engineering
9:45-10:15 Customer Stories and Use Cases
10:15-10:45 Networking with Peers
10:45-11:30 Data Engineering Interactive Demo & Best Practices: Preparing Data for Analytics
11:30-12:15 Data Science, Business Analytics Interactive Demo & Best Practices: Model Training and Machine Learning
12:15-12:30 Q&A

Please fill out the form to confirm your spot

First Name:

Last Name:

Company:

Job Title:

Company Email:

Phone Number:

Country:

Currently Using Apache Spark:

How big is your company? (# of employees)

Keep me informed with occasional updates about Databricks and related open source products
