On their journey to becoming data innovators, most enterprises today are struggling with preparing large datasets for analytics, managing the proliferation of Data and ML frameworks, and moving models from development to production.
In this virtual workshop, we will introduce you to the Databricks Unified Analytics Platform - a fully managed service on AWS that offers a collaborative workspace for Data Engineers, Data Scientists and Business Analysts for faster innovation and less operational overhead. We’ll cover best practices for enterprises to use powerful open source technologies to simplify and scale your Data and ML efforts. We’ll discuss how to leverage Apache SparkTM, the de-facto data processing and analytics engine in enterprises today, for data preparation as it unifies data at massive scale across various sources. You’ll also learn how to use Data and ML frameworks (i.e. TensorFlow, XGBoost, Scikit-Learn, etc.) to train models based on different requirements. And finally, you can learn how to use MLflow to track experiment runs between multiple users within a reproducible environment, and manage the deployment of models to production on Amazon SageMaker.
This workshop will give you the opportunity to:
Agenda at a glance:
Find more information on the AWS Master Path here.