Developing a Batch Processing Solution with Azure Services

1 hour
  • 3 Learning Objectives

About this Hands-on Lab

In this hands-on lab scenario you are a data engineer for Awesome Company. You have been tasked with creating a batch processing solution in Azure that will analyze crowdsourced weather information. You have previously provisioned an Azure Data Lake Storage Gen2 account and containers for holding the uploaded files. Now you need to continue building upon that in order to analyze the data on a nightly basis. Performing the actions of this hands-on lab will help you become familiar with building a complete batch processing solution using Azure services.

Learning Objectives

Successfully complete this lab by achieving the following learning objectives:

Prepare the Environment for Loading
  1. Upload files from GitHub to be processed.
  2. Configure the SQL Pool firewall to allow both your client IP address and other Azure services to connect.
  3. Create a table on the SQL Pool to hold the copied records for analysis.
Copy the Weather Data Using Azure Data Factory
  1. Build a pipeline in Azure Data Factory to copy the weather data file records from the Azure Data Lake Storage Gen2 containers into Synapse Analytics.
  2. Along with data from the files, add extra columns for both the file path and the date processed.
Delete the Weather Files
  1. Once the data has been copied, completely remove all files from the containers.

Additional Resources

In this hands-on lab scenario you are a data engineer for Awesome Company. You have been tasked with creating a batch processing solution in Azure that will analyze crowdsourced weather information. You have previously provisioned an Azure Data Lake Storage Gen2 account and containers for holding the uploaded files. Now you need to continue building upon that in order to analyze the data on a nightly basis.

To accomplish your goal, the following should be completed:

  • Prepare the environment for loading.
  • Use Azure Data Factory to copy the data into Synapse Analytics.
  • Delete the crowdsourced files after they have been processed.

What are Hands-on Labs

Hands-on Labs are real environments created by industry experts to help you learn. These environments help you gain knowledge and experience, practice without compromising your system, test without risk, destroy without fear, and let you learn from your mistakes. Hands-on Labs: practice your skills before delivering in the real world.

Sign In
Welcome Back!

Psst…this one if you’ve been moved to ACG!

Get Started
Who’s going to be learning?