In this hands-on lab, you are working as a data scientist for The Greenery, a gardening store. After an unfortunate accident in which the summer intern (we’ll call him Steve) spilled all the seeds on the floor, you’ve been asked to come up with a model that can group the seeds based on their features. Steve wants to make sure the seeds are sorted before the boss returns and finds the mess! You’ll use Azure Machine Learning studio and the Azure Machine Learning designer to create a clustering model that groups similar seeds together.
Learning Objectives
Successfully complete this lab by achieving the following learning objectives:
- Open Azure Machine Learning Studio and Create a New Dataset
In Azure Machine Learning studio, create a new dataset using the seed data file from Microsoft.
- Review the Dataset
Inspect the data you’re working with and check for missing values or other values that could cause problems when training the model.
- Design a Pipeline to Group Together Similar Seeds
Using the Azure Machine Learning designer, drag and drop modules to create a pipeline that will train a model to group together seeds based on their features.
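The lab itself is done by dragging and dropping modules in the designer, but the logic behind the last two objectives can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the column names and sample rows are hypothetical stand-ins for the real seed data file, k-means is assumed as the clustering algorithm (the lab description only says "clustering model"), and the number of clusters is set to 2 purely for the example.

```python
import csv
import io
import math

# Hypothetical sample rows; the real lab uses the seed data file from Microsoft.
SAMPLE_CSV = """area,perimeter,kernel_length
15.26,14.84,5.763
14.88,14.57,5.554
14.29,,5.291
19.11,16.26,6.154
18.98,16.66,6.549
19.15,16.45,6.245
"""


def load_rows(text):
    """Parse CSV text into dicts of floats, using None for empty cells."""
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        rows.append({k: (float(v) if v.strip() else None) for k, v in record.items()})
    return rows


def count_missing(rows):
    """Count missing values per column (the 'review the dataset' step)."""
    counts = {col: 0 for col in rows[0]}
    for row in rows:
        for col, value in row.items():
            if value is None:
                counts[col] += 1
    return counts


def kmeans(points, k, iterations=20):
    """Basic k-means: start from the first k points, then alternate assign and update."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assign each point to its nearest centroid (Euclidean distance).
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points]
        # Move each centroid to the mean of the points assigned to it.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


rows = load_rows(SAMPLE_CSV)
missing = count_missing(rows)

# Drop incomplete rows before clustering; this is one simple way to handle gaps.
complete = [row for row in rows if None not in row.values()]
points = [list(row.values()) for row in complete]
labels, centroids = kmeans(points, k=2)

print("Missing values per column:", missing)
print("Cluster label per complete row:", labels)
```

The sketch skips feature scaling and uses a naive centroid initialization to stay short; a real pipeline would usually normalize the features first so that no single measurement dominates the distance calculation.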