Microsoft Learning Experiences DAT203.2x Setup Guide

User Manual:

Open the PDF directly: View PDF PDF.
Page Count: 7

DownloadMicrosoft Learning Experiences DAT203.2x-Setup-Guide
Open PDF In BrowserView PDF
Principles of Machine Learning
Lab Setup Guide

Overview
This guide takes you through the steps to create an environment for performing the data science
experiments described in this repository.
To prepare the lab environment, you must perform the following tasks:
1. Create an Azure ML account
2. Download and extract the lab files
3. Optionally, install Microsoft R Open and RStudio or Python Anaconda

What You’ll Need
To perform the setup tasks, you will need the following:


A Windows, Linux, or Mac OSX computer.



A web browser and Internet connection.

Create an Azure ML Account
Azure ML offers a free-tier account, which you can use to complete the labs in this repository.
Note: A free Azure ML workspace is not the same as a Microsoft Azure trial subscription!

Sign Up for a Microsoft Account and a Free Azure ML Workspace
1.
2.

If you do not already have a Microsoft account, sign up for one at https://signup.live.com/.
If you do not already have an Azure ML workspace, browse to https://aka.ms/edx-dat203.2x-aml
and click Get Started Now. Then follow the instructions to sign up for a free Azure ML
workspace. If prompted, sign in with your Microsoft account credentials.
Note: Your free-tier Azure ML workspace allows you unlimited access, with some reduced
capabilities compared to a full Microsoft Azure subscription. Your experiments will only run at low
priority on a single processor core. As a result, you will experience some longer wait times.
However, you have full access to all features of Azure ML.

Download the Lab Files
All code and data files you will need are contained in a zip file. Follow these steps to download and install
the lab files.
1.
2.

Download the lab files from https://aka.ms/edx-dat203.2x-labfiles.
Extract the downloaded zip file to a convenient folder on your local computer.

Optional: Install Microsoft R Open and RStudio
R is a programming language for conducting statistical analysis processes and visualizing data. You will
use R code in Azure ML for some of the labs in this course, and having a fully-featured R editor can be
useful if you want to explore this code more thoroughly.
Note: For this course, you can choose to complete programming exercises in Python or R (or both if you
are truly ambitious!). If you plan to use R, complete this procedure to install the R runtime and
development tools. If you do not plan to use R, you can skip this section and go to the procedure for
installing Anaconda Python.

Install Microsoft R Open
Note: Microsoft R Open is an enhanced version of R. Microsoft R Open uses the multi-core MKL math
libraries to enhance performance. However, if are using GNU R downloaded from CRAN, you can continue
to do so for this course.
1. In a web browser, navigate to https://aka.ms/edx-dat203.2x-r.
2.

Select the Microsoft R Open download for your operating system and following the installation
directions.

3.

Note: If you are using Mac OS X, you can skip the following step as the MKL math libraries are
already included.
When the Microsoft R Open installation has finished, select the MKL math library download for
your operating system and follow the installation directions.

4.

Verify your installation by starting Microsoft R Open and entering a simple R expression such as 1
+ 2 (which should produce the result 3) in the console as shown in the following image.

5. Close R.

Install RStudio
1.
2.
3.

In a web browser, navigate to https://www.rstudio.com/products/rstudio/download/.
Run the installer for your operating system (Windows, MacOSX, Ubuntu, or Fedora) to install
RStudio.
To verify installation, start RStudio, and look at the console which should resemble the following
image:

4.
5.

Verify that RStudio is configured to use the current version Microsoft R Open, by noting the
version of Microsoft R Open displayed on the console. (The current version is 3.2.3).
If your configuration is not correct select Global Options on the Tools menu and set the path to
the directory where you installed Microsoft R Open, as shown in the following image.

6. When you have verified installation and configuration, close RStudio.

Install R packages
Several of the labs for this course require R packages that are not installed by default. These packages
include:
 ggplot2
 gridExtra
 dplyr
Follow these instructions to install these packages:
1. In RStudio, locate the pane with Packages tab and click on it.
2. Click the Install icon.

3. In the Packages text box of the dialog, type ‘ggplot2, dplyr, gridExtra’, ensuring the names are
comma separated and the Install dependencies box is checked, as shown in the figure.

4.

Click Install. Expect a great deal of text to appear on the console. Watch for error messages. Note
that you may be prompted to create a user library for your downloaded packages.

5. After the packages have been installed, at the console prompt in RStudio, type the following
commands to test loading the packages:

library(ggplot2)
library(dplyr)
library(gridExtra)

Optional: Install Python Anaconda
You will use Python code in Azure ML for some of the labs in this course, and having a fully-featured
Python editor can be useful if you want to explore this code more thoroughly.
Note: In this repository, you can choose to complete programming exercises in Python or R (or both). If
you plan to use Python, complete this procedure to install the Python runtime and development tools. If
you do not plan to use Python, you can skip this procedure.

Install the Python Anaconda Distribution
Note: Microsoft Azure Machine Learning uses a version of the Anaconda Python distribution. However, if
you are used to using another scientific Python distribution, such as Canopy, it is likely that you will be
able to perform all of the exercises for this course using that distribution.
1. In a web browser, navigate to http://continuum.io/downloads.
2. Choose the installer for your operating system (Windows, Apple Macintosh, or Linux).
3. Complete the installation process for Python 2.7 (Not Python 3.X).
4. After installation is complete, verify the installation by starting Spyder, which should look similar
to the following image:

Verify Connection to the IPython kernel
In some cases, when you first start Spyder, you may find that your IPython console does not respond to
the commands you type at the prompt. If this occurs, wait a while to allow IPython kernel to start.
If the IPython console continues to be unresponsive, you may need to manually connect to an IPython
kernel manually. Follow the steps below to connect a console to an IPython kernel.

1. From the kernel icon in the upper right above the IPython console window, select Connect to an
existing kernel as shown in the figure below:

2. Select Browse on the dialog and then Open the first kernel on the list shown in the figure below:

3.
4.
5.

Click OK, to complete selection of the Python kernel for your IPython session.
In the IPython Console pane, enter a command to verify that everything is working – for example,
enter 1+ 2 and verify that the answer 3 is returned.
Close Spyder.

Summary
By completing the tasks in this setup guide, you have prepared you environment for the labs in this
course. Now you’re ready to start learning how to build data science and machine learning solutions.



Source Exif Data:
File Type                       : PDF
File Type Extension             : pdf
MIME Type                       : application/pdf
PDF Version                     : 1.5
Linearized                      : No
Page Count                      : 7
Language                        : en-US
Tagged PDF                      : Yes
XMP Toolkit                     : 3.1-701
Producer                        : Microsoft® Word 2016
Title                           : Microsoft Learning Experiences
Creator                         : WINDOWS ECOSYSTEM
Creator Tool                    : Microsoft® Word 2016
Create Date                     : 2016:06:24 10:17:38+01:00
Modify Date                     : 2016:06:24 10:17:38+01:00
Document ID                     : uuid:CCD4CC70-D4E8-4979-918F-D44501CD984F
Instance ID                     : uuid:CCD4CC70-D4E8-4979-918F-D44501CD984F
Author                          : WINDOWS ECOSYSTEM
EXIF Metadata provided by EXIF.tools

Navigation menu