Skip to main content

DAT209 - Programming R

·373 words·2 mins
Table of Contents
MPPDS - This article is part of a series.
Part 9: This Article

Programming R for Data Science is taught by Anders Stockmarr (on the faculty of Technical University of Denmark.) For US audiences, his accent requires some getting used to. He places emphasis on unexpected syllables and has a unique way of pronouncing many things. I found it helpful to use headphones and to adjust the playback speed of the recordings. It is worth making the effort to understand Dr. Stockmarr because he has put together a course with a lot of substance, using a tight script and backed up by supporting exercises.

Programming R Course Highlights
#

I genuinely enjoyed this course, it goes a lot deeper than the introductory course in R taken earlier in the MPP sequence. For the first course, I used RStudio to experiment. With this course, I wanted to use the Visual Studio version of R to work the exercises and labs. The R Tools for Visual Studio (https://www.visualstudio.com/vs/rtvs/) required some fiddling to get installed, but were stable and had nice IDE features I’ve become used to with VS. Becoming familiar with R Tools for Visual Studio at this point will prepare you for taking DAT213 “Analyzing Big Data in MS R Server” which is the logical follow-on course in the MPP Data Science sequence.

Modules for the course:

featured.gif
  •     Introduction
  •     Functions and Data Structures
  •     Loops and Flow Control
  •     Working with Vectors and Matrices
  •     Reading in Data
  •     Writing Data to Text Files
  •     Reading Data from SQL Databases
  •     Working with Data
  •     Manipulating Data
  •     Simulation
  •     Linear Models
  •     Graphics in R

While taking the R courses in this sequence, I found two supporting references particularly helpful:

DAT209 Course Content
#

There are a total of 43 videos with playing time of about 4.5 hours. The course requirements include:

  1. Quizzes worth 40% of grade,
  2. Labs worth 40%,
  3. The final challenge was worth 20%. With the final, you have four hours to answer 50 questions, and a maximum of four minutes to answer any single question. It is very helpful if you’ve kept notes through the lectures. It is not feasible to Google responses to these questions.

All told, this course required about 25 hours for me to complete.

Jonathan Bartleson
Author
Jonathan Bartleson
MPPDS - This article is part of a series.
Part 9: This Article

Related

DAT203-1 - Data Science Essentials

·392 words·2 mins
Data Science Essentials (DAT203) marks the point where we have enough foundation that we can start forming a bigger picture of data science. To that goal, the course provides this definition: Data Science is the exploration and quantitative analysis of all available structured and unstructured data to develop understanding, extract knowledge, and formulate actionable results. Cynthia Rudin and Steve Elston are co-presenters in this entertaining, informative and well-organized course. Both are really effective instructors, with quite different teaching styles. Cynthia covers the more theoretical topics including a fair bit of concepts relating to statistics. She is an entertaining presenter. A lot of personality and practical examples come through her presentations. She provides specific data science example project from her university lab, one of which concerns predicting manhole fires in Manhattan.

DAT203.2 - Principles of Machine Learning

·427 words·3 mins
Principles of Machine Learning (DAT203.2) is the 7th in a series of 10 courses that form the Microsoft Professional Program in Data Science. It proves that the further you get into this 10-course sequence, the more enjoyable the classes become. Similar to Data Science Orientation, this class is co-led by Cynthia Rudin and Steve Elston. Principles of Machine Learning # The lecture is composed of 60 videos spanning 8 hours lecture time. Watching them and working the exercises reveals the true practical value of the data science tools. This course forces you to genuinely harness the Azure Machine Learning environment with Python or R scripts. All told, this course required about 30 hours to complete.

DAT204 - Intro to R for Data Science

·433 words·3 mins
As a developer, I’m drawn to terse/concise languages that are purpose-built for an objective. Regular expressions are a prime example. There is something beautiful about expressing things in few words (something I try to do in blogging with only partial success!) In this context, I was eagerly anticipating “Intro to R for Data Science.” This course (and this language) did not disappoint. Before going further, I should note something: Within the Microsoft Professional Program for Data Science, it is the student’s discretion to take a Python or an R track. Your decision will be shaped by whether you have prior familiarity with one of those environments, and whether you want to reinforce what you already know or venture into a new tool. Your choice of this class will logically dictate the 2nd “advanced” course required later in the MPP track.