Logo DataBaselines, LLC
  • Home
  • About
  • Experience
  • Education
  • Posts
  • Notes
  • Docs
  • Dark Theme
    Light Theme Dark Theme System Theme
Logo Inverted Logo
  • About
    • Company History
    • Documentation
    • Data Analysis
    • DBA
    • Reporting
    • SSIS
    • SSRS
    • Wrangler
  • Blog
    • College Scorecard
    • Confluence
    • Data Quality
    • DB Make Sense
    • Enterprise Architect
    • GA Exports, API and SSIS
    • GA Exports, PowerBI
    • GA Exports, Top 40B
    • GA Exports, USA Trade Online
    • Georgia Exports, Primer
    • Goodname
    • PBI, Change over time
    • Rotary Directory
  • DataScience
    • MPP-Overview
    • MPP
      • MPP-DAT101
      • MPP-DAT201
      • MPP-DAT203-1
      • MPP-DAT203-2
      • MPP-DAT204
      • MPP-DAT207x
      • MPP-DAT209
      • MPP-DAT213
      • MPP-DAT222x
      • MPP-DAT102
  • Posts
Hero Image
DAT102 Data Science Capstone

With both pride and relief – I get to share news of completing the Capstone course for the Microsoft Professional Program in Data Science. Over the past six months, I’ve been systematically working through the nine courses leading up to this final 10th course in the series. You can look on this index page to see observations for the other courses. This includes some general advice for those considering taking these courses.

Thursday, November 9, 2017 | 6 minutes Read
Hero Image
DAT213 - Analyzing Big Data with Microsoft R Server

This course teaches exploratory data analysis skills using the Microsoft R Server implementation known as RevoScaleR. This product is in most ways functionally equivalent to the open source CRAN-R. RevoScaleR offers three significant benefits over its open source brother: the ability to run analyses in parallel across different servers, the ability to “chunk” data for evaluation and bypass the in-memory limitation of R, and the ability to read more natively from data sources like SQL Server, Hadoop, and Spark. This course explains these benefits and allows a new user to become familiar with the RevoScaleR tool.

Tuesday, July 25, 2017 | 4 minutes Read
Hero Image
DAT209 - Programming R

Programming R for Data Science is taught by Anders Stockmarr (on the faculty of Technical University of Denmark.) For US audiences, his accent requires some getting used to. He places emphasis on unexpected syllables and has a unique way of pronouncing many things. I found it helpful to use headphones and to adjust the playback speed of the recordings. It is worth making the effort to understand Dr. Stockmarr because he has put together a course with a lot of substance, using a tight script and backed up by supporting exercises.

Monday, July 10, 2017 | 2 minutes Read
Hero Image
DAT203-1 - Data Science Essentials

Data Science Essentials (DAT203) marks the point where we have enough foundation that we can start forming a bigger picture of data science. To that goal, the course provides this definition: Data Science is the exploration and quantitative analysis of all available structured and unstructured data to develop understanding, extract knowledge, and formulate actionable results. Cynthia Rudin and Steve Elston are co-presenters in this entertaining, informative and well-organized course. Both are really effective instructors, with quite different teaching styles.

Saturday, June 24, 2017 | 2 minutes Read
Hero Image
DAT203.2 - Principles of Machine Learning

Principles of Machine Learning (DAT203.2) is the 7th in a series of 10 courses that form the Microsoft Professional Program in Data Science. It proves that the further you get into this 10-course sequence, the more enjoyable the classes become. Similar to Data Science Orientation, this class is co-led by Cynthia Rudin and Steve Elston. Principles of Machine Learning The lecture is composed of 60 videos spanning 8 hours lecture time. Watching them and working the exercises reveals the true practical value of the data science tools. This course forces you to genuinely harness the Azure Machine Learning environment with Python or R scripts. All told, this course required about 30 hours to complete.

Saturday, June 24, 2017 | 2 minutes Read
Hero Image
DAT204 - Intro to R for Data Science

As a developer, I’m drawn to terse/concise languages that are purpose-built for an objective. Regular expressions are a prime example. There is something beautiful about expressing things in few words (something I try to do in blogging with only partial success!) In this context, I was eagerly anticipating “Intro to R for Data Science.” This course (and this language) did not disappoint. Before going further, I should note something: Within the Microsoft Professional Program for Data Science, it is the student’s discretion to take a Python or an R track. Your decision will be shaped by whether you have prior familiarity with one of those environments, and whether you want to reinforce what you already know or venture into a new tool. Your choice of this class will logically dictate the 2nd “advanced” course required later in the MPP track.

Thursday, June 15, 2017 | 3 minutes Read
Hero Image
DAT222x - Essential Statistics for Data Analysis using Excel

Call me a nerd, but statistics are fascinating and useful. I’d had quite a bit of course-work years ago in school, and was looking forward to “Essential Statistics for Data Analysis using Excel” as a refresher course. Unfortunately, the experience of this edX course might be tag-lined “Sadistics.” Completing this was a painful experience. I hope the notes here will make the experience a bit more tolerable for others.

Sunday, May 21, 2017 | 3 minutes Read
Hero Image
DAT207x - Analyzing and Visualizing Data with PowerBI

The course “Analyzing and Visualizing Data with PowerBI” is devoted to showing the capabilities of this Microsoft tool. For those who have worked previously with Excel, Microsoft Access or SQL Server Reporting Services – the video demonstration of PowerBI capabilities will cause you to repeatedly think “wow – that is slick.” As an example, pictured below is one of the dashboards created as part of the course. Analyzing and Visualizing Data with PowerBI The course is composed of approximately 120 videos whose duration varies from 1-5 minutes. There are 4 different people presenting and the video content is quite good. The pace of content is well measured and the videos nicely support the lab materials. This is a really enjoyable course outlining capabilities of an innovative tool.

Tuesday, May 9, 2017 | 2 minutes Read
Hero Image
DAT201 - Querying with Transact-SQL

Following the breezy orientation course, the Microsoft Professional Program for Data Science curriculum digs into the Microsoft dialect of SQL known as Transact-SQL. This course briefly addresses updating data, stored procedures, transactions and error handling – but the bulk of the course concerns extracting data from SQL Server. This is the 2nd in a 10-part online course sequence for which I’m documenting my experience for others. The course title is “DAT201x: Querying with Transact-SQL”

Friday, May 5, 2017 | 4 minutes Read
Hero Image
DAT101 - Data Science Orientation

“Data Science Orientation” is the first class in the Microsoft Professional Program (MPP) for Data Sciences. This class is a warm-up exercise for the larger program. It outlines the 10-part certificate process and introduces you to five current data scientists who answer questions about their career. Are you wondering what skills and personality traits will help you succeed as a data scientist? The professional interviews offer practical insight on those topics. The enthusiasm these persons have for their career is contagious. Hearing them speak is just the sort of encouragement one needs while embarking on the MPP courses.

Monday, May 1, 2017 | 2 minutes Read
Navigation
  • About
  • Experience
  • Education
Contact me:
  • sales@DataBaselines.com
  • Jonathan Bartleson
  • 770.324.2398

Toha Theme Logo Toha
© DataBaselines, LLC 2025
Powered by Hugo Logo