So exactly what does Georgia export to the world? Here is a brief video tutorial using US Census data and Microsoft Power BI to explore what's been going through terminals like the port in Savannah and Hartsfield airport. This is part of a series of articles for retrieving and summarizing public data from USA Trade Online.
With both pride and relief -- I get to share news of completing the Capstone course for the Microsoft Professional Program in Data Science. Over the past six months, I've been systematically working through the nine courses leading up to this final 10th course in the series. You can look on this index page to see observations for the other courses.
Our company acquired a data file containing over 15,000 rows and 300 columns. We are trying to identify patterns in the data. Where do we begin evaluating such a large dataset? Would Using R be helpful?
This course teaches exploratory data analysis skills using the Microsoft R Server implementation known as RevoScaleR. This product is in most ways functionally equivalent to the open source CRAN-R. RevoScaleR offers three significant benefits over its open source brother: the ability to run analyses in parallel across different servers, the ability to "chunk" data for evaluation and bypass the in-memory limitation of R, and the ability to read more natively from data sources like SQL Server, Hadoop, and Spark.
"Data Science Orientation" is the first class in the Microsoft Professional Program (MPP) for Data Sciences. This class is a warm-up exercise for the larger program. It outlines the 10-part certificate process and introduces you to five current data scientists who answer questions about their career. Are you wondering what skills and personality traits will help you succeed as a data scientist? The professional interviews offer practical insight on those topics.
Programming R for Data Science is taught by Anders Stockmarr (on the faculty of Technical University of Denmark.) For US audiences, his accent requires some getting used to. He places emphasis on unexpected syllables and has a unique way of pronouncing many things. I found it helpful to use headphones and to adjust the playback speed of the recordings. It is worth making the effort to understand Dr.
Principles of Machine Learning (DAT203.2) is the 7th in a series of 10 courses that form the Microsoft Professional Program in Data Science. It proves that the further you get into this 10-course sequence, the more enjoyable the classes become. Similar to Data Science Orientation, this class is co-led by Cynthia Rudin and Steve Elston.
My goal is to help others measure, report and make sense of the data that drives their organization. Topics include: database tools, visualization techniques, open data sources and anything else that helps you apply data science.