Analyzing Big Data with Microsoft R Server
Microsoft
  • Start Date: 01 Jan, 2018
  • Self-Paced
  • Study Content: Videos


Learn how to use Microsoft R Server to analyze large datasets using R, one of the most powerful programming languages.

Course Fee: Free
Certificate Cost: 99.00 USD

Introduction

Introduction Video

Course Description

This course is made available through the eLearnAfrica and Edx partnership.

The open-source programming language R has for a long time been popular (particularly in academia) for data processing and statistical analysis. Among R's strengths are that it's a succinct programming language and has an extensive repository of third party libraries for performing all kinds of analyses. Together, these two features make it possible for a data scientist to very quickly go from raw data to summaries, charts, and even full-blown reports. However, one deficiency with R is that traditionally it uses a lot of memory, both because it needs to load a copy of the data in its entirety as a data.frame object, and also because processing the data often involves making further copies (sometimes referred to as copy-on-modify). This is one of the reasons R has been more reluctantly received by industry compared to academia.


The main component of Microsoft R Server (MRS) is the RevoScaleR package, which is an R library that offers a set of functionalities for processing large datasets without having to load them all at once in the memory. RevoScaleR offers a rich set of distributed statistical and machine learning algorithms, which get added to over time. Finally, RevoScaleR also offers a mechanism by which we can take code that we developed on our laptop and deploy it on a remote server such as SQL Server or Spark (where the infrastructure is very different under the hood), with minimal effort.


In this course, we will show you how to use MRS to run an analysis on a large dataset and provide some examples of how to deploy it on a Spark cluster or a SQL Server database. Upon completion, you will know how to use R for big-data problems.


Since RevoScaleR is an R package, we assume that the course participants are familiar with R. A solid understanding of R data structures (vectors, matrices, lists, data frames, environments) is required. For example, students should be able to confidently tell the difference between a list and a data frame, or what each object is generally a good representation for and how to subset it. Students should be familiar with basic programming concepts such as control flows, loops, functions and scope. Students should have a good understanding of how to write and debug R functions. Finally, students are expected to have a good understanding of data manipulation and data processing in R (e.g. functions such as merge, transform, subset, cbind, rbind, lapply, apply). Familiarity with 3rd party packages such as dplyr is also helpful.

Certificate cost may vary. You will be redirected to the host page for cost and payment options.

Getting Started

edX Basics

Certificates

MicroMasters

Account

The Courses

Financial Assistance

For more information, please visit the edX FAQ page.

Microsoft

“Be passionate and bold. Always keep learning. You stop doing useful things if you don't learn.” – Satya Nadella, CEO, Microsoft

We live in a mobile-first and cloud-first world. Computing is ubiquitous, and experiences span devices and exhibit ambient intelligence. Billions of sensors, screens and devices – in conference rooms, living rooms, cities, cars, phones, PCs – are forming a vast network and streams of data that simply disappear into the background of our lives. This computing power will digitize nearly everything around us, and will derive insights from all of the data being generated by interactions among people and between people and machines. We are moving from a world where computing power was scarce to a place where it now is almost limitless, and where the true scarce commodity is increasingly human attention.

In this new connected world, we believe technology creates opportunity, and our edX courses are designed to provide you with the core development skills needed to be successful in the cloud-first, mobile-first world.

Our edX courses, taught by Microsoft experts, let you learn through hands-on experience with broad reach, cutting-edge technologies in areas including cloud services, mobile development, and data sciences. Whether you’re a student or a seasoned technologist, we can empower you to build innovative applications, services, and experiences on the Microsoft platform that will help you make a meaningful impact in today’s interconnected world.

Welcome to the world of Microsoft technology – choose a path and make your mark.

 

This institution is available on eLearnAfrica through partnership with Edx.

You may be able to download course materials after enrolling in this course. If not, all of the necessary course materials provided by the course instructor will be available on the provider's course page.
COLLABORATION POLICY

By enrolling in a course on or through eLearnAfrica, you are joining a special worldwide community of learners. The aspiration of eLearnAfrica is to provide anyone with an internet connection access to courses from the best universities and institutions in the world and to provide our learners the best educational experience internet technology enables. You are a part of the community that will help eLearnAfrica achieve this goal. eLearnAfrica depends upon your motivation to learn the material and to do so with honesty and academic integrity. In order to participate in eLearnAfrica, you must agree to the Honor Code below and any additional terms specific to a class.

ELEARNAFRICA HONOR CODE PLEDGE

By enrolling in a course, program, or degree hosted on the eLearnAfrica App or Site, I agree that I will:

  • Complete all tests and assignments on my own, unless collaboration on an assignment is explicitly permitted.

  • Maintain only one user account and not let anyone else use my username and/or password.

  • Not engage in any activity that would dishonestly improve my results, or improve or hurt the results of others.

  • Not post answers to problems that are being used to assess student performance.

  • Abide by any and all requirements of the eLearnAfrica Participants as may regard the expectations of civil or academic behavior or of community standards.

VIOLATIONS

If you are found in violation of the Terms and Conditions or Honor Code, you may be subject to one or more of the following actions:

  • Receiving a zero or no credit for an assignment;

  • Having any certificate earned in the course withheld or revoked;

  • Being unenrolled from a course, learning program or degree program; or

  • Termination of your use of the App and/or Site.

  • Additional actions may be taken at the sole discretion of eLearnAfrica and eLearnAfrica course providers. 

No refunds will be issued in the case of any corrective action for such violations.

Honor code violations will be determined at the sole discretion of eLearnAfrica, the Partners, or Members. You will be notified if a determination has been made that you have violated this honor code and you will be informed of the corresponding action to be taken as a result of the violation.

CHANGING THE HONOR CODE

Please note that we review and may make changes to this Honor Code from time to time. Any changes to this Honor Code will be effective immediately upon posting on this page, with an updated effective date. By accessing the App and/or Site after any changes have been made, you signify your agreement on a prospective basis to the modified Honor Code and any changes contained therein. Be sure to return to this page periodically to ensure familiarity with the most current version of this Honor Code.

Effective Date: September 22, 2016