Skip to main content

Getting Started with R Programming

The Beginner’s Guide to R Programming.

I'm very excited to start R Programming and I hope you are too. This is the second course in the Data Science Specialization and it focuses on the nuts and bolts of using R as a programming language.

The recommended background for this course is the course The Data Scientist's Toolbox. It is possible to take this class concurrently with that class but you may have to read ahead in the prerequisite class to get the relevant background for this class. For a complete set of course dependencies in the Data Science Specialization please see the course dependency chart, that has been posted on our blogpost.

The primary way to interact with me and the other students in this course is through the discussion forums which in our case are comments section under the lectures, social media and blogpost. Here, you can start new threads by asking questions or you can respond to other people's questions. If you have a question about any aspect of the course, I strongly suggest that you search through the discussion boards first to see if anyone has already asked that question. If you see something similar to what you want to ask, you should like that question comment to push the notification to get answered to that question quickly rather than asking your question separately. The more votes a question or comment gets, the more likely it is that I will see it and be able to respond quickly. Of course, if you don't see a question similar to the one you want to ask, then you should definitely start a new thread on the appropriate forum.

Finally, consider getting the course textbook, R Programming for Data Science, which is available for free. The content in the book tracks the material covered in the course and allows you to hang on to the material once the course is finished.



Course Description

In this course you will learn how to program in R and how to use R for effective data analysis. You will learn how to install and configure software necessary for a statistical programming environment, discuss generic programming language concepts as they are implemented in a high-level statistical language. The course covers practical issues in statistical computing which includes programming in R, reading data into R, accessing R packages, writing R functions, debugging, and organizing and commenting R code. Topics in statistical data analysis and optimization will provide working examples.

Course Content:

  • Overview of R
  • R data types and objects
  • Reading and writing data
  • Control structures
  • Functions
  • Scoping rules for R programming
  • Dates and times
  • Loop functions
  • Debugging tools
  • Simulation in R
  • Code profiling

Programming Assignments

Programming assignments will be posted for you to practice on blogpost for each topic. Plus we will cover few guided assignments and projects, during and at the end of the course.

Swirl Programming Assignment (practice)

In this course, you have the option to use the swirl R package to practice some of the concepts we cover in lectures.

While these lessons will give you valuable practice and you are encouraged to complete them all for better understanding of the core concepts that will be covered in this course, so please complete them all.

What is swirl?

Swirl is the package that is developed by John Hopkins university for the R programming class.  It's called Statistics with Interactive R Learning or SWIRL for short.  And it was developed by Nick Carchedi, who was a student then at the Johns Hopkins department of bio-statistics. This is a system that allows you to interactively learn R at your own pace. And it will walk you through a bunch of lessons about different aspects of the R language and you can practice them as you go. So, rather than just watching a lecture and then doing an assignment and doing things piece by piece, you can actually work on R right in the R console in a  guided way. Rather than just figuring things out on your own. So, I think this, the SWIRL modules are really helpful and I encourage you to try to walk through them. And it will be great learning combining lecture videos and swirl practice assignments. I think it'll be a lot of fun.

Practical R Exercises in swirl

The swirl package turns the R console into an interactive learning environment. Using swirl will also give you the opportunity to be completely immersed in an authentic R programming environment. In this programming assignment, you'll have the opportunity to practice some key concepts from this course.

  • Install R

Swirl requires R 3.0.2 or later. If you have an older version of R, please update before going any further. If you're not sure what version of R you have, type R.version.string at the R prompt. You can download the latest version of R from https://www.r-project.org/.

Optional but highly recommended: Install R Studio. You can download the latest version of R Studio at https://www.rstudio.com/products/rstudio/.

  • Install swirl

Since swirl is an R package, you can easily install it by entering a single command from the R console:

install.packages("swirl")

If you've installed swirl in the past make sure you have version 2.2.21 or later. You can check this with:

packageVersion("swirl")

  • Load swirl

Every time you want to use swirl, you need to first load the package. From the R console:

library(swirl)

  • Install the R Programming course

swirl offers a variety of interactive courses, but for our purposes, you want the one called R Programming. Type the following from the R prompt to install this course:

install_from_swirl("R Programming")

  • Start swirl and complete the lessons

Type the following from the R console to start swirl:

swirl()

Then, follow the menus and select the R Programming course when given the option. 

I am very excited to start this course and I hope you enjoy this course and I anticipate a fun time in this course!

Comments

Popular posts from this blog

Mastering Debugging in R: Essential Tools and Techniques

The Beginner’s Guide to Debugging Tools in R: Debugging is an essential part of programming in any language, including R. When your code doesn't work as expected, it can be frustrating and time-consuming to find and fix the issue. Fortunately, R provides a variety of debugging tools that can help you identify and fix issues in your code more efficiently. In this blog post, we'll explore some of the most useful debugging tools in R, along with examples of how to use them. The browser() function:  The browser() function is a built-in debugging tool in R that allows you to pause the execution of your code and inspect the values of variables at that point. To use the browser() function, simply insert it into your code where you want to pause the execution. For example: my_function <- function(x) {                                              y <- x * 2  ...

Mastering Loop Functions in R: Exploring tapply and split for Data Manipulation and Analysis

The Beginner’s Guide to Loop Functions in R: Loop functions are powerful tools in R for data manipulation and analysis . They provide efficient and concise ways to apply a function to multiple elements of a data structure. Two commonly used loop functions in R are tapply and split . In this blogpost, we will explore these functions in detail and learn how they can be used to effectively analyze and manipulate data. We will cover the basics of these functions and provide practical examples to illustrate their usage. tapply()  tapply is a loop function in R that applies a function to subsets of a vector or array based on a grouping factor. The syntax of tapply is as follows: tapply(X, INDEX, FUN) where X is the input vector or array, INDEX is the grouping factor, and FUN is the function to be applied. Now suppose we have a data frame containing information about various cities, including their population and average temperature. We could use tapply() to calculate the mean popula...

Mastering R Data Types: Matrices, Factors, Missing Values, Data Frames, and Names Attribute

The Beginner’s Guide to R Data Types: R is a programming language that is widely used for data analysis and statistical computing. It has a powerful set of data structures, including vectors, lists, and data frames, that allow users to work with data in a flexible and efficient way. Matrices A matrix is a two-dimensional array in R that can contain elements of any data type. You can create a matrix using the matrix() function. For example: # Create a matrix with 3 rows and 2 columns  my_matrix <- matrix(c(1, 2, 3, 4, 5, 6), nrow = 3, ncol = 2) Factors A factor is a type of variable in R that represents categorical data. Factors are stored as integers, where each integer corresponds to a level of the factor. You can create a factor using the factor() function. For example: # Create a factor with three levels: "low", "medium", "high"  my_factor <- factor(c("low", "high", "medium", "high", "low")) Missin...