Summary and Schedule

ATTENTION This is an experimental test of The Carpentries Workbench lesson infrastructure. It was automatically converted from the source lesson via the lesson transition script.

If anything seems off, please contact Zhian Kamvar zkamvar@carpentries.org

Data Carpentry’s aim is to teach researchers basic concepts, skills, and tools for working with data so that they can get more done in less time, and with less pain. This lesson was designed for researchers interested in working with public health data in R, but may be of interest to researchers in other fields as well.

This lesson provides an introduction to simple linear regression. The episodes cover the concept of simple linear regression, the use of simple linear regression with various types of predictor variables (single continuous variable, single factor variable with two groups and single factor variable with more than two groups), predictions of the mean and the assessment of model fit and assumptions.

Getting started

To get started, see the instructions in the Setup page. There you will learn how to obtain the data and packages used in this lesson.

Prerequisites

This lesson does not require a formal background in statistics.

This lesson requires:

  • Working copies of R and RStudio. See here for installation instructions.
  • An understanding of how to use the Tidyverse packages to summarise and manipulate data in RStudio. See these episodes on data handling and data manipulation.
  • An understanding of how to use the ggplot2 package to plot data in RStudio. See this episode on data visualisation.
  • An understanding of the concepts covered in the Statistical thinking for public health lesson.

The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.

To get set up for this lesson, there are a few steps that you will need to follow, as explained below:

  1. Install R and RStudio, or update these if you already have the programs installed.
  2. Download the RStudio Project environment for this lesson.
  3. Set up the packages used in this lesson using renv.

Installing or updating R and RStudio

If you do not have R and RStudio installed on your computer, follow the installation instructions here. If you have R and RStudio installed, please make sure that your computer is running the latest versions by following the instructions here.

Downloading the RStudio Project environment

To ensure that you have the data and packages used in this lesson, you can use a dedicated RStudio Project environment. Please follow the following steps:

  1. Create a folder on your computer for the lesson, e.g. simple-linear-reg-lesson in your documents folder.
  2. Download the environment as a zipped folder by left clicking here.
  3. Unzip the downloaded folder into the simple-linear-reg-lesson folder.

Setting up the packages used in the lesson

To ensure that everyone is using the same versions of packages used in this lesson, as well as to avoid clashes with packages installed on your computer, the RStudio Project environment has a pre-defined package library using renv. To install the required packages, please follow the following steps:

  1. Launch the RStudio Project by double-clicking on the workshop-project.RProj file in simple-linear-reg-lesson.
  2. Allow renv to install the required packages locally by typing renv::restore() in the console.
  3. When the prompt comes up, answer it with Y.
  4. After installation has completed, load the required packages by typing source("scripts/load_packages.R") into the console.
  5. You are now set up for the lesson!

Note: the lesson data should be in your Environment under the name dat. If you would like more information about the data, see the [data][lesson-data] page.