Introduction to R and RStudio
- Use RStudio to write and run R programs.
- R has the usual arithmetic operators and mathematical functions.
- Use <-to assign values to variables.
- Use ls()to list the variables in a program.
- Use rm()to delete objects in a program.
- Use install.packages()to install packages (libraries).
Project Management With RStudio
- Use RStudio to create and manage projects with consistent layout.
- Treat raw data as read-only.
- Treat generated output as disposable.
- Separate function definition and application.
Seeking Help
- Use help()to get online help in R.
Data Structures
- Use read.csvto read tabular data in R.
- The basic data types in R are double, integer, complex, logical, and character.
- Data structures such as data frames or matrices are built on top of lists and vectors, with some added attributes.
Exploring Data Frames
- Use cbind()to add a new column to a data frame.
- Use rbind()to add a new row to a data frame.
- Remove rows from a data frame.
- Use na.omit()to remove rows from a data frame withNAvalues.
- Use levels()andas.character()to explore and manipulate factors.
- Use str(),summary(),nrow(),ncol(),dim(),colnames(),rownames(),head(), andtypeof()to understand the structure of a data frame.
- Read in a csv file using read.csv().
- Understand what length()of a data frame represents.
Subsetting Data
- Indexing in R starts at 1, not 0.
- Access individual values by location using [].
- Access slices of data using [low:high].
- Access arbitrary sets of data using [c(...)].
- Use logical operations and logical vectors to access subsets of data.
Control Flow
- Use ifandelseto make choices.
- Use forto repeat operations.
Creating Publication-Quality Graphics with ggplot2
- Use ggplot2to create plots.
- Think about graphics in layers: aesthetics, geometry, statistics, scale transformation, and grouping.
Vectorization
- Use vectorized operations instead of loops.
Functions Explained
- Use functionto define a new function in R.
- Use parameters to pass values into functions.
- Use stopifnot()to flexibly check function arguments in R.
- Load functions into programs using source().
Writing Data
- Save plots from RStudio using the ‘Export’ button.
- Use write.tableto save tabular data.
Splitting and Combining Data Frames with plyr
- Use the plyrpackage to split data, apply functions to subsets, and combine the results.
Data Frame Manipulation with dplyr
- Use the dplyrpackage to manipulate data frames.
- Use select()to choose variables from a data frame.
- Use filter()to choose data based on values.
- Use group_by()andsummarize()to work with subsets of data.
- Use mutate()to create new variables.
Data Frame Manipulation with tidyr
- Use the tidyrpackage to change the layout of data frames.
- Use pivot_longer()to go from wide to longer layout.
- Use pivot_wider()to go from long to wider layout.
Producing Reports With knitr
- Mix reporting written in R Markdown with software written in R.
- Specify chunk options to control formatting.
- Use knitrto convert these documents into PDF and other formats.
Writing Good Software
- Keep your project folder structured, organized and tidy.
- Document what and why, not how.
- Break programs into short single-purpose functions.
- Write re-runnable tests.
- Don’t repeat yourself.
- Be consistent in naming, indentation, and other aspects of style.