These cheat sheets make it easy to learn about and use some of the favorite packages of RStudio.
These cheat sheets make it easy to learn about and use some of the favorite packages of RStudio.
RStudio is a set of integrated tools designed to help you be more productive with R. It includes a console, syntax-highlighting editor that supports direct code execution, and a variety of robust tools for plotting, viewing history, debugging and managing your workspace.
The Cornell Statistical Consulting Unit distributes a periodic electronic newsletter, StatNews. This newsletter discusses applications of statistics that are commonly encountered in research and teaching.
R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R.
R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, …) and graphical techniques, and is highly extensible. The S language is often the vehicle of choice for research in statistical methodology, and R provides an Open Source route to participation in that activity.
Which is more robust against outliers: mean or median? This app demonstrates the (in)stability of these descriptive statistics as the value of an outlier and the number of data points change.
This compendium facilitates the creation of good graphs by presenting a set of concrete examples, ranging from the trivial to the advanced. The graphs can all be reproduced and adjusted by copy-pasting code into the R console. Almost every example in this compendium is driven by the same philosophy: A good graph is a simple graph, in the Einsteinian sense that a graph should be made as simple as possible, but not simpler. A note for R fans: the majority of our plots have been created in base R, but you will encounter some examples in ggplot.
Find the best linear fit for a given set of data points and residuals (or let this app show you how it is done).
This resource is designed to provide new users to R, RStudio, and R Markdown with the introductory steps needed to begin their own reproducible research. Many screenshots and screencasts (with no audio) will be included, but if further clarification is needed on these or any other aspect of the book, please create a GitHub issue here or email me with a reference to the error/area where more guidance is necessary. It is recommended that you have R version 3.3.0 or later, RStudio Desktop version 1.0 or higher, and rmarkdown R package version 1.0 or higher.
These handouts/links give a foundational understanding of how to set up and use R
This page presents a series of tutorials and interdisciplinary case studies that can be used in a variety of blended as well as brick-and-mortar courses. The materials can be used in introductory level data science courses as well as more advanced data science or statistics courses. These materials assume that students have a basic prior knowledge of R or Rstudio.