The Cornell Statistical Consulting Unit distributes a periodic electronic newsletter, StatNews. This newsletter discusses applications of statistics that are commonly encountered in research and teaching.
The Cornell Statistical Consulting Unit distributes a periodic electronic newsletter, StatNews. This newsletter discusses applications of statistics that are commonly encountered in research and teaching.
R is a language and environment for statistical computing and graphics. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R.
R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, …) and graphical techniques, and is highly extensible. The S language is often the vehicle of choice for research in statistical methodology, and R provides an Open Source route to participation in that activity.
Which is more robust against outliers: mean or median? This app demonstrates the (in)stability of these descriptive statistics as the value of an outlier and the number of data points change.
This compendium facilitates the creation of good graphs by presenting a set of concrete examples, ranging from the trivial to the advanced. The graphs can all be reproduced and adjusted by copy-pasting code into the R console. Almost every example in this compendium is driven by the same philosophy: A good graph is a simple graph, in the Einsteinian sense that a graph should be made as simple as possible, but not simpler. A note for R fans: the majority of our plots have been created in base R, but you will encounter some examples in ggplot.
Find the best linear fit for a given set of data points and residuals (or let this app show you how it is done).
This resource is designed to provide new users to R, RStudio, and R Markdown with the introductory steps needed to begin their own reproducible research. Many screenshots and screencasts (with no audio) will be included, but if further clarification is needed on these or any other aspect of the book, please create a GitHub issue here or email me with a reference to the error/area where more guidance is necessary. It is recommended that you have R version 3.3.0 or later, RStudio Desktop version 1.0 or higher, and rmarkdown R package version 1.0 or higher.
These handouts/links give a foundational understanding of how to set up and use R
This page presents a series of tutorials and interdisciplinary case studies that can be used in a variety of blended as well as brick-and-mortar courses. The materials can be used in introductory level data science courses as well as more advanced data science or statistics courses. These materials assume that students have a basic prior knowledge of R or Rstudio.
The goal of this text is to provide a broad set of topics and methods that will give students a solid foundation in understanding how to make decisions with data. This text presents workbook-style, project-based material that emphasizes real world applications and conceptual understanding. Each chapter contains:
The text is highly adaptable in that the various chapters/parts can be taken out of order or even skipped to customize the course to your audience. Depending on the level of in-class active learning, group work, and discussion that you prefer in your course, some of this work might occur during class time and some outside of class.
The Global Terrorism Database (GTD) contains information about more than 140,000 terrorist incidents occurring between 1970 and 2014. The data in the GTD are gathered from information gathered through multiple news sources (LaFree, Dugan, & Miller, 2015). In this activity, we will study the extent to which chemical, biological, radiological, and nuclear (CBRN) weapons have been used so far. We analyze whether or not their past use fits with our perceptions. Have CBRN weapons been used successfully in the past? Which weapons are more historically dangerous (more fatalities, injuries) in the hands of terrorists? What are the implications of past usage of CBRN weapons compared to other weapons in determining our priorities in counter-terrorism policies?