This site presents 19 videos of statisticians summarizing a project that they did. Each video is accompanied by a dataset so that viewers can try to recreate the statistics in the video. Video runtimes vary from about 8 minutes to as many as 35 minutes.
This is a collection of data sets that were part of R packages. The data set page includes information on which package the data set comes from, the name of the data set, and the number of rows and columns included. Each set is given in .csv form with a documentation file also.
This collection of datasets from Dr. John Rasp's Statistics Webpage is for his STAT 460 (Experimental Design & Advanced Data Analysis), STAT 301 (Business Statistics), STAT 201 (Intro to Business Statistics) classes. This also provides links for statistical web pages, resources for statistical studies, Homework and lecture reviews.
This site did a lot of data visualization on many hot button topics. They provide the raw data that they used to create their graphs at this page. These data sets are kept in Google Doc spreadsheets.
The Comprehensive Epidemiologic Data Resource is a collection of data sets. It includes definitions of each variable in the data set. It requires a login to retrieve the data sets. Registering involves giving your name and address and the name of the study and a detailed description of the intended use of the data.
This complete lesson plan, which includes assessments, is based upon a data set partially discussed in the article "Female Hurricanes are Deadlier than Male Hurricanes." The data set contains archival data on actual fatalities caused by hurricanes in the United States between 1950 and 2012. Students analyze and explore this hurricane data in order to formulate a question, design and implement a plan to collect data, analyze the data by measures and graphs, and interpret the results in the context of the original question.
The textbook, "Statistics: Unlocking the Power of Data," by Lock, Lock, Lock, Lock, and Lock, webpage has a collection of data sets which are used in their textbook. Even without the textbook, the variables are well named, and it is relatively easy to tell what the variables represent.
This issue contains articles about microarray data and the partnership between statisticians and biologists, ASA Stat Bowl at JSM 2005, an interview with Stat Bowl 2004 champion Jesse Frey, USCOTS 2005 plans, cluster sampling, an analysis of Civil War intelligence sleuth's Alan Pinkerton's incompetence.
This issue contains articles about the birthday problem probabilities using simulation analysis using R; making money on eBay using multiple regression to estimate prices of violins; McDonald's French fry actual mass vs. industry standard mass student project; PC vs. Mac computers survey of Harvard students; EESEE electronic story and exercise encyclopedia; 12 types of variables used in statistical analysis; the history of probability in the Enlightenment for rational decisions in law, science, and politics.