Datasets

  • Data sets were submitted by the authors of articles in JSE. Each data set is presented along with a link to the article that references the data.
    0
    No votes yet
  • Many data sets useful for modeling bivariate relationships. The data sets are formatted for use in Fathom, but text versions are also available.
    0
    No votes yet
  • This site presents 19 videos of statisticians summarizing a project that they did. Each video is accompanied by a dataset so that viewers can try to recreate the statistics in the video. Video runtimes vary from about 8 minutes to as many as 35 minutes.
    0
    No votes yet
  • This is a collection of data sets that were part of R packages. The data set page includes information on which package the data set comes from, the name of the data set, and the number of rows and columns included. Each set is given in .csv form with a documentation file also.
    0
    No votes yet
  • This collection of datasets from Dr. John Rasp's Statistics Webpage is for his STAT 460 (Experimental Design & Advanced Data Analysis), STAT 301 (Business Statistics), STAT 201 (Intro to Business Statistics) classes. This also provides links for statistical web pages, resources for statistical studies, Homework and lecture reviews.
    0
    No votes yet
  • This site did a lot of data visualization on many hot button topics. They provide the raw data that they used to create their graphs at this page. These data sets are kept in Google Doc spreadsheets.
    0
    No votes yet
  • A game to aid in teaching experimental design and significance testing (especially one sample, two sample, and matched pair situations). Tangrams are puzzles in which a person is expected to place geometrically shaped pieces into a particular design. The on-line Tangram Game provides students the opportunity to design many versions of the original game in order to test which variables have the largest effect on game completion time. A full set of student and instructor materials are available and were created by Kevin Comiskey (West Point), Rod Sturdivant (Ohio State University) and Shonda Kuiper (Grinnell College) as part of the Stat2Labs collection.

    0
    No votes yet
  • The Comprehensive Epidemiologic Data Resource is a collection of data sets. It includes definitions of each variable in the data set. It requires a login to retrieve the data sets. Registering involves giving your name and address and the name of the study and a detailed description of the intended use of the data.
    0
    No votes yet
  • This complete lesson plan, which includes assessments, is based upon a data set partially discussed in the article "Female Hurricanes are Deadlier than Male Hurricanes." The data set contains archival data on actual fatalities caused by hurricanes in the United States between 1950 and 2012. Students analyze and explore this hurricane data in order to formulate a question, design and implement a plan to collect data, analyze the data by measures and graphs, and interpret the results in the context of the original question.
    0
    No votes yet
  • The textbook, "Statistics: Unlocking the Power of Data," by Lock, Lock, Lock, Lock, and Lock, webpage has a collection of data sets which are used in their textbook. Even without the textbook, the variables are well named, and it is relatively easy to tell what the variables represent.
    0
    No votes yet

Pages

register