This article describes a dataset on body temperature, gender, and heart rate. It addresses concepts like true means, confidence intervals, t-statistics, t-tests, the normal distribution, and regression.
This article describes a dataset containing Major League Baseball data from seasons 1969 through 2000 and illustrates how this data can be used as a course long project covering basic data management, the use of exploratory data analysis to "clean" data, and construction of regression models. The data is in .dat format.
This article presents data for examining the ability of individuals to choose numbers randomly. Three datasets of six-tuples selected by a lottery game, generated by S-Plus, and chosen by college students can be compared using descriptive statistics and goodness of fit tests to explore bias and randomness. Key Words: Boxplots; Chi-squared tests; Minimum gap; QQ plots.
This article presents data from 1997 Big Ten Conference men's basketball games involving the University of Iowa Hawkeyes. The data can be used to demonstrate bivariate statistical inference techniques such as confidence regions, paired comparisons, and simultaneous confidence intervals. Key Word: Bivariate data; Scatterplot.
This article presents a dataset from an experiment designed to test if increased reproduction reduces longevity for male fruitflies. The data are in .dat format. Key Words: Analysis of variance; Contrasts; Experimental design; Regression; Precision; Statistical interaction; Survival analysis.
The dataset presented in this article contains information on respiratory function and smoking. The data can be used to explore descriptive statistics, graphical analysis, regression, and observational studies. The data are in .dat format.
This article presents a dataset based on an industrial case study using design of experiments. It can be used to discuss sample size, power, statistical significance, interaction terms, Type I and Type II errors, the role and importance of the error term, design of experiments, and analysis of variance.
This article describes a dataset containing information for 25 brands of domestic cigarettes. The dataset can be used to illustrate multiple regression, outliers, and collinearity.
This article presents a dataset containing physical measurements for 507 physically active individuals. These data can be used to demonstrate simple descriptive statistics, least squares and multiple regression, or discriminant and classification analysis. The data are in .dat format.
This Department of Energy website provides weekly average gasoline prices for several regions, states and cities. The averages are produced from a weekly survey of around 800 retail gasoline stations. The site includes information on data collection methods, survey methodology and historical data.