This website provides a comprehensive overview of descriptive statistics (mean/median/mode, range, standard deviation, and variance) through informative webpages with examples, links to data sets, and problems for the readers to try for themselves.
This website provides a comprehensive overview of descriptive statistics (mean/median/mode, range, standard deviation, and variance) through informative webpages with examples, links to data sets, and problems for the readers to try for themselves.
This applet simulates rolling dice and displays the outcomes in a histogram. Students can choose to roll 1, 2, 6, or 9 dice either 1, 10, 20, or 100 times. The outcome studied is the sum of the dice and a red line is drawn on the histogram to show expected number of occurences of each outcome.
The Food and Drug Administration requires pharmaceutical companies to establish a shelf life for all new drug products through a stability analysis. This is done to ensure the quality of the drug taken by an individual is within established levels. The purpose of this out-of-class project or in-class example is to determine the shelf life of a new drug. This is done through using simple linear regression models and correctly interpreting confidence and prediction intervals. An Excel spreadsheet and SAS program are given to help perform the analysis. Key words: prediction interval, confidence interval, stability
This group activity focuses on conducting an experiment to determine which of two brands of paper towels are more absorbent by measuring the amount of water absorbed. A two-sample t-test can be used to analyze the data, or simple graphics and descriptive statistics can be used as an exploratory analysis. Students are asked to think about design issues, and to write a short report stating their results and conclusions, along with an evaluation of the experimental design. Key words: Two-sample t-test
The activity is designed to help students develop a better intuitive understanding of what is meant by variability in statistics. Emphasis is placed on the standard deviation as a measure of variability. As they learn about the standard deviation, many students focus on the variability of bar heights in a histogram when asked to compare the variability of two distributions. For these students, variability refers to the "variation" in bar heights. Other students may focus only on the range of values, or the number of bars in a histogram, and conclude that two distributions are identical in variability even when it is clearly not the case. This activity can help students discover that the standard deviation is a measure of the density of values about the mean of a distribution and to become more aware of how clusters, gaps, and extreme values affect the standard deviation. Key words: Variability, standard deviation
This group activity illustrates the concepts of size and power of a test through simulation. Students simulate binomial data by repeatedly rolling a ten-sided die, and they use their simulated data to estimate the size of a binomial test. They carry out further simulations to estimate the power of the test. After pooling their data with that of other groups, they construct a power curve. A theoretical power curve is also constructed, and the students discuss why there are differences between the expected and estimated curves. Key words: Power, size, hypothesis testing, binomial distribution
This activity is an advanced version of the "Keep your eyes on the ball" activity by Bereska, et al. (1999). Students should gain experience with differentiating between independent and dependent variables, using linear regression to describe the relationship between these variables, and drawing inference about the parameters of the population regression line. Each group of students collects data on the rebound heights of a ball dropped multiple times from each of several different heights. By plotting the data, students quickly recognize the linear relationship. After obtaining the least squares estimate of the population regression line, students can set confidence intervals or test hypotheses on the parameters. Predictions of rebound length can be made for new values of the drop height as well. Data from different groups can be used to test for equality of the intercepts and slopes. By focusing on a particular drop height and multiple types of balls, one can also introduce the concept of analysis of variance. Key words: Linear regression, independent variable, dependent variables, analysis of variance
Which is more robust against outliers: mean or median? This app demonstrates the (in)stability of these descriptive statistics as the value of an outlier and the number of data points change.
The Caesar Shift is a translation of the alphabet; for example, a five-letter shift would code the letter a as f, b as g, ... z as e. We describe a five-step process for decoding an encrypted message. First, groups of size 4 construct a frequency table of the letters in two lines of a coded message. Second, students construct a bar chart for a reference message of the frequency of letters in the English language. Third, students create a bar chart of the coded message. Fourth, students visually compare the bar chart of the reference message (step 2) to the bar chart of the coded message (step 3). Based on this comparison, students hypothesize a shift. Fifth, students apply the shift to the coded message. After decoding the message, students are asked a series of questions that assess their ability to see patterns. The questions are geared for higher levels of cognitive reasoning. Key words: bar charts, Caesar Shift, encryption, testing hypotheses
Explore the functionality of your scientific calculator.