Statistical Inference & Techniques

  • This article describes an interactive activity illustrating sampling distributions for means, properties of confidence intervals, properties of hypothesis testing, confidence intervals for means, and hypothesis tests for means. Students generate and analyze data and through simulation explore these concepts. The activity is completed in three parts. The three parts of the activity can be used in sequence or they can be used individually as "stand alone" activities. This allows the educator flexibility in utilizing the activity. Part I illustrates the sampling distribution of the sample mean. Part II illustrates confidence intervals for the population mean. Part III illustrates hypothesis tests for the population mean. This activity is appropriate for use in an introductory college or high school AP statistics course. Key words: sampling distribution of a sample mean, confidence interval for a mean, hypothesis test on a mean, simulation, random rectangles
    0
    No votes yet
  • This article describes an interactive activity illustrating general properties of hypothesis testing and hypothesis tests for proportions. Students generate, collect, and analyze data. Through simulation, students explore hypothesis testing concepts. Concepts illustrated are: interpretation of p-values, type I error rate, type II error rate, power, and the relationship between type I and type II error rates and power. This activity is appropriate for use in an introductory college or high school statistics course. Key words: hypothesis test on a proportion, type I and II errors, power, p-values, simulation
    0
    No votes yet
  • Poses the following problem: Suppose there was one of six prizes inside your favorite box of cereal. Perhaps it's a pen, a plastic movie character, or a picture card. How many boxes of cereal would you expect to have to buy, to get all six prizes?

    0
    No votes yet
  • Students explore the definition and interpretations of the probability of an event by investigating the long run proportion of times a sum of 8 is obtained when two balanced dice are rolled repeatedly. Making use of hand calculations, computer simulations, and descriptive techniques, students encounter the laws of large numbers in a familiar setting. By working through the exercises, students will gain a deeper understanding of the qualitative and quantitative relationships between theoretical probability and long run relative frequency. Particularly, students investigate the proximity of the relative frequency of an event to its probability and conclude, from data, the order on which the dispersion of the relative frequency diminishes. Key words: probability, law of large numbers, simulation, estimation

    Includes project file for Minitab and coding for a dice rolling simulation.

    0
    No votes yet
  • This activity is an advanced version of the "Keep your eyes on the ball" activity by Bereska, et al. (1999). Students should gain experience with differentiating between independent and dependent variables, using linear regression to describe the relationship between these variables, and drawing inference about the parameters of the population regression line. Each group of students collects data on the rebound heights of a ball dropped multiple times from each of several different heights. By plotting the data, students quickly recognize the linear relationship. After obtaining the least squares estimate of the population regression line, students can set confidence intervals or test hypotheses on the parameters. Predictions of rebound length can be made for new values of the drop height as well. Data from different groups can be used to test for equality of the intercepts and slopes. By focusing on a particular drop height and multiple types of balls, one can also introduce the concept of analysis of variance. Key words: Linear regression, independent variable, dependent variables, analysis of variance

    0
    No votes yet
  • Residual plots and other diagnostics are important to deciding whether or not linear regression is appropriate for a set of data. Many students might believe that if the correlation coefficient is strong enough, these diagnostic checks are not important. The data set included in this activity was created to lure students into a situation that looks on the surface to be appropriate for the use of linear regression but is instead based (loosely) on a quadratic function. Key words: regression, residuals
    0
    No votes yet
  • This group activity illustrates the concepts of size and power of a test through simulation. Students simulate binomial data by repeatedly rolling a ten-sided die, and they use their simulated data to estimate the size of a binomial test. They carry out further simulations to estimate the power of the test. After pooling their data with that of other groups, they construct a power curve. A theoretical power curve is also constructed, and the students discuss why there are differences between the expected and estimated curves. Key words: Power, size, hypothesis testing, binomial distribution

    0
    No votes yet
  • This group activity focuses on conducting an experiment to determine which of two brands of paper towels are more absorbent by measuring the amount of water absorbed. A two-sample t-test can be used to analyze the data, or simple graphics and descriptive statistics can be used as an exploratory analysis. Students are asked to think about design issues, and to write a short report stating their results and conclusions, along with an evaluation of the experimental design. Key words: Two-sample t-test

    0
    No votes yet
  • The Food and Drug Administration requires pharmaceutical companies to establish a shelf life for all new drug products through a stability analysis. This is done to ensure the quality of the drug taken by an individual is within established levels. The purpose of this out-of-class project or in-class example is to determine the shelf life of a new drug. This is done through using simple linear regression models and correctly interpreting confidence and prediction intervals. An Excel spreadsheet and SAS program are given to help perform the analysis. Key words: prediction interval, confidence interval, stability

    0
    No votes yet
  • This program has been written to explore the relationship between the data points and the error surface of the regression problem. On one hand you can learn how to represent a line in two different spaces ({x,y} and {k,d}), and on the other hand you see that solving the regression problem is nothing else than finding the minimum in the error surface.

    0
    No votes yet

Pages

register