A collection of Java applets and simulations covering a range of topics (descriptive statistics, confidence intervals, regression, effect size, ANOVA, etc.).
A collection of Java applets and simulations covering a range of topics (descriptive statistics, confidence intervals, regression, effect size, ANOVA, etc.).
Examples of real data/studies and their analyses and interpretation.
This UC Berkeley Foundations of Data Science course combines three perspectives: inferential thinking, computational thinking, and real-world relevance. Given data arising from some real-world phenomenon, how does one analyze that data so as to understand that phenomenon? This course teaches critical concepts and skills in computer programming and statistical inference, in conjunction with hands-on analysis of real-world datasets, including economic data, document collections, geographical data, and social networks. It delves into social issues surrounding data analysis such as privacy and design.
How can we accurately model the unpredictable world around us? How can we reason precisely about randomness? This course will guide you through the most important and enjoyable ideas in probability to help you cultivate a more quantitative worldview.
By the end of this course, you’ll master the fundamentals of probability and random variables, and you’ll apply them to a wide array of problems, from games and sports to economics and science. This course includes 62 interactive quizzes and more than 400 probabilty-based problems with solutions. Access to this course requires users to sign up for a free account.
Which is more robust against outliers: mean or median? This app demonstrates the (in)stability of these descriptive statistics as the value of an outlier and the number of data points change.
Approximating a normal distribution with a binomial distribution
This page supports an in-class exercise that highlights several key Bayesian concepts. The scenario is as follows: a large paper bag contains pieces of candy with wrappings of different color, and we are interested in learning about the unknown proportion of yellow-wrapped pieces of candy. After completing the exercises, we will be familiar with the following concepts and ideas: probability distributions can quantify degree of belief, prior distribution, posterior distribution, sequential updating, conjugacy, Cromwell’s Rule (http://en.wikipedia.org/wiki/Cromwell's_rule), the data overwhelm the prior, Bayes factors, Savage-Dickey density ratio, sensitivity analysis, coherence.
Find the best linear fit for a given set of data points and residuals (or let this app show you how it is done).
When does a significant p-value indicate a true effect? This app will help with understanding the Positive Predictive Value (PPV) of a p-value.
This app is based on Ioannidis, J. P. A. (2005). Why most published research findings are false. PLoS Medicine, 2(8), e124. http://doi.org/10.1371/journal.pmed.0020124
The app allows you to see the trade-offs on various types of outlier/anomaly detection algorithms. Outliers are marked with a star and cluster centers with an X.