Resource Library


  • This online software lets you load data and make professional-looking graphs from it. Graph types include basic (scatterplot, line plot, bar chart), statistical (histogram, box plot), scientific (error bars, heat map, contour), 3D, and financial (e.g., time series) charts. Other graphs are available with the paid pro version. A login is required, which allows you to upload data and save it for later use.

  • March 24, 2009 Activity webinar presented by Nicholas Horton, Smith College, and hosted by Leigh Slauson, Otterbein College. Students have a hard time making the connection between variance and risk. To convey the connection, Foster and Stine (Being Warren Buffett: A Classroom Simulation of Risk and Wealth when Investing in the Stock Market; The American Statistician, 2006, 60:53-60) developed a classroom simulation. In the simulation, groups of students roll three colored dice that determine the success of three "investments". The simulated investments behave quite differently. The value of one remains almost constant, another drifts slowly upward, and the third climbs to extremes or plummets. As the simulation proceeds, some groups have great success with this last investment and become the "Warren Buffetts" of the class. For most groups, however, this last investment leads to ruin because of the variance in its returns. The marked difference in outcomes shows students how hard it is to separate luck from skill. The simulation also demonstrates how portfolios, which are weighted combinations of investments, reduce the variance. In the simulation, a mixture of two poor investments is surprisingly good. In this webinar, the activity is demonstrated along with a discussion of goals, context, background materials, class handouts, and references. Extra materials are available for download free of charge.

  • This is a chapter on data wrangling excerpted from a book on data science. The book is “Modern Data Science with R,” and the authors are Benjamin J. Baumer, Daniel T. Kaplan, and Nicholas J. Horton. It contains the R code needed for basic data tasks such as sorting, arranging, and summarizing.

  • This is a chapter on ethics excerpted from a book on data science. The book is “Modern Data Science with R,” and the authors are Benjamin J. Baumer, Daniel T. Kaplan, and Nicholas J. Horton. The chapter presents several ethical dilemmas, introduces a framework for evaluating ethical issues, and then revisits the dilemmas to resolve them.

  • This site is a lesson on using SQL. It starts with a simple SELECT query. The user must type in the correct command to select certain columns from a database. After completing the first lesson, the user may continue to more complicated lessons.

  • This tutorial on SQL teaches the most used commands. There is a short explanation, then the user is asked a simple question. If the typed answer is correct, the user continues to the next lesson.
  • Notes on hypothesis testing and how to interpret the p-value with respect to the significance level of a hypothesis test.
  • This online application allows users to import data from online resources such as Facebook, Google Analytics, and GitHub, as well as from spreadsheets on their own computers. They can then drag and drop variables to make graphs automatically. The basic version is free, but you can upgrade to a paid version that allows combining data across services. If the data come from an online resource, the user can choose to have Data Hub keep the graphs updated as the data change.
  • This is an e-book tutorial for R. It is organized according to the topics usually taught in an Introductory Statistics course. Topics include: Qualitative Data; Quantitative Data; Numerical Measures; Probability Distributions; Interval Estimation; Hypothesis Testing; Type II Error; Inference about Two Populations; Goodness of Fit; Analysis of Variance; Non-parametric methods; Linear Regression; and Logistic Regression.
  • The Comprehensive Epidemiologic Data Resource is a collection of data sets. It includes definitions of each variable in each data set. A login is required to retrieve the data sets. Registration requires your name and address, the name of the study, and a detailed description of the intended use of the data.
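The SQL lessons listed above begin with a simple SELECT query that picks certain columns from a table. As a rough illustration of that first step, the sketch below runs such a query from Python using the standard-library sqlite3 module. The table name, column names, and rows are hypothetical and are not taken from either lesson.

```python
import sqlite3

# Hypothetical table and rows, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE movies (id INTEGER, title TEXT, director TEXT, year INTEGER)"
)
conn.executemany(
    "INSERT INTO movies VALUES (?, ?, ?, ?)",
    [
        (1, "Film A", "Director X", 1995),
        (2, "Film B", "Director Y", 2001),
        (3, "Film C", "Director X", 2010),
    ],
)

# A simple SELECT query: list the columns you want, then name the table.
rows = conn.execute("SELECT title, year FROM movies ORDER BY id").fetchall()
for title, year in rows:
    print(title, year)

conn.close()
```

Only the two requested columns come back; the id and director columns are left out of the result.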
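The hypothesis-testing notes listed above concern how a p-value is read against the significance level. A minimal sketch of that comparison follows, assuming a two-sided z-test with a standard normal test statistic. The function names and the value of z are hypothetical and are not taken from the notes.

```python
import math


def two_sided_p_value(z):
    """Two-sided p-value for a standard normal test statistic z."""
    # For Z ~ N(0, 1), P(|Z| >= |z|) = erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))


def decide(p_value, alpha=0.05):
    """Compare a p-value with the significance level alpha."""
    return "reject H0" if p_value <= alpha else "fail to reject H0"


z = 2.1  # hypothetical test statistic
p = two_sided_p_value(z)
print(f"p = {p:.4f}")
print("alpha = 0.05:", decide(p, alpha=0.05))
print("alpha = 0.01:", decide(p, alpha=0.01))
```

The same p-value can lead to different decisions at different significance levels, which is why the level should be fixed before the data are examined.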

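The variance and risk activity described in the webinar entry above can be imitated in a few lines of code. The sketch below is only a loose illustration: the die-face multipliers are invented for this example and are not the values used by Foster and Stine, it models two investments rather than three, and it assumes a 50/50 portfolio rebalanced every round.

```python
import itertools
import math
import random
import statistics

# Hypothetical per-round return multipliers, one per die face.
# Invented for illustration; not the values from the paper.
STEADY = [0.95, 1.0, 1.0, 1.0, 1.0, 1.1]  # stays almost constant
RISKY = [0.1, 0.3, 1.0, 2.5, 3.0, 3.0]    # climbs to extremes or plummets


def mixed_faces(a, b):
    """All equally likely outcomes of a 50/50 mix rebalanced each round."""
    return [(x + y) / 2 for x, y in itertools.product(a, b)]


def log_growth(faces):
    """Expected log return per round; negative means long-run decline."""
    return statistics.fmean([math.log(f) for f in faces])


def simulate(faces, rounds, rng, start=1000.0):
    """Wealth after multiplying by one randomly chosen face per round."""
    wealth = start
    for _ in range(rounds):
        wealth *= rng.choice(faces)
    return wealth


MIXED = mixed_faces(STEADY, RISKY)
rng = random.Random(2009)  # fixed seed so reruns give the same draws

for name, faces in [("steady", STEADY), ("risky", RISKY), ("mixed", MIXED)]:
    finals = [simulate(faces, 50, rng) for _ in range(200)]
    print(
        f"{name:>6}: variance per round = {statistics.pvariance(faces):.3f}, "
        f"log growth = {log_growth(faces):+.3f}, "
        f"median final wealth = {statistics.median(finals):,.0f}"
    )
```

With these made-up faces, the risky investment has the highest average return per round but a negative expected log return, so most simulated groups end up far below where they started. The mixture has a much smaller variance and a positive expected log return, which mirrors the point that a portfolio can do better than either of its parts.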