Tim Erickson (Epistemological Engineering)
Abstract
Bring your laptop! We’ll do a quick, deep dive into a 10M+-case dataset using CODAP, a free web-based data analysis tool. We’ll figure out, for example, how many people took the train to a baseball game in San Francisco using actual passenger data from BART; or how much a college degree is worth—and for whom. We will experience the data science cycle as we refine the data and improve visualizations to answer our questions. What is it about this that “smells like” data science? Would it be good preparation for using professional data science tools such as R or Python? We’ll discuss this and more, and see actual (high-school!) student work on similar tasks.