Plots, Outliers, and Justin Timberlake: Data Visualization, Part 2: Crash Course Statistics #6

Today we’re going to finish up our unit on data visualization by taking a closer look at how dot plots, box plots, and stem and leaf plots represent data. We’ll also talk about the rules we can use to identify outliers and apply our new data viz skills by taking a closer look at how Justin Timberlake’s song lyrics have changed since he went solo.

We scraped our Justin Timberlake song data from lyrics.com.

If you’re interested in how we did it or would like to try out the code on a different artist, check out our code on GitHub: https://github.com/cmparlettpelleriti/CC2018/tree/master/unique_lyrs

DISCLAIMER: Please be respectful to lyrics websites when scraping data. Some sites may have limits on the number of requests you can make each day.