STA/ISS 313 - Spring 2024 - Project 1
Data from Tidy Tuesday’s London Marathon page, which has two datasets: winners.csv
and london_marathon.csv
winners
: year, category of race (men, women, wheelchair men, wheelchair women), nationality, and winning time.
london_marathon
: date of race, number of applicants, number accepted, number of starters, number of finishers, how much was raised for charity, and each year’s official charity.
We want to answer how winning performances among the four different race categories have changed over time and how these performances vary from year to year.
geom_line()
and geom_point()
geom_col()
to create a sideways bar chart showing the relationship between winning time and temperatureWheelchair Men and Women started with the largest relative win time (+85% and +95% for men and women wheelchair, respectively), but has decreased significantly over time
Included zoomed-in version of the plot to gain more insight into trends of men & women running over time
There appears to be no apparent relationship between temperature and marathon winning time
We’re seeking to better understand how the London Marathon acceptance rate varies from year to year from 1981-2020 and gain insight into potential trends.
geom_point()
and geom_segment()
annotate()
to display mean acceptance rate, provide context, and show outliersLondon Marathon acceptance rate was above the mean in 1981 and from 1988 to 2008 and was below the mean acceptance rate for all other years
2020 had an almost 0% acceptance rate due to COVID-19 pandemic, as only elite runners were permitted to compete
Decade 1 and 4 were below the mean acceptance rate of 36.3% while Decade 2 and 3 were above