Engineering students dig through snowplow data to gauge Toronto鈥檚 response to winter storms
Last January, as 55 centimetres of snow blanketed Toronto over a period of just 15 hours, the city鈥檚 snow-clearing fleet appeared to struggle to keep up. But was it actually different than other storms, or did it just seem that way?
For three students in the 重口味SM鈥檚 Faculty of Applied Science & Engineering who were taking 鈥淒ata Science for Engineers,鈥 a graduate-level course taught by Sebastian Goodfellow, an assistant professor in the department of civil and mineral engineering, it was the perfect case study to test out their new number-crunching skills.
鈥淭here was a lot of news coverage at the time saying the city had poorly responded,鈥 says Katia Ossetchkina, a master鈥檚 candidate. 鈥淲e wanted to see if there was a way to analyze the movement and dispatch of snowplows and salt trucks across the city.鈥
Real-time data on the locations of Toronto鈥檚 more than 800 snowplows and salt trucks is publicly available during the winter months. There is even . But the team 鈥 which also included master鈥檚 candidates Thomas de Boer and Lucas Herzog 鈥 soon realized they needed more.
鈥淭here鈥檚 no historic storage,鈥 says de Boer. 鈥淵ou can鈥檛 just download it as a file, so we had to create an algorithm that would ping that web server and download the data and store it on our computer, which we could then use to build up our own historic database and do our analysis off that.鈥
By the time the team had its technique up and running, it was too late to gather data from the January storm. But by analyzing data from subsequent storms 鈥揳nd gleaning stats about the earlier ones from the city and local news articles 鈥 the researchers were able to verify that the city鈥檚 response improved as the winter went on.
鈥淲e learned that Toronto had increased the number of plows on the road in February, compared to January, and the crews were quicker to reach certain benchmarks, such as the percentage of roads that had been plowed by a certain point during the storm,鈥 says de Boer.
Herzog says that the team picked up other interesting trends as well.
鈥淥f course, they plow the arterial roads first, but we saw that they would stop plowing around 6 a.m. 鈥 just before the morning commute,鈥 says Herzog.
鈥淎nd that鈥檚 where a lot of these Twitter complaints stemmed from,鈥 adds de Boer. 鈥淧eople were wondering how they are supposed to get to an arterial road when the street outside their driveway is blocked by two feet of snow.鈥
Spurred on by these sorts of observations, the team decided to take the project a step further by applying their data analysis to Twitter messages. The team used Twitter鈥檚 application programming interface (API) to gather the comments of those tweeting to Toronto 311 and the City of Toronto Winter Operations account. They were then able to perform what is known as 鈥渟entiment analysis,鈥 measuring whether the words used in the tweets were positive or negative.
That allowed the team to compare the public response from the January storm to another one that occurred in February.
鈥淲e saw lots of negative tweets in January with people complaining about not being serviced yet, and that came with a lot of geographical information as well, so we could see the hardest hit areas,鈥 says Ossetchkina.
鈥淭hen we saw this reversing trend in February where people were saying, 鈥楾hank you,鈥 and saying that the city was doing a good job in specific regions. It was a very interesting performance metric.鈥
The team says that this type of data analysis could help other engineers on future projects. They have made their historical database publicly available, and have even crafted detailed instructions so that other teams can replicate their approach.
Goodfellow says he was very impressed with the students鈥 work.
鈥淲hat I like about this project is that it鈥檚 entirely unique,鈥 he says. 鈥淭his is a new dataset that the students have made publicly available, and that can now be used by other engineers to investigate new questions or to hone their data science skills.
鈥淓ven better than that, it鈥檚 a dataset from the city we all live in, which provided a special motivation for the students to truly go above and beyond.鈥