COMBINING SEVERAL OPEN DATA SOURCES
While discussing what external datasets we could use to create a prediction model of burglaries I came up with the idea that perhaps there was a relation between these same datasets and the number of traffic accidents. I had wanted to make a circular visualization for a while and this seemed like a good opportunity
I found all datasets, the traffic accidents, the amount of rain, the snow days (this was actually tucked away in PDF reports), holidays and the number of daylight hours (had to webscrape this) online. I created all three line charts (daylight hours, traffic accidents & LOESS curve and rain fall) in R and then imported these graphs in Illustrator where I bend them into a circle and added all the colors, icons and annotations. For the latter I searched the news for that particular date to see if anything could be found relating to traffic jams and such
In the end I do see the correlation between holidays or daylight hours and the number of traffic jams. From the annotations I also know that snow during rush hour gives the worst days but not all snow week days resulted in many accidents. Rain shows even less correlation. I guess using the daily average of one station in the center of the Netherlands is too generalizing even if the Netherlands is already rather small
You can get a print/poster of this data visualization in my store
Comments are closed.