Back in February my brother asked us to take a look at some data for his Resident's Association. The data related to AirBnB listings in Dublin city centre. We don't do a lot of pro-bono work, but I'd forgotten to get him a Christmas present or a birthday present and was feeling guilty so I agreed to look at the data.
First of all, it was just data. A comma separated variable file with a munge of text and numeric data. Scrolling and filtering would provide some answers, but they wouldn't spark a discussion. In my experience this is a common complaint. To paraphrase the Ancient Mariner - "Data, Data, Everywhere...."
What we did didn't take very long. But it does highlight the importance of information-centric thinking rather than technology or tool driven thinking when it comes to data and analysis. It also highlights the need to think about how information will be presented. Handing someone a spreadsheet of data may not result in actionable insight or intelligence.
One of the first challenges was to determine how many days availability constituted "not a place someone is actually living in". We set a few arbitrary levels and I suggested a conservative figure be used for the analysis. Available for 85% of the year sounded about right - that's roughly 320 days or more.
Our findings are a bit disturbing given the level of homelessness in Dublin City and the fact that families in Ireland are living in small hotel rooms rather than anything approaching a reasonable family home. Based on our analysis of the InsideAirBnB data provided these two statistics stands out.
The Microsoft Mix video below contains some other analysis. It also shows how picturing data using graphs or visual metaphors can help get a message across.
Our analysis in this case was a little rough and ready (I spent about an hour in total on it). With more time (and a bit more data) a lot more could be done.