Hey Dan,
I'm just starting out too, and thought this would be an interesting one ot start with, I had some ideas about where to begin. I'm going to extract the data and basically look at afew instances of flights history and brgin
making some small basic groupings, so identifying schdeuled flights, looking at certain routes, and get a really good feel for the data first with some basic visualisations.
If you are interested we could team up and share insights/models. I am going to use Qlikview for the basic visualisation stuff, mapping etc and R/Python for the modeling. I was going to rent an EC2 for the crunching. I'm based
in the UK and will be pottering on this in the evenings.
Let me know
Mark
with —