Post on 25-Feb-2016
description
Last Minute Vacation
Taylor McGannDanny Christensen
Russ Taylor
Agenda
Introduction
Data Collection
Data Summarization
Models
Conclusions
Future Research
Last minute vacation?...What?
How can we get a cheap ticket?
1What is the best day to purchase tickets?
2How far in advance should we purchase tickets?
3Are we getting the best price?
Ruby
Selenium WebDriver
Kayak
Loads of CSV files
"We don't want bots running about trying to book airline tickets. They tend to try to cram large suitcases in the overhead bin, and they prattle on about celebrities they
know while you are trying to watch the movie."
How about that data?
Name Type Description
Airline Text An airline. For example, AA equals American Airlines.
Arrive Text Arrival city as a three letter code
Arrive Date Date Arrival date.
Arrive Time Integer Flight arrival time in military hours.
Class Text Class of a flight. There are five different types of classes: Business, coach, mixed, premium, and first.
Depart Text Departure city as a three letter code.
Depart Date Date Departure date.
Depart Time Integer Flight departure time in military hours.
Departure Day Text Day of the week a flight departs.
Difference Integer Number of days between the download and departure data.
Download Date Date Download date or date a flight would have been purchased.
Download Day Text Day of the week flight data was downloaded.
Duration Integer Number of minutes in flight.
Equipment Text Type of plane.
Flight Integer Flight number.
Price Integer Total cost for a flight.
Record Integer Record number of each item in an individual csv. In the combined data set this number is meaningless.
Stops Integer Total number a stops or layovers for a flight.
Name Type Description
Airline Text An airline. For example, AA equals American Airlines.
Arrive Text Arrival city as a three letter code
Arrive Time Integer Flight arrival time in military hours.
Class Text Class of a flight. There are five different types of classes: Business, coach, mixed, premium, and first.
Departure Day Text Day of the week a flight departs.
Difference Integer Number of days between the download and departure data.
Download Day Text Day of the week flight data was downloaded.
Duration Integer Number of minutes in flight.
Price Integer Total cost for a flight.
Stops Integer Total number a stops or layovers for a flight.
Predicting Price
Final Variables
ANN
ANN
ANN
KNN
KNN
Linear Regression
Linear Regression
*NOTE: Table does not include all variables
1What is the best day to purchase tickets?
Tuesday Sunday
Conclusions
2How far in advance should we purchase tickets?
Conclusions1 Day
60 Days
120 Days
180 Days
3Are we getting the best price?
Conclusions
Hard to determine.
Too many variables.
Out of scope.
Future Research
Explore variables in isolation(E.g. destination, airline, etc.)
Add a "Buy/Buy Not" variable to do categorical analysis.
Future Research
Collect data over a longer period of time.
Isolate seasonal and cyclical trends.
Future Research
Questions