East West Airlines Cluster Analysis

Essay by balajivenky06 • July 3, 2018 • Case Study • 705 Words (3 Pages) • 3,589 Views

Essay Preview: East West Airlines Cluster Analysis

Page 1 of 3

Do you need to normalize the data before applying any clustering technique? Why or why not?

Yes, we need to normalize the data before applying any data. The reason is scale will be biased while calculating distance between clusters and also within clusters. Also, if we do not normalize the data, the large values will impact the variables having small values while calculating distance.

In East West Airlines data, by having physical look, it is observed that columns like Balance, Bonus_miles, Days_since_enroll having high values when compared with other variables. Hence these column will highly skew the analysis if not normalized.

[pic 1]

Apply hierarchical clustering with Euclidean distance and Ward’s method. How many clusters do appear?

Ward's minimum variance method is a special case of the objective function approach originally presented by Joe H. Ward. Ward suggested a general agglomerative hierarchical clustering procedure, where the criterion for choosing the pair of clusters to merge at each step is based on the optimal value of an objective function. This objective function could be "any function that reflects the investigator's purpose."

[pic 2]

Ideally for hierarchical clustering, we can generate n clusters with each with single item. If we cut the cluster at 600, then we will get 3 primary clusters

[pic 3]

Here I used Ward.D method.

The difference between ward.D and ward.D2 is the difference between the two clustering criteria that in the manuscript are called Ward1 and Ward2.

It basically boils down to the fact that the Ward algorithm is directly correctly implemented in just Ward2 (ward.D2), but Ward1 (ward.D) can also be used, if the Euclidean distances (from dist()) are squared before inputing them to the hclust() using the ward.D as the method.

Compare cluster centroids to characterize different clusters and try to give each cluster a label—a meaningful name that characterizes the cluster.

[pic 4]

Cluster2 → Flight_miles in last 12 months is very much higher than other 2 clusters, also Qual_miles in top flight is also high, hence this cluster can be tagged as “FREQUENT BUSINESS CLASS TRAVELERS”

Cluster3 → Flight_miles in less than cluster2 but very much higher than Cluster1. Also, Qual_miles is very less, hence this cluster can be tagges as “FREQUENT ECONOMY CLASS TRAVELERS”

Cluster1 → These are fliers apart from other 2 category, which can be tagged as “OCCASIONAL TRAVELERS”

To check the stability of clusters, remove a random 5% of the data (by taking a random sample of 95% of the records), and repeat the analysis. Does the same picture emerge?

[pic 5]

If we compare the new dendrogram with old one, we can see the changes in scale when clustering groups though the picture looks same. Hence even 5% change in samples, it will impact the clustering groups

Cluster all passengers again using k-means clustering. How many clusters do you want to go with? How did you decide on the number of clusters? Explain your choice on the number of clusters.

[pic 6]

...

Download as: txt (4.5 Kb) pdf (413.3 Kb) docx (90.8 Kb)

Continue for 2 more pages »

Read Full Essay Save

Only available on AllBestEssays.com

Similar Essays

Augat Electronics Analysis

The Canadian market for cable connectors is a relatively small one that increases at a slow rate. There are on the other hand some signs

2,370 Words | 10 Pages
Situational Analysis of Singapore Airlines

1.0 INTRODUCTION The history of Singapore Airlines dates back to 1 May 1947, when the first scheduled flight of Malaysian Airlines took off from Singapore

2,025 Words | 9 Pages
Analysis on Southwest Airlines

Abstract Airline companies are facing many challenges keeping their cost down and profits up. Some of the main issues are gas prices and pilots pay.

877 Words | 4 Pages
Wilson-West Manufacturing Cost Analysis and Reporting

COST ANALYSIS AND REPORTING BY: SAMYRA RAMIREZ Being hired as the managerial accountant for Wilson-West Manufacturing's new cabinet division; I will be setting up a

1,896 Words | 8 Pages
South West Airlines Swot

SWOT: Strengths: * The key philosophy of the hotel was to provide high quality services at an competitive and cost effective pricing level. * Targeting

762 Words | 4 Pages
Southwest Airlines Analysis

September 23, 2015 To: Mr. Belin From: Tianying Wang section 10 Re: Andrew Inkpen “Southwest Airlines” Diagnosis: The biggest problem that Southwest Airline has is

487 Words | 2 Pages
Environmental Analysis of the Airline Industry

MGT 4335 February 13th, 2017 Introduction The airline industry has impacted travel as we know it today because it allows people and goods to be

2,395 Words | 10 Pages
The North West Company Case Analysis

Supply Chain Management Professional The North West Company Case Analysis Instructor: Robert Greene Written By: Connie Gong Date: June 23, 2014 ________________ Table of Contents

2,850 Words | 12 Pages