Mall Customer Segmentation Data로 해보는 군집화

1. Mall Customer Segmentation Data

1.1 Mall Customer Segmentation Data

Kaggle에 있는 쇼핑몰 고객 데이터
https://www.kaggle.com/vjchoudhary7/customer-segmentation-tutorial-in-python

2. Mall Customer Segmentation Data 실습

2.1 Data Load

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

dataset = pd.read_csv('https://raw.githubusercontent.com/hmkim312/datas/main/mallcustomer/Mall_Customers.csv')
dataset.tail()

	CustomerID	Gender	Age	Annual Income (k$)	Spending Score (1-100)
195	196	Female	35	120	79
196	197	Female	45	126	28
197	198	Male	32	126	74
198	199	Male	32	137	18
199	200	Male	30	137	83

총 200개의 데이터
id : 고윳값
gender : 성별,
income : 소득,
spendig score : 쇼핑몰에서 부여한 고객의 점수 (소비금액 및 행동 패턴 기반)

2.2 5개 군집으로 해보기

from sklearn.cluster import KMeans

X = dataset.iloc[:, [3,4]].values
model = KMeans(n_clusters= 5, init = 'k-means++', random_state = 13)
cluster = model.fit_predict(X)

데이터에서 income과 Score만 가지고 군집화를 해봄

2.3 시각화

plt.figure(figsize=(12, 10))
plt.scatter(X[cluster == 0, 0], X[cluster == 0, 1], s= 100, c = 'red', label = 'Cluster 1')
plt.scatter(X[cluster == 1, 0], X[cluster == 1, 1], s= 100, c = 'blue', label = 'Cluster 2')
plt.scatter(X[cluster == 2, 0], X[cluster == 2, 1], s= 100, c = 'green', label = 'Cluster 3')
plt.scatter(X[cluster == 3, 0], X[cluster == 3, 1], s= 100, c = 'cyan', label = 'Cluster 4')
plt.scatter(X[cluster == 4, 0], X[cluster == 4, 1], s= 100, c = 'magenta', label = 'Cluster 5')
plt.scatter(model.cluster_centers_[:, 0], model.cluster_centers_[:,1], s = 300, c = 'yellow', label = 'Centroids')
plt.title('Clusters of customers')
plt.xlabel('Annual Income (k$)')
plt.ylabel('Spending Score (1 - 100)')
plt.legend()
plt.show()

x = X[cluster == 4, 0], y = X[cluster == 4, 1] 뜻으로, x는 cluster가 4인것의 x축(income), y는 cluster가 4인것의 y축(score)란 뜻
같은 의미로 model.cluster.centers_로 각 센터의 x축([:, 0])과 y축([:, 1])임
쇼핑몰 고객을 5개의 군집으로 나눈것도 괜찮아 보임

Mall Customer Segmentation Data로 해보는 군집화

1. Mall Customer Segmentation Data

1.1 Mall Customer Segmentation Data

2. Mall Customer Segmentation Data 실습

2.1 Data Load

2.2 5개 군집으로 해보기

2.3 시각화

Recent Update

Trending Tags

Contents

Trending Tags

Mall Customer Segmentation Data로 해보는 군집화

1. Mall Customer Segmentation Data

1.1 Mall Customer Segmentation Data

2. Mall Customer Segmentation Data 실습

2.1 Data Load

2.2 5개 군집으로 해보기

2.3 시각화

Recent Update

Trending Tags

Contents

Further Reading

군집 분석 (2) (Clustering)

클러스터링(Clustering)

신용카드 부정 사용자 데이터로 해보는 부스팅

Trending Tags