[Franchise Store Location Analysis] countplot, scatterplot, jointplot

yeon42 2021. 8. 5. 16:42

8.2 Drawing a countplot from categorical values

 

  • Count how often each value of "브랜드명" (brand name) occurs
brand_count = df_31["브랜드명"].value_counts()

# Plot the same counts as a horizontal bar chart
df_31["브랜드명"].value_counts().plot.barh()

 

 

 

 

 

 

  • Use normalize=True to get each brand's share of the rows instead of the raw count
df_31["브랜드명"].value_counts(normalize=True).plot.barh()
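To make the difference between the two calls concrete, here is a minimal sketch on a made-up Series. The brand labels "A", "B", and "C" are placeholders, not values from df_31.

```python
import pandas as pd

# Hypothetical stand-in for df_31["브랜드명"]; the labels are invented.
brands = pd.Series(["A", "B", "A", "A", "B", "C"], name="브랜드명")

counts = brands.value_counts()                # absolute frequency per brand
ratios = brands.value_counts(normalize=True)  # share of all rows per brand

print(counts)
print(ratios)
```

Both results are sorted from most to least frequent, and the shares add up to 1, so the bar chart keeps its shape and only the axis scale changes.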

 

 

 

 

 

 

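  • Draw a countplot
import seaborn as sns

# value_counts() sorts by frequency, so pass the same order to countplot
# to keep every label on its own bar.
g = sns.countplot(data=df_31, x="브랜드명", order=brand_count.index)

# Label the first two bars by hand (.iloc selects by position)
g.text(x=0, y=brand_count.iloc[0], s=brand_count.iloc[0])
g.text(x=1, y=brand_count.iloc[1], s=brand_count.iloc[1])

# The same labels written as a loop over every bar
g = sns.countplot(data=df_31, x="브랜드명", order=brand_count.index)

for i, count in enumerate(brand_count):
    g.text(x=i, y=count, s=count)

  - x and y are the coordinates on the x-axis and y-axis, and s is the text to display.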
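The labeling step can be tried on toy data. The sketch below uses invented brand labels and lists the rarer brand first on purpose. Without `order=`, seaborn would draw the bars in order of first appearance, which would not match the frequency-sorted counts.

```python
import matplotlib
matplotlib.use("Agg")  # off-screen backend so the sketch runs without a display
import pandas as pd
import seaborn as sns

# Toy data with placeholder brands; "B" appears first but is less frequent.
toy = pd.DataFrame({"브랜드명": ["B", "A", "A", "A", "B", "A"]})
brand_count = toy["브랜드명"].value_counts()  # A: 4, B: 2

# Draw the bars in the same order as brand_count so positions line up.
ax = sns.countplot(data=toy, x="브랜드명", order=brand_count.index)

# Put each count just above its bar, centered horizontally.
for i, count in enumerate(brand_count):
    ax.text(x=i, y=count, s=count, ha="center", va="bottom")

labels = [t.get_text() for t in ax.texts]
print(labels)
```

The `ha` and `va` arguments are optional additions here. They center the label over the bar instead of starting it at the bar's midpoint.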

 

 

 

 

 

 

 

  • countplot that counts rows per "시군구명" (district) and colors the bars by "브랜드명" (brand)
import matplotlib.pyplot as plt

plt.figure(figsize=(15, 4))
g = sns.countplot(data=df_31, x="시군구명", hue="브랜드명")

  - In most districts, Baskin Robbins has the larger number of stores.
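The numbers behind a hue countplot can also be read from a cross-tabulation. This is a sketch on invented rows, with "X-gu", "Y-gu", "A", and "B" as placeholders rather than real districts or brands.

```python
import pandas as pd

# Hypothetical rows; district and brand values are placeholders.
toy = pd.DataFrame({
    "시군구명": ["X-gu", "X-gu", "X-gu", "Y-gu", "Y-gu", "Y-gu"],
    "브랜드명": ["A", "A", "B", "A", "B", "B"],
})

# One row per district, one column per brand, cells hold store counts.
table = pd.crosstab(toy["시군구명"], toy["브랜드명"])
print(table)

# Brand with the most stores in each district
leader = table.idxmax(axis=1)
print(leader)
```

Checking `leader` against the plot is a quick way to back a statement such as "one brand leads in most districts" with exact counts.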

 

 

 

 


 

8.3 Drawing a scatterplot

 

  • Scatterplot with pandas
# 경도 = longitude (x-axis), 위도 = latitude (y-axis)
df_31[["위도", "경도"]].plot.scatter(x="경도", y="위도")

 

 

 

 

 

  • Set hue to color the points by brand
sns.scatterplot(data=df_31, x="경도", y="위도", hue="브랜드명")

 

 

 

 

 

  • jointplot
sns.jointplot(data=df_31, x="경도", y="위도")

  - jointplot draws a scatterplot together with a histogram for each axis.
  - The histograms show where along longitude and latitude the stores are most concentrated.

 

 

sns.jointplot(data=df_31, x="경도", y="위도", kind="hex")

  - Color intensity encodes frequency: darker hexagons hold more points and lighter ones hold fewer.
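The idea of counting points per cell can be sketched with NumPy. Note the assumptions: this uses rectangular bins via `np.histogram2d`, not the hexagonal bins that kind="hex" draws, and the coordinates are invented.

```python
import numpy as np

# Made-up coordinates: a tight cluster of four points plus two outliers.
lon = np.array([127.01, 127.02, 127.02, 127.03, 126.90, 127.10])
lat = np.array([37.50, 37.51, 37.50, 37.51, 37.60, 37.45])

# counts[i, j] = number of points in longitude bin i and latitude bin j
counts, lon_edges, lat_edges = np.histogram2d(lon, lat, bins=4)

# Summing over latitude bins gives the histogram of longitude alone,
# which plays the role of the marginal plot on top of a jointplot.
lon_hist = counts.sum(axis=1)

# Index of the densest cell, i.e. the one that would be drawn darkest
densest = np.unravel_index(counts.argmax(), counts.shape)
print(counts)
print(lon_hist, densest)
```

The cell holding the four clustered points has the highest count, which is the same information the darkest hexagon conveys in the plot.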
