hye-log

[๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech]WEEK 07_DAY 30 ๋ณธ๋ฌธ

Boostcourse/AI Tech 4๊ธฐ

[๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech]WEEK 07_DAY 30

iihye_ 2022. 11. 1. 18:57

๐Ÿฅช ๊ฐœ๋ณ„ํ•™์Šต


[9] Ensemble

1. Ensemble

1) ๋” ๋‚˜์€ ์„ฑ๋Šฅ์„ ์œ„ํ•ด ์„œ๋กœ ๋‹ค๋ฅธ ์—ฌ๋Ÿฌ ํ•™์Šต ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ

2) Ensemble of Deep NN

- Low Bias, High Variance -> Overfitting

3) Model Averaging(Voting)

- ์„œ๋กœ ๋‹ค๋ฅธ ๋ชจ๋ธ์—๊ฒŒ ๊ฐ๊ฐ์˜ ํŠน์„ฑ์ด ์žˆ์„ ๋•Œ ensemble ํšจ๊ณผ๊ฐ€ ์žˆ์Œ

- hard voting : ๋†’์€ ๋ชจ๋ธ์˜ ๊ฒฐ๊ณผ๋งŒ ๊ณ ๋ คํ•˜๋Š” ๊ฒƒ(๋‹ค์ˆ˜๊ฒฐ)

- soft voting : ๋ชจ๋“  ๋ชจ๋ธ์˜ ๊ฒฐ๊ณผ๋ฅผ ๊ณ ๋ คํ•˜๋Š” ๊ฒƒ(ํ‰๊ท )

4) Cross Validation

- valid set์„ ํ•™์Šต์— ํ™œ์šฉํ•  ์ˆ˜๋Š” ์—†์„๊นŒ?

5) Stratified K-Fold Cross Validation

- ๊ฐ€๋Šฅํ•œ ๊ฒฝ์šฐ๋ฅผ ๋ชจ๋‘ ๊ณ ๋ ค

- ๋ถ„๋ฆฌํ•  ๋•Œ Class ๋ถ„ํฌ๊นŒ์ง€ ๊ณ ๋ คํ•จ

- ๋งŒ์•ฝ K=5 ๋ผ๋ฉด 5๊ฐœ์˜ ๋ชจ๋ธ์ด ๋งŒ๋“ค์–ด์ง€๊ณ  ๊ฐ ๋ชจ๋ธ์€ 80%์˜ train, 20%์˜ valid ๋ถ„ํฌ๋ฅผ ๊ฐ€์ง€๊ฒŒ ๋จ

6) TTA(Test Time Augmentation)

- ํ…Œ์ŠคํŠธํ•  ๋•Œ augmentation ํ•˜๋Š” ๊ฒƒ

- ์ถœ๋ ฅ๋œ ์—ฌ๋Ÿฌ ๊ฐ€์ง€ ๊ฒฐ๊ณผ๋ฅผ ์•™์ƒ๋ธ”

- ๋ชจ๋ธ์˜ ์ผ๋ฐ˜ํ™”๋ฅผ ์œ„ํ•ด์„œ ์‚ฌ์šฉํ•จ

7) ์•™์ƒ๋ธ” ํšจ๊ณผ๊ฐ€ ํ™•์‹คํžˆ ์žˆ์ง€๋งŒ ๊ทธ๋งŒํผ ํ•™์Šต, ์ถ”๋ก  ์‹œ๊ฐ„์ด ์†Œ์š”๋จ

 

2. Hyperparameter Optimization

1) Hyperparameter

- ์‹œ์Šคํ…œ์˜ ๋งค์ปค๋‹ˆ์ฆ˜์— ์˜ํ–ฅ์„ ์ฃผ๋Š” ์ฃผ์š”ํ•œ ํŒŒ๋ผ๋ฏธํ„ฐ

- learning rate, batch size, loss, optimizer, dropout, regularization, k-fold, hidden layer, ...

2) Optumna

- ํŒŒ๋ผ๋ฏธํ„ฐ ๋ฒ”์œ„๋ฅผ ์ฃผ๊ณ  ๊ทธ ๋ฒ”์œ„ ์•ˆ์—์„œ trials ๋งŒํผ ์‹œํ–‰



๐Ÿฅช ์˜ค๋Š˜์˜ ํšŒ๊ณ 

๋ฐ์ผ๋ฆฌ ์Šคํฌ๋Ÿผ ๋•Œ๋Š” validation ๊ฒฐ๊ณผ๋ž‘ test ๊ฒฐ๊ณผ๋ž‘ ์ฐจ์ด๊ฐ€ ๋งŽ์ด ๋‚˜์„œ ์™œ ๊ทธ๋Ÿด์ง€ ๊ณ ๋ฏผํ•ด๋ดค๋‹ค. validation์€ 99% ๊ฐ€๊นŒ์ด ๋‚˜์˜ค๋Š” ๋ฐ˜๋ฉด์— test๋Š” 70% ์–ธ์ €๋ฆฌ์— ์žˆ์–ด์„œ ๋ฐ์ดํ„ฐ ๋ถ„ํฌ ๋•Œ๋ฌธ์— ์ฐจ์ด๊ฐ€ ๋‚œ๋‹ค๊ณ  ์ƒ๊ฐํ–ˆ๋‹ค. ๊ทธ๋Ÿฐ๋ฐ ๋ฐค์— ๋Œ๋ ค๋ณธ ๋ชจ๋ธ๋„ validation์€ ์ž˜ ๋‚˜์˜ค๋Š”๋ฐ test๊ฐ€ ์ž˜ ๋‚˜์˜ค์ง€ ์•Š์•„์„œ overfitting์ด ์•„๋‹Œ๊ฐ€ ์˜์‹ฌํ–ˆ์—ˆ๋‹ค.. ํ”ผ์–ด์„ธ์…˜ ๋•Œ ๋‚ด๋ฆฐ ๊ฒฐ๋ก ์€ train๊ณผ valid๋ฅผ ๋‚˜๋ˆŒ ๋•Œ ๋žœ๋คํ•˜๊ฒŒ ๋‚˜๋ˆ„๋‹ค๋ณด๋‹ˆ feature๊ฐ€ ์•„๋‹ˆ๋ผ ์‚ฌ๋žŒ์„ ํ•™์Šตํ•ด์„œ ์šฐ๋ฆฌ๊ฐ€ ๋งŒ๋“  ๋ชจ๋ธ์ด overfitting ๋œ ๊ฒƒ ๊ฐ™๋‹ค๋Š” ์˜๊ฒฌ์ด ๋ชจ์˜€๋‹ค. ๊ทธ๋ž˜์„œ overfitting์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด์„œ ๋‹ค์–‘ํ•œ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ–ˆ๊ณ , ๊ทธ ์ค‘์—์„œ ๋‚˜๋Š” ๊ฒŒ์‹œํŒ์— ์˜ฌ๋ผ์˜จ multi-label classification์„ ์ ์šฉ์‹œ์ผœ๋ณด๊ธฐ๋กœ ํ–ˆ๋‹ค. ๋ฐฉ๋ฒ•์€ ๋‹ค ์ฃผ์–ด์ ธ ์žˆ์–ด์„œ ํ•˜๋‚˜์”ฉ ๋”ฐ๋ผํ–ˆ๋Š”๋ฐ ์•ฝ๊ฐ„ ์ฝ”๋“œ์— ์˜ค๋ฅ˜๊ฐ€ ์žˆ์–ด์„œ ๊ณ„์† ๋””๋ฒ„๊น… ์ค‘์ด๋‹คใ… -ใ… 

728x90
Comments