hye-log

[๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech]WEEK 03_DAY 11 ๋ณธ๋ฌธ

Boostcourse/AI Tech 4๊ธฐ

[๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech]WEEK 03_DAY 11

iihye_ 2022. 10. 5. 02:03

๐ŸŒฑ ๊ฐœ๋ณ„ํ•™์Šต


[1] Historical Review

1. Introduction

1) ์ธ๊ณต์ง€๋Šฅ(Artificial Intelligence) : ์‚ฌ๋žŒ์˜ ์ง€๋Šฅ์„ ๋ชจ๋ฐฉํ•œ ๊ฒƒ

2) ๋จธ์‹ ๋Ÿฌ๋‹(Machine Learning) : ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ํ•™์Šตํ•˜๋Š” ๊ฒƒ

3) ๋”ฅ๋Ÿฌ๋‹(Deep Learning) : Neural Network๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํ•™์Šตํ•˜๋Š” ๊ฒƒ

-  ๋”ฅ๋Ÿฌ๋‹์˜ ์ฃผ์š” ์š”์†Œ : data, model, loss, algorithm

 

2. Historical Review

1) 2012 AlexNet : CNN์„ ์‚ฌ์šฉํ•œ ๋ชจ๋ธ์ด๋ฉฐ, ๋”ฅ๋Ÿฌ๋‹์„ ์ด์šฉํ•œ ImageNet ๋Œ€ํšŒ์—์„œ 1๋“ฑ์„ ์ฐจ์ง€ํ•œ ๋ชจ๋ธ

AlexNet

2) 2013 DQN : ์•ŒํŒŒ๊ณ ๋ฅผ ๋งŒ๋“  ๋”ฅ๋งˆ์ธ๋“œ์—์„œ Atari Games์„ ํ”Œ๋ ˆ์ดํ•˜๋„๋ก ํ›ˆ๋ จํ•˜๋Š” ๋ฐ ์„ฑ๊ณตํ•œ ๋ชจ๋ธ

3) 2014 Encoder/Decoder : ๋‹จ์–ด๊ฐ€ ์—ฐ์†๋œ ๋ฌธ์žฅ์—์„œ ๋‹ค๋ฅธ ์–ธ์–ด๋กœ ๋ฌธ์žฅ์„ ์ƒ์„ฑ. ๊ธฐ๊ณ„์–ด ๋ฒˆ์—ญ์˜ ํŠธ๋ Œ๋“œ๋ฅผ ์ด๋”

4) 2014 Adam Optimizer : Adam optimizer๋ฅผ ์‚ฌ์šฉํ•˜๋ฉด ๊ฒฐ๊ณผ๊ฐ€ ์ž˜ ๋‚˜์˜ค๊ธฐ ๋•Œ๋ฌธ์— ์‚ฌ์šฉ

5) 2015 Generative Adversarial Network : ์ผ๋ช… GAN์œผ๋กœ ์ƒ์„ฑ ๋ชจ๋ธ์„ ์˜๋ฏธ

6) 2015 Residual Networks : network๋ฅผ ๊นŠ๊ฒŒ ์Œ“์œผ๋ฉด์„œ ์ง„์ •ํ•œ ๋”ฅ๋Ÿฌ๋‹์„ ๊ตฌํ˜„

7) 2017 Transformer : NLP, ์ปดํ“จํ„ฐ ๋น„์ „์—์„œ ์‚ฌ์šฉํ•˜๋Š” ์•„ํ‚คํ…์ฒ˜

8) 2018 BERT : ์ž์—ฐ์–ด ์ฒ˜๋ฆฌ ๋ชจ๋ธ

9) 2020 Self Supervised Learning : ํ•œ์ •์  ํ•™์Šต ๋ฐ์ดํ„ฐ์— unlabeled ๋ฐ์ดํ„ฐ๋ฅผ ํ™œ์šฉํ•˜์—ฌ visual representation์„ ํ•™์Šต

์ฐธ๊ณ ) https://dennybritz.com/posts/deep-learning-ideas-that-stood-the-test-of-time/


[2] Neural Networks & Multi-Layer Perceptron

1. Neural Networks

1) ๋น„์„ ํ˜• ๊ตฌ์กฐ๋ฅผ ํ–‰๋ ฌ๊ณฑ ์—ฐ์‚ฐ์„ ์ด์šฉํ•˜์—ฌ ํ•จ์ˆ˜๋ฅผ ๊ทผ์‚ฌ์‹œํ‚จ ๋„คํŠธ์›Œํฌ

2) Activation Function : ReLU, Sigmoid, Hyperbolic Tangent, ...

 

2. Multi-Layer Perceptron(MLP)

1) input๊ณผ output ์‚ฌ์ด์— hidden layer๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ๊นŠ๊ฒŒ ๋งŒ๋“ค ์ˆ˜ ์žˆ์Œ

2) loss function

- Regression Task : outlier์— ๋Œ€ํ•œ panalty๋ฅผ ํฌ๊ฒŒ ํ•จ

- Classification Task : output์ด one-hot vector๋กœ ๋‚˜์˜ค๊ธฐ ๋•Œ๋ฌธ์— ํ•ด๋‹นํ•˜๋Š” ์ฐจ์›์˜ ์ถœ๋ ฅ๊ฐ’๋งŒ ํฌ๊ฒŒ ํ•จ

- Probabilistic Task : uncetainty ์ •๋ณด๋ฅผ ํ•จ๊ป˜ ์ฐพ๊ณ  ์‹ถ์„ ๋•Œ ํ™•๋ฅ ์  ํƒœ์Šคํฌ์—์„œ ํ™œ์šฉ

 


๐ŸŒฑ ์˜ค๋Š˜์˜ ํšŒ๊ณ 

์›”์š”์ผ์€ ํ‘น(?) ์‰ฌ๊ณ  ํ™”์š”์ผ์— ๋งž๋Š” 3์ฃผ์ฐจ ๋ถ€์บ ! ์˜ค์ „์—๋Š” ๋ฉ˜ํ† ๋‹˜์˜ ๋ฉ˜ํ† ๋ง์œผ๋กœ ์‹œ์ž‘ํ–ˆ๋‹ค. ์—ฌ๋Ÿฌ ๊ฐ€์ง€ ์ธ์‚ฌ์ดํŠธ๋ฅผ ์–ป๋Š” ๋…ผ๋ฌธ ์‚ฌ์ดํŠธ๋“ค์„ ์•Œ๋ ค์ฃผ์…จ๋Š”๋ฐ paperwithcode ๋ฐ–์— ๋ชฐ๋ž๋Š”๋ฐ, ๋…ผ๋ฌธ์„ ๋ณผ ์ˆ˜ ์žˆ๋Š” ๋‹ค์–‘ํ•œ ์‚ฌ์ดํŠธ๋“ค์„ ์•Œ๋ ค์ฃผ์…”์„œ ์œ ์šฉํ–ˆ๋‹ค +_+ ์˜ค์ „์€ ๋งˆ๋ฌด๋ฆฌํ•˜๊ณ  ์ ์‹ฌ ๋จน๊ณ  ์˜คํ›„์—๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ ๊ฐ•์˜๋ฅผ ๋“ค์—ˆ๋‹ค. ์ฒซ ๋ฒˆ์งธ ๊ฐ•์˜๋Š” ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ณธ์— ๋Œ€ํ•œ ๊ฐ•์˜์˜ ์ „๋ฐ˜์ ์ธ overview ๊ฐ•์˜์˜€๋Š”๋ฐ, ๋”ฅ๋Ÿฌ๋‹์€ ๋ฐฐ์šฐ๋ฉด ๋ฐฐ์šธ์ˆ˜๋ก ์ฒ˜์Œ ๋ฐฐ์šฐ๋Š” ๊ฒƒ์ฒ˜๋Ÿผ ์ƒˆ๋กญ๋‹ค. ์ฒ˜์Œ์—๋Š” ํ•œ 20% ์ดํ•ด๊ฐ€๊ณ , ๋‚˜์ค‘์— ๋“ค์œผ๋ฉด ํ•œ 40% ์ดํ•ด๊ฐ€๊ณ , ๋‚˜์ค‘์—๋Š” ์ดํ•ดํ•ด์„œ ๊ทธ ๋•Œ๋Š” ์™œ ์ดํ•ด ๋ชปํ–ˆ์„๊นŒ ์ƒ๊ฐํ•œ๋‹ค. ์‹ค์Šต๊นŒ์ง€ ํ–ˆ์–ด์•ผ ํ–ˆ๋Š”๋ฐ ์•„์‰ฝ๊ฒŒ๋„ ์‹ค์Šต์€ ๋ชปํ•˜๊ณ  ํ€ด์ฆˆ๊นŒ์ง€๋งŒ ์™„๋ฃŒ! ๋‚ด์ผ์€ ์‹ค์Šต์ด๋ž‘ ๊ณผ์ œํ•˜๊ณ  ๊ฐ•์˜ ์ด์–ด์„œ ๋“ฃ๊ธฐ๋กœ~!

728x90
Comments