hye-log

[๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech]WEEK 03_DAY 14 ๋ณธ๋ฌธ

Boostcourse/AI Tech 4๊ธฐ

[๋ถ€์ŠคํŠธ์บ ํ”„ AI Tech]WEEK 03_DAY 14

iihye_ 2022. 10. 11. 01:57

๐ŸŒฑ ๊ฐœ๋ณ„ํ•™์Šต


[7] Recurrent Neural Networks

1. Sequential Model

1) Naive sequence model : ์ด์ „ ๋ฐ์ดํ„ฐ๋กœ ํ˜„์žฌ ๋ฐ์ดํ„ฐ๋ฅผ ์˜ˆ์ธกํ•จ

2) Autoregressive model : ๊ณผ๊ฑฐ์˜ τ๊ฐœ๋กœ ํ˜„์žฌ ๋ฐ์ดํ„ฐ๋ฅผ ์˜ˆ์ธกํ•จ

3) Markov model : ๋ฐ”๋กœ ์ง์ „ ๊ฐ’๋งŒ ์˜ํ–ฅ์„ ๋ฐ›์Œ

4) Latent autoregressive model : hidden state๋กœ ๊ณผ๊ฑฐ์˜ ์ •๋ณด๋ฅผ ๊ธฐ์–ต, ์š”์•ฝ

 

2. Recurrent Neural Network

1) Recurrent Neural Network : ๋ชจ๋ธ์˜ ์ถœ๋ ฅ์ด ๋‹ค์‹œ ์ž…๋ ฅ์œผ๋กœ ๊ฐ€๋Š” ๊ตฌ์กฐ

https://colah.github.io/posts/2015-08-Understanding-LSTMs/

2) ๋‹จ์  : Long-term dependencies

- ๋„คํŠธ์›Œํฌ์˜ ํฌ๊ธฐ๊ฐ€ ํด์ˆ˜๋ก ๊ณผ๊ฑฐ์˜ ์ •๋ณด๋ฅผ ์žƒ์–ด๋ฒ„๋ฆด ์ˆ˜ ์žˆ์Œ

- vanishing/exploding gradient ๋ฌธ์ œ

 

3. Long Short Term Memory(LSTM)

1) ๊ตฌ์กฐ

- Input

- Output

- Gate : Forget gate, Input gate, Output gate

- Cell state : Previous cell state, Next cell state

- Hidden state : Previous cell state, Next cell state

https://colah.github.io/posts/2015-08-Understanding-LSTMs/

2) Forget gate : ์–ด๋–ค ์ •๋ณด๋ฅผ ๋ฒ„๋ฆด์ง€

3) Input gate : ์–ด๋–ค ์ •๋ณด๋ฅผ ์ €์žฅํ• ์ง€

4) Update cell : cell state์— ์—…๋ฐ์ดํŠธ

5) Output gate : ์–ด๋–ค ๊ฐ’์„ ๋ฐ–์œผ๋กœ ๋‚ด๋ณด๋‚ผ์ง€


[8] Transformer

1. Transformer

1) attention ๊ตฌ์กฐ๋ฅผ ํ™œ์šฉํ•œ ์‹œํ€€์Šค ๋ณ€ํ™˜ ๋ชจ๋ธ

2) ๊ธฐ๊ณ„์–ด ๋ฒˆ์—ญ ๋“ฑ์— ์‚ฌ์šฉ

3) encoder-decoder ๊ตฌ์กฐ๋กœ ๊ตฌ์„ฑ

4) ํ•˜๋‚˜์˜ encoder๋Š” self-attention, feed-forward๋กœ ๊ตฌ์„ฑ

- self-attention์˜ path๋Š” ์˜์กด์ (dependency)

- feed-forward์˜ path๋Š” ๋…๋ฆฝ์ (independency)



๐ŸŒฑ ์˜ค๋Š˜์˜ ํšŒ๊ณ 

๋งˆ๋ฌด๋ฆฌํ•˜์ง€ ๋ชปํ•œ ๊ธฐ๋ณธ ๊ณผ์ œ๊ฐ€ ์žˆ์–ด์„œ ์˜ค์ „์—๋Š” ๊ฐ•์˜ ๋“ฃ๊ณ  ๊ธฐ๋ณธ ๊ณผ์ œ ํ’€์–ด์„œ ์ œ์ถœํ–ˆ๋‹ค. ์˜คํ›„์—๋Š” transformer ๊ฐ•์˜๋ฅผ ๋“ค์—ˆ๋Š”๋ฐ... ์–ด๋ ต๋‹ค.. ์‚ฌ์‹ค ๊ฐ•์˜ ๋‚ด์šฉ์„ ์ „๋ถ€ ์†Œํ™”ํ•˜์ง€ ๋ชปํ•ด์„œ ์•„์‰ฌ์› ๋‹ค.. ๋‹คํ–‰ํžˆ๋„ ํ”ผ์–ด์„ธ์…˜ ๋•Œ ๋‹ค๋ฅธ ์บ ํผ๋ถ„์ด generative model์— ๋Œ€ํ•ด์„œ ์„ค๋ช…ํ•ด์ฃผ์…”์„œ ๊ทธ๋‚˜๋งˆ ์ดํ•ด๊ฐ€ ๊ฐ”๋‹ค! ๊ธˆ์š”์ผ์€ ์ŠคํŽ˜์…œ ํ”ผ์–ด์„ธ์…˜๋„ ์ค€๋น„๋˜์–ด ์žˆ๋Š” ๋‚ ! ๋‹ค๋ฅธ ์บ ํผ๋ถ„๋“ค์ด๋ž‘ ๋งŒ๋‚˜์„œ ์–ด๋–ค ๋ถ„์•ผ์— ๊ด€์‹ฌ ์žˆ๋Š”์ง€, ํ”ผ์–ด์„ธ์…˜์€ ์–ด๋–ป๊ฒŒ ์šด์˜ํ•˜๊ณ  ์žˆ๋Š”์ง€์— ๋Œ€ํ•ด์„œ ์ด์•ผ๊ธฐ ๋‚˜๋ˆ„์—ˆ๋‹ค. ์ด๋ฒˆ ์ฃผ๊นŒ์ง€๊ฐ€ level 2๋ฅผ ์œ„ํ•œ ์กฐ๋ฅผ ์งœ๋Š” ์‹œ๊ฐ„์ธ๋ฐ ์–ด๋–ค ์ฃผ์ œ๋กœ ์–ด๋–ป๊ฒŒ ํ”„๋กœ์ ํŠธ๋ฅผ ์ง„ํ–‰ํ•ด์•ผ ํ• ์ง€ ์•„์ง๊นŒ์ง€๋„ ๊ฐ์ด ์ž˜ ์•ˆ ์˜จ๋‹คใ… ใ…  ์˜คํ”ผ์Šค์•„์›Œ์—๋Š” ์‹ฌํ™” ๊ณผ์ œ์— ๋Œ€ํ•œ ์„ค๋ช…ํ•˜๋Š” ์‹œ๊ฐ„์„ ๊ฐ€์กŒ๋Š”๋ฐ, ์‹œ์ž‘ํ•˜๋Š” ์ธํŠธ๋กœ ๋…ธ๋ž˜๊ฐ€ ๋‰ด์ง„์Šค์˜ Attention.. (is all you need) (๊ตฟ!) ๋‹ค์Œ ์ฃผ๋ถ€ํ„ฐ๋Š” ์›”, ํ™”, ์ˆ˜ ํ”ผ์–ด์„ธ์…˜ ๋•Œ ํ•˜๋Š” ์ผ์ด ๊ฑฐ์˜ ์—†๋Š”๊ฑฐ ๊ฐ™์•„์„œ ๋ฐ์ด์ฝ˜ ๋Œ€ํšŒ๋ฅผ ์‹œ์ž‘ํ•˜๊ธฐ๋กœ ํ–ˆ๋‹ค!

728x90
Comments