hye-log

[Boostcamp AI Tech] WEEK 14_DAY 65

Boostcourse/AI Tech 4th cohort

iihye_ 2022. 12. 21. 03:51

🎄 Individual study


[3] Basics and understanding of Semantic Segmentation

1. FCN, a representative deep learning approach to segmentation

1) Abstract

Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3431-3440).

(1) backbone (feature extraction) : VGG network

(2) VGG FC layers -> replaced with Convolution

(3) Transposed Convolution -> performs Pixel Wise Prediction

2) VGG

- Performs well on image classification

- The pretrained network can be used as-is

3) Fully Connected Layer vs Convolution Layer

- Convolution Layer : keeps the positional information of each pixel

- Fully Connected Layer : flattens its input, so the positional information is lost

- Why use 1x1 Conv : the output depends only on the kernel's parameters, not on the size (height, width) of the image or layer, so inputs of any size can be handled
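To convince myself of the 1x1 Conv point above, here is a minimal NumPy sketch. The layer sizes, the random weights, and the names `fc_weight`, `fully_connected`, and `conv1x1` are made up for illustration and are not taken from the paper or the lecture. It only shows that fully connected weights applied per pixel behave like a 1x1 convolution and do not care about height or width.

```python
import numpy as np

rng = np.random.default_rng(0)

in_channels, num_classes = 8, 3
# Weights of a hypothetical fully connected classifier: (num_classes, in_channels)
fc_weight = rng.normal(size=(num_classes, in_channels))
fc_bias = rng.normal(size=num_classes)


def fully_connected(vector):
    """Score a single feature vector of shape (in_channels,)."""
    return fc_weight @ vector + fc_bias


def conv1x1(feature_map):
    """Apply the same weights as a 1x1 convolution.

    feature_map has shape (in_channels, height, width); the result has
    shape (num_classes, height, width), one score vector per pixel.
    """
    scores = np.einsum("oc,chw->ohw", fc_weight, feature_map)
    return scores + fc_bias[:, None, None]


# The same weights work for any spatial size.
small = rng.normal(size=(in_channels, 4, 4))
large = rng.normal(size=(in_channels, 7, 10))
small_scores = conv1x1(small)
large_scores = conv1x1(large)
print(small_scores.shape, large_scores.shape)

# At each pixel, the 1x1 convolution equals the fully connected layer on that pixel.
pixel_match = np.allclose(large_scores[:, 2, 5], fully_connected(large[:, 2, 5]))
print(pixel_match)
```

The spatial layout survives because nothing is flattened; only the channel axis is mixed.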

4) Transposed Convolution

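- Called upsampling because it restores the size of a reduced image

- Called deconvolution because it is loosely viewed as the reverse of convolution (strictly speaking it restores the size, not the original values, so it is not a true inverse)

- Called transposed convolution because it applies the transpose of the convolution operation

- In short, a convolution that uses learnable parameters to enlarge a reduced image again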
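The notes above can be sketched in a few lines of NumPy. This is a simplified single-channel version with no padding, and the function name `transposed_conv2d` and the all-ones kernel are my own choices for illustration; in a real network the kernel values are learned.

```python
import numpy as np


def transposed_conv2d(x, kernel, stride=2):
    """Transposed convolution of a single-channel 2D input (no padding).

    Each input value scales a copy of the kernel, and that copy is added
    into the output at an offset of `stride` per input step.
    Output size along each axis: (n - 1) * stride + k.
    """
    n_h, n_w = x.shape
    k_h, k_w = kernel.shape
    out = np.zeros(((n_h - 1) * stride + k_h, (n_w - 1) * stride + k_w))
    for i in range(n_h):
        for j in range(n_w):
            top, left = i * stride, j * stride
            out[top:top + k_h, left:left + k_w] += x[i, j] * kernel
    return out


x = np.array([[1.0, 2.0],
              [3.0, 4.0]])
kernel = np.ones((2, 2))  # illustrative values; normally learned

upsampled = transposed_conv2d(x, kernel, stride=2)
print(upsampled.shape)  # (4, 4)
print(upsampled)
```

With a 2x2 kernel and stride 2 the kernel copies do not overlap, so each input value simply fills its own 2x2 block. With a larger kernel (for example 3x3) the copies overlap and the overlapping contributions are summed.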

5) Methods for improving FCN performance

- Restore the information lost through MaxPooling by bringing back predictions from earlier pooling layers (the skip connections of FCN-16s and FCN-8s)

- Because the final upsampling factor becomes smaller (16x or 8x instead of 32x), the image can be restored more effectively
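A rough sketch of how I understand this fusion, assuming the notes refer to the FCN-32s versus FCN-16s comparison. Nearest-neighbor repetition stands in for the learned transposed convolution, and the score maps are random placeholders, so only the shapes and the order of operations are meaningful here. The names `upsample_nearest`, `final_scores`, and `pool4_scores` are hypothetical.

```python
import numpy as np


def upsample_nearest(scores, factor):
    """Stand-in for a learned transposed convolution: repeat each pixel."""
    return scores.repeat(factor, axis=1).repeat(factor, axis=2)


rng = np.random.default_rng(0)
num_classes = 3
image_size = 64

# Placeholder per-class score maps at 1/32 and 1/16 of the input resolution.
final_scores = rng.normal(size=(num_classes, image_size // 32, image_size // 32))
pool4_scores = rng.normal(size=(num_classes, image_size // 16, image_size // 16))

# FCN-32s style: a single 32x upsampling of the coarsest scores.
fcn32_output = upsample_nearest(final_scores, 32)

# FCN-16s style: upsample 2x, add the finer pool4 scores, then upsample 16x.
fused = upsample_nearest(final_scores, 2) + pool4_scores
fcn16_output = upsample_nearest(fused, 16)

print(fcn32_output.shape, fcn16_output.shape)
```

Both outputs reach the input resolution, but the second path mixes in detail from a layer that has gone through one fewer pooling step, and its last upsampling step is 16x rather than 32x.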

6) Evaluation metrics

- Pixel Accuracy : correctly classified pixels / total pixels

- Mean IoU : the average over classes of (Ground Truth∩Predict / Ground Truth∪Predict)
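Both metrics can be computed from a confusion matrix. The sketch below is one common way to do it, not necessarily how the competition baseline computes them; the function names and the tiny 3x3 masks are made up for illustration, and classes that appear in neither mask are skipped when averaging.

```python
import numpy as np


def confusion_matrix(gt, pred, num_classes):
    """Rows are ground-truth classes, columns are predicted classes."""
    index = gt.ravel() * num_classes + pred.ravel()
    counts = np.bincount(index, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)


def pixel_accuracy(cm):
    """Correctly classified pixels divided by all pixels."""
    return np.trace(cm) / cm.sum()


def mean_iou(cm):
    """Average of intersection / union over the classes that occur."""
    intersection = np.diag(cm)
    union = cm.sum(axis=0) + cm.sum(axis=1) - intersection
    present = union > 0  # skip classes absent from both masks
    return (intersection[present] / union[present]).mean()


gt = np.array([[0, 0, 1],
               [0, 1, 1],
               [2, 2, 2]])
pred = np.array([[0, 0, 1],
                 [0, 0, 1],
                 [2, 2, 1]])

cm = confusion_matrix(gt, pred, num_classes=3)
acc = pixel_accuracy(cm)
miou = mean_iou(cm)
print(acc, miou)
```

Here 7 of 9 pixels are correct, so the pixel accuracy is 7/9, while the per-class IoUs are 3/4, 2/4, and 2/3, giving a mean IoU of 23/36. Mean IoU is lower because each class counts equally regardless of how many pixels it covers.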



🎄 Today's retrospective

I need to watch the lectures too.. but in the morning I thought about which datasets are available for the final project and what topic would be good. Originally I was planning a project on vehicle damage, so I downloaded data from AI hub and looked through it piece by piece, and I also searched Kaggle to see what competitions had been held. Whatever project we end up doing, it is only possible if we have data, so I guess there is no way around searching for a lot of data. I watched about one lecture, then opened the server and set up ssh, and why is it confusing every single time? (T_T) I installed what I needed and also looked through the provided baseline code. During mentoring, the mentor gave a brief review of the segmentation competition. Hmm, just like with detection, it definitely feels like a lot of models using transformers are coming out..!
