• Mindscape ๐Ÿ”ฅ
    • Playlist ๐ŸŽง
  • Algorithm

    • 1018๋ฒˆ: ์ฒด์ŠคํŒ ๋‹ค์‹œ ์น ํ•˜๊ธฐ
    • 1966๋ฒˆ: ํ”„๋ฆฐํ„ฐ ํ
    • Python ์‹œ๊ฐ„ ์ดˆ๊ณผ ๋ฐฉ์ง€๋ฅผ ์œ„ํ•œ ํŒ
    • C++ std::vector ์‚ฌ์šฉ๋ฒ• ์ •๋ฆฌ
    • Vim ์‚ฌ์šฉ ๋งค๋‰ด์–ผ
  • Ubuntu

    • ๋ฆฌ๋ˆ…์Šค ์šฐ๋ถ„ํˆฌ GRUB ํฐํŠธ ๋ณ€๊ฒฝ
    • ์šฐ๋ถ„ํˆฌ ์ด๋ฏธ์ง€ ๋น„๋””์˜ค ์ธ๋„ค์ผ(๋ฏธ๋ฆฌ๋ณด๊ธฐ) ์•ˆ ๋ณด์ž„ ๋ฌธ์ œ ํ•ด๊ฒฐ
    • Wine ํ™˜๊ฒฝ์—์„œ ์นด์นด์˜คํ†ก ์‹คํ–‰ ์‹œ explorer.exe ๋œจ์ง€ ์•Š๊ฒŒ ํ•˜๋Š” ๋ฒ•
    • ์šฐ๋ถ„ํˆฌ Wine ์นด์นด์˜คํ†ก ์‚ฌ์ง„ ์ด๋ฏธ์ง€ ์Šคํฌ๋ฆฐ์ƒท ๋ถ™์—ฌ๋„ฃ๊ธฐ
    • Wine ์นด์นด์˜คํ†ก ์ด๋ชจ์ง€ ๊นจ์ง ๋ฌธ์ œ ํ•ด๊ฒฐ
    • Ubuntu ์œˆ๋„์šฐ ์• ๋‹ˆ๋ฉ”์ด์…˜ ๋„๊ธฐ
  • Wellness

    • ์ฐจ์ „์žํ”ผ (Psyllium Husk)
    • ์—‘์ŠคํŠธ๋ผ ๋ฒ„์ง„ ์˜ฌ๋ฆฌ๋ธŒ์œ  (Extra Virgin Olive Oil)
    • ์ž๊ฐ€๋น„๊ฐ•์„ธ์ฒ™ (Nasal Irrigation)
    • QCY HT08 (MeloBuds Pro Plus)
    • ์ฝ˜์„œํƒ€ (Concerta)
    • ์ธ๋ฐ๋†€ (Inderal)
    • ์„คํŠธ๋ž„๋ฆฐ (Sertraline)
    • ๋ฉœ๋ผํ† ๋‹Œ (Melatonin)
    • ์น˜๊ฒฝ๋ถ€ ๋งˆ๋ชจ์ฆ
    • ๋ฐ”๋ฒจ ์Šค์ฟผํŠธ (Barbell Squat)
  • Humanities

    • Nordvik, Russia
    • North Sentinel Island
    • ๋กฑ๊ณ ๋กฑ๊ณ (Rongorongo)
    • ๋ฐ”๋กœํฌ ์Œ์•… (Baroque Music)
  • Design

    • ๊ตฌ๊ธ€์˜ ์•„์ด์ฝ˜ ๋Œ€๊ฐœํŽธ โ€” 6๋…„ ๋งŒ์˜ ์‹ค์ˆ˜ ์ธ์ •
    • ์ œ๋Ÿด๋“œ ์  ํƒ€ โ€” ๋Ÿญ์…”๋ฆฌ ์Šคํฌ์ธ  ์›Œ์น˜์˜ ์ฐฝ์‹œ์ž
    • ๋ฐ”์šฐํ•˜์šฐ์Šค โ€” ํ˜„๋Œ€ ๋””์ž์ธ์˜ ์›์ 
  • Brands

    • NOMOS Glashรผtte
    • Frรฉdรฉrique Constant
    • KZ (Knowledge Zenith)
    • ์—์ŠคํŠธ๋ผ (AESTURA)
    • JINHAO (้‡‘่ฑช)
    • Herman Miller
    • ๋ฐ์Šค์ปค (DESKER)
    • ๋ฌด์‹ ์‚ฌ ์Šคํƒ ๋‹ค๋“œ (Musinsa Standard)
  • Finance

    • ํ˜„๋Œ€์นด๋“œ ZERO โ€” Edition2 vs Edition3 ๋น„๊ต
    • ์‹ ํ•œ์นด๋“œ ์ฒ˜์Œ
    • S&P 500 ETF ํˆฌ์ž ๊ฐ€์ด๋“œ
    • ํŒŒํ‚นํ†ต์žฅ vs CMA ํ†ต์žฅ
    • ๋ฒ„ํฌ์…” ํ•ด์„œ์›จ์ด (Berkshire Hathaway)
    • ๋น„ํŠธ์ฝ”์ธ(Bitcoin)
  • Products

    • ์˜ค๋””์˜ค ์ธํ„ฐํŽ˜์ด์Šค (Audio Interface)
    • ์ฟ ๋ฃจํ† ๊ฐ€ (KURUTOGA)
    • CX31993 DAC ๋™๊ธ€
    • ํด๋ Œ์ง• ๋ฐ€ํฌ (Cleansing Milk)
    • ํ”ผ์ ฏ ํ† ์ด (Fidget Toy)
    • ThinkPad
  • Programming Languages

    • 8.0. Statement Level Control Structures
    • 8. Subprogram
    • 9. Implementing Subprogram
    • 10.1. Abstract Data Types and Encapsulation Constructs
    • 10.2. Support for Object Oriented Programming
    • 11. Concurrency
    • 12. FPL (1)
    • 13. FPL (2)
    • 14. Exception Handling and Event Handling
    • Final Exam

24. Convolutional Neural Networks

์ž‘์„ฑ 2026. 6. 12.ยท์ˆ˜์ • 2026. 6. 12.

A Bit of History

Once upon a time...

Convolutional Neural Networks

  • Document recognition์— Gradient-based learning ์ ์šฉ
  • LeCun, Bottou, Bengio, Haffner 1998

The First Strong Result in Deep Learning

  • Deep Convolutional Neural Networks๋ฅผ ํ™œ์šฉํ•œ ImageNet classification
  • ์ผ๋ช… "AlexNet"
  • Alex Krizhevsky, Ilya Sutskever, Geoffrey E Hinton, 2012
  • ILSVRC (ImageNet Large Scale Visual Recognition Challenge) 2012 ์šฐ์Šน

Famous Image Datasets: ImageNet

Famous Image Datasets: MNIST

  • The MNIST database
  • "Modified National Institute of Standards and Technology database"
  • ๋‹ค์–‘ํ•œ Image processing systems ํ•™์Šต์— ์ผ๋ฐ˜์ ์œผ๋กœ ์‚ฌ์šฉ๋˜๋Š” ๋Œ€๊ทœ๋ชจ ์†๊ธ€์”จ ์ˆซ์ž Database
  • 60,000๊ฐœ์˜ Training images์™€ 10,000๊ฐœ์˜ Testing images ํฌํ•จ

Famous Image Datasets: CIFAR 10 vs. CIFAR 100

  • CIFAR-10
    • 10๊ฐœ Class, Class๋‹น 6,000์žฅ์˜ ์ด๋ฏธ์ง€๋กœ ๊ตฌ์„ฑ๋œ 60,000์žฅ์˜ 32x32 ์ปฌ๋Ÿฌ ์ด๋ฏธ์ง€
    • 50,000๊ฐœ์˜ Training images์™€ 10,000๊ฐœ์˜ Test images ์กด์žฌ
  • CIFAR-100
    • CIFAR-10๊ณผ ์œ ์‚ฌํ•˜๋‚˜, ๊ฐ๊ฐ 600์žฅ์˜ ์ด๋ฏธ์ง€๋ฅผ ํฌํ•จํ•˜๋Š” 100๊ฐœ Class ๋ณด์œ 
    • Class๋‹น 500๊ฐœ์˜ Training images์™€ 100๊ฐœ์˜ Testing images ์กด์žฌ

Fast-Forward to Today: ConvNets Are Everywhere

  • Classification
  • Retrieval
  • Detection
  • Segmentation
  • Image Captioning
  • Style Transfer
  • Self-driving Cars
  • Diffusion models

Convolutional Neural Networks

Convolutional Layer

Recap: Fully Connected Layer (Simple FF Networks)

  • 32ร—32ร—3ย imageโ†’32 \times 32 \times 3 ~\text{image} \to32ร—32ร—3ย imageโ†’ stretch to 3072ร—13072 \times 13072ร—1

Convolution Layer

32ร—32ร—3ย imageโ†’32 \times 32 \times 3 ~\text{image} \to32ร—32ร—3ย imageโ†’ preserve spatial structure

Convolutional Networks

A Closer Look at Spatial Dimensions

In Practice: Common to Zero Pad the Border

Caution

Summary: Convolutional Layer

  • ์ž…๋ ฅ์ด W1ร—H1ร—CW_1 \times H_1 \times CW1โ€‹ร—H1โ€‹ร—C๋ผ๊ณ  ๊ฐ€์ •
  • Conv layer๋Š” 4๊ฐ€์ง€ Hyperparameters ํ•„์š”:
    • Number of filters KKK
    • The filter size FFF
    • The stride SSS
    • The zero padding PPP
  • ๋‹ค์Œ์˜ ์ถœ๋ ฅ W2ร—H2ร—KW_2 \times H_2 \times KW2โ€‹ร—H2โ€‹ร—K ์ƒ์„ฑ:
    • W2=(W1โˆ’F+2P)/S+1W_2 = (W_1 - F + 2P) / S + 1W2โ€‹=(W1โ€‹โˆ’F+2P)/S+1
    • H2=(H1โˆ’F+2P)/S+1H_2 = (H_1 - F + 2P) / S + 1H2โ€‹=(H1โ€‹โˆ’F+2P)/S+1
  • Number of parameters: Fร—Fร—Cร—KF \times F \times C \times KFร—Fร—Cร—K weights + KKK biases

Pooling Layer

  • Representations๋ฅผ ๋” ์ž‘๊ณ  ๊ด€๋ฆฌํ•˜๊ธฐ ์‰ฝ๊ฒŒ ๋งŒ๋“ฆ
  • ๊ฐ Activation map์— ๋Œ€ํ•ด ๋…๋ฆฝ์ ์œผ๋กœ ์ž‘๋™

Max Pooling

Summary: Pooling Layer

  • ์ž…๋ ฅ์ด W1ร—H1ร—CW_1 \times H_1 \times CW1โ€‹ร—H1โ€‹ร—C๋ผ๊ณ  ๊ฐ€์ •
  • Pooling Layer๋Š” 2๊ฐ€์ง€ Hyperparameters ํ•„์š”:
    • The spatial extent FFF
    • The stride SSS
  • ๋‹ค์Œ์˜ ์ถœ๋ ฅ W2ร—H2ร—CW_2 \times H_2 \times CW2โ€‹ร—H2โ€‹ร—C ์ƒ์„ฑ:
    • W2=(W1โˆ’F)/S+1W_2 = (W_1 - F) / S + 1W2โ€‹=(W1โ€‹โˆ’F)/S+1
    • H2=(H1โˆ’F)/S+1H_2 = (H_1 - F) / S + 1H2โ€‹=(H1โ€‹โˆ’F)/S+1
  • Number of parameters: 0

Fully-Connected (FC) Layer

  • ์ผ๋ฐ˜์ ์ธ Feedforward Neural Networks์™€ ๊ฐ™์ด ์ „์ฒด Input volume์— ์—ฐ๊ฒฐ๋œ Neurons ํฌํ•จ

Summary

  • ConvNets๋Š” CONV, POOL, FC layers๋ฅผ ์Œ“์Œ
  • ์—ญ์‚ฌ์ ์ธ Architectures ํ˜•ํƒœ: [(CONV-RELU)*N-POOL?]*M, (FC-RELU)*K, SOFTMAX
    • NNN์€ ๋ณดํ†ต 5๊นŒ์ง€, MMM์€ ํผ, 0โ‰คKโ‰ค20 \le K \le 20โ‰คKโ‰ค2
  • ๋” ์ž‘์€ Filters์™€ ๋” ๊นŠ์€ Architectures๋ฅผ ํ–ฅํ•œ ์ถ”์„ธ
  • POOL/FC layers๋ฅผ ์ œ๊ฑฐํ•˜๋Š” ์ถ”์„ธ (Just CONV)

Representative CNN Architectures

GoogleNet, VGGNet, ResNet, โ€ฆ

ImageNet Large Scale Visual Recognition Challenge (ILSVRC) Winners

VGGNet

  • ์ž‘์€ Filters, ๋” ๊นŠ์€ Networks
  • 8 layers (AlexNet) โ†’ 16 - 19 layers (VGG16Net)
  • 3x3 CONV stride 1, pad 1๊ณผ 2x2 MAX POOL stride 2๋งŒ ์‚ฌ์šฉ

ResNet

  • Residual connections๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๋งค์šฐ ๊นŠ์€ Networks
  • ImageNet์„ ์œ„ํ•œ 152-layer model - ILSVRCโ€™15 classification ์šฐ์Šน
  • ILSVRCโ€™15 ๋ฐ COCOโ€™15์˜ ๋ชจ๋“  Classification ๋ฐ Detection competitions ์„๊ถŒ

CNN for Text Classification

  • Sentence Classification์„ ์œ„ํ•œ Convolutional Neural Networks
  • Yoon Kim, EMNLP 2014
์ตœ๊ทผ ์ˆ˜์ •: 26. 6. 12. ์˜คํ›„ 3:28
Contributors: kmbzn, Claude Sonnet 4.6

BUILT WITH

CloudflareNode.jsGitHubGitVue.jsJavaScriptVSCodenpm

All trademarks and logos are property of their respective owners.
ยฉ 2026 kmbzn ยท MIT License