Bounding box regression

<Bounding box Regression>

Selective search를 통해 찾은 박스 위치는 정확하지 않다.

Predicted box와 ground truth box와의 차이를 줄여주는 bounding box regression이 필요

Linear regression model로 볼 수 있다.

Predicted box

Ground truth box

(x 좌표, y 좌표, width, height)

[Bounding box regression의 목표]

predicted box가 ground truth box와 유사하도록 학습하는 것.

(P → G transform)

여기서 P의 요소들을 Ground truth와 유사하게 이동시켜주는 함수들이 있으며 이는 아래와 같음.

이 함수들을 학습한 후 아래와 같이 P에 대해서 transformation을 진행한다.

d(P)는 transformation 함수이며, 아래 수식과 같다.

Φ : 학습한 CNN layer 중 pool5에서 구한 feature vector

w : 학습 가능한 weight vector

-> 우리가 얻고자 하는 것은 결국 transformation 함수 d이다.

(d를 알아야 proposal box와 ground-truth box와의 차이를 줄이도록 할 수 있기 때문)

논문에서는 lambda 값을 1000으로 사용함.

또, 이 weight를 학습시키기 위해 regularized least squares objective(ridge regression)를 사용한다.

여기서 regression targets t는 아래와 같이 정의된다.

Bounding box regression의 경우 모든 output bounding box에 대해서 진행할 필요가 없으며

R-CNN에서는 ground truth box와 predict box와의 IoU가 0.6 이상인 값에 대해서만 진행

위 내용을 아래와 같이 그림으로 나타낼 수 있음

그림 출처 : https://lilianweng.github.io/lil-log/2017/12/31/object-recognition-for-dummies-part-3.html

x, y : scale-invariant transformation

w, h : log-scale transformation

참조

https://lilianweng.github.io/lil-log/2017/12/31/object-recognition-for-dummies-part-3.html

Object Detection for Dummies Part 3: R-CNN Family

In Part 3, we would examine four object detection models: R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN. These models are highly related and the new versions show great speed improvement compared to the older ones.

lilianweng.github.io

저작자표시 (새창열림)

'딥러닝관련 > Detection' 카테고리의 다른 글

Detectron2 (1) 환경 세팅 및 데모 (0)	2021.09.29
Fast R-CNN 정리 (0)	2021.06.28
Non-maximum Suppression (0)	2021.06.24
Detection metrics 정리 (IOU, Precision, Recall, mAP...) (0)	2021.06.18
Exhaustive search (0)	2021.06.15

Bounding box regression

<Bounding box Regression>

'딥러닝관련 > Detection' 카테고리의 다른 글

관련글

티스토리툴바