Learn Before
Object Detection Prediction Pipeline Using Anchor Boxes
At inference time, an anchor-box-based object detection model follows a multi-step prediction pipeline for each input image. First, the model generates multiple anchor boxes at various positions and scales across the image. Next, it predicts a class label and a set of bounding-box offsets for every anchor box simultaneously. The predicted offsets are then applied to adjust the positions and sizes of the corresponding anchor boxes, producing a set of predicted bounding boxes. Finally, a filtering step retains only those predicted bounding boxes that satisfy certain criteria (such as confidence thresholds or non-maximum suppression), which are output as the final detections.
0
1
Tags
D2L
Dive into Deep Learning @ D2L