Concept

Practical Scale-Ratio Combination Strategy for Anchor Boxes

When generating anchor boxes with nn scales s1,,sns_1, \ldots, s_n and mm aspect ratios r1,,rmr_1, \ldots, r_m, using every possible (si,rj)(s_i, r_j) combination at each pixel would produce whnmwhnm total anchor boxes, which is computationally prohibitive. In practice, only those pairings that include either the first scale s1s_1 or the first aspect ratio r1r_1 are retained:

(s1,r1),(s1,r2),,(s1,rm),(s2,r1),(s3,r1),,(sn,r1)(s_1, r_1), (s_1, r_2), \ldots, (s_1, r_m), (s_2, r_1), (s_3, r_1), \ldots, (s_n, r_1)

This yields n+m1n + m - 1 distinct anchor boxes per pixel and wh(n+m1)wh(n + m - 1) anchor boxes for the entire image, dramatically reducing the computational burden while still providing sufficient shape diversity to cover most ground-truth objects.

0

1

Updated 2026-05-20

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L