Learn Before
Concept

Region of Interest Pooling Layer

The region of interest (RoI) pooling layer is introduced in Fast R-CNN to extract fixed-shape features from regions of interest of varying sizes. Unlike standard pooling layers that indirectly control output shape through window size, padding, and stride, RoI pooling allows the output shape (e.g., h2×w2h_2 \times w_2) to be specified directly. For an input region of shape h×wh \times w, the RoI pooling layer divides the region into an h2×w2h_2 \times w_2 grid of subwindows, where each subwindow is approximately of shape (h/h2)×(w/w2)(h/h_2) \times (w/w_2). The subwindow dimensions are rounded up, and the maximum element in each subwindow is taken as its output.

Image 0

0

1

Updated 2026-05-21

Contributors are:

Who are from:

Tags

D2L

Dive into Deep Learning @ D2L