Posted 2025-03-11 | Updated 2026-03-23 | Review | a minute read (about 161 words)

DETR

Reference: https://www.youtube.com/watch?v=T35ba_VXkMY&t=1744s

DETR is an object detection model that uses a transformer as its basic architecture.

Insight

Object queries are learned embeddings, one per prediction slot. The DETR paper describes what the slots end up learning:

>Visualization of all box predictions on all images from COCO 2017 val set for 20 out of total N = 100 prediction slots in DETR decoder. Each box prediction is represented as a point with the coordinates of its center in the 1-by-1 square normalized by each image size. The points are color-coded so that green color corresponds to small boxes, red to large horizontal boxes and blue to large vertical boxes. We observe that each slot learns to specialize on certain areas and box sizes with several operating modes. We note that almost all slots have a mode of predicting large image-wide boxes that are common in COCO dataset.
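
To make the plotting scheme in that caption concrete, here is a minimal sketch of how one box prediction could be turned into a colored point. The `Box` type, the `SMALL_AREA` threshold, and the "wider than tall" rule for horizontal boxes are my own assumptions for illustration; the caption does not give the actual cutoffs.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """A predicted box in pixel coordinates: center (cx, cy), width w, height h."""
    cx: float
    cy: float
    w: float
    h: float


# Hypothetical threshold: a box counts as "small" when its normalized area
# is below this value. The caption does not state the real cutoff.
SMALL_AREA = 0.1


def normalized_center(box, img_w, img_h):
    """Map the box center into the 1-by-1 square by dividing by the image size."""
    return box.cx / img_w, box.cy / img_h


def color_for(box, img_w, img_h):
    """Pick a color: green = small, red = large horizontal, blue = large vertical."""
    nw, nh = box.w / img_w, box.h / img_h
    if nw * nh < SMALL_AREA:
        return "green"
    # Assumption: "horizontal" means normalized width >= normalized height.
    return "red" if nw >= nh else "blue"


def slot_points(predictions):
    """Group (slot, box, img_w, img_h) records into per-slot lists of (x, y, color)."""
    points = {}
    for slot, box, img_w, img_h in predictions:
        x, y = normalized_center(box, img_w, img_h)
        points.setdefault(slot, []).append((x, y, color_for(box, img_w, img_h)))
    return points
```

Feeding every prediction of one slot across a validation set into `slot_points` and scatter-plotting the result would give one panel of the kind the caption describes, where a slot that specializes shows up as a cluster of same-colored points.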

Author

Chen Yulin
Tags: Research-paper, Transformer, CV, Object-Detection