RelTR= Relation Transformer for Scene Graph  Generation

RelTR= Relation Transformer for Scene Graph Generation

Iterative Scene Graph Generation

Iterative Scene Graph Generation

SGTR+= End-to-end Scene Graph Generation with Transformer

SGTR+= End-to-end Scene Graph Generation with Transformer

DETR

DETR

ViLT

ViLT

Vision Transformers Need Registers

Vision Transformers Need Registers

DINOv2- Learning Robust Visual Features without Supervision
AN IMAGE IS WORTH 16X16 WORDS- TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE

AN IMAGE IS WORTH 16X16 WORDS- TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE

DINO

DINO

Attention Is All You Need

Attention Is All You Need