OMG-LLaVA
Scene Reconstruction with Functional Objects for Robot Autonomy
DETR
Semantic-SAM
MaskDINO
ALBEF
ViLT
ZegCLIP