 Part-level Scene Understanding for Robots/Pasted_image_20250414142333.png)
(Mindmap) Part-level Scene Understanding for Robots
概念梳理
Scene Graph
A scene graph is a structural representation, which can capture detailed semantics by explicitly Modeling:
- objects (‘‘man’’, ‘‘fire hydrant’’, ‘‘shorts’’)
- attributes of objects (‘‘fire hydrant is yellow’’)
- relations between paired objects (‘‘man jumping over fire hydrant’’)
A scene graph is a set of visual relationship triplets in the form of <subject, relation, object> or <object, is, attribute>
Scene graphs should serve as an **objective semantic representation** of the state of the scene

 Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding/Pasted_image_20250318162533.png)
 Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation/Pasted_image_20250318160643.png)
 Visual Translation Embedding Network for Visual Relation Detection/Pasted_image_20250318155431.png)
 Visual Translation Embedding Network for Visual Relation Detection/Pasted_image_20250318155444.png)



 Fully Convolutional Scene Graph Generation/Pasted_image_20250314123239.png)
 Fully Convolutional Scene Graph Generation/Pasted_image_20250317115153.png)
 Fully Convolutional Scene Graph Generation/Pasted_image_20250317115203.png)
 Fully Convolutional Scene Graph Generation/Pasted_image_20250317115216.png)
 Fully Convolutional Scene Graph Generation/Pasted_image_20250317121116.png)
 Fully Convolutional Scene Graph Generation/Pasted_image_20250317121142.png)
 Fully Convolutional Scene Graph Generation/Pasted_image_20250317121325.png)
