
Concept Overview
Scene Graph
A scene graph is a structured representation that captures detailed semantics by explicitly modeling:
- objects ("man", "fire hydrant", "shorts")
- attributes of objects ("fire hydrant is yellow")
- relations between paired objects ("man jumping over fire hydrant")
A scene graph is a set of visual relationship triplets in the form of <subject, relation, object> or <object, is, attribute>.
Scene graphs should serve as an **objective semantic representation** of the state of the scene.
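
The triplet definition above can be sketched as a small data structure. This is only an illustrative sketch, not an implementation from any of the referenced papers: the names `Triplet`, `SceneGraph`, `add_object`, `add_attribute`, `add_relation`, `attributes_of`, and `relations` are hypothetical, and it assumes that attribute triplets are recognizable by the literal predicate "is".

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Triplet:
    """One <subject, relation, object> or <object, is, attribute> entry."""
    head: str       # subject (or the object that carries an attribute)
    predicate: str  # relation name, or "is" for attribute triplets
    tail: str       # object (or the attribute value)


@dataclass
class SceneGraph:
    """Hypothetical container: a set of objects plus a list of triplets."""
    objects: set = field(default_factory=set)
    triplets: list = field(default_factory=list)

    def add_object(self, name: str) -> None:
        self.objects.add(name)

    def add_attribute(self, obj: str, attribute: str) -> None:
        # Assumption: attributes are stored as <object, is, attribute>.
        self.objects.add(obj)
        self.triplets.append(Triplet(obj, "is", attribute))

    def add_relation(self, subject: str, relation: str, obj: str) -> None:
        # Both endpoints of a relation are objects in the scene.
        self.objects.update((subject, obj))
        self.triplets.append(Triplet(subject, relation, obj))

    def attributes_of(self, obj: str) -> list:
        return [t.tail for t in self.triplets
                if t.head == obj and t.predicate == "is"]

    def relations(self) -> list:
        return [t for t in self.triplets if t.predicate != "is"]


# Build the example used in the notes.
graph = SceneGraph()
graph.add_object("shorts")
graph.add_attribute("fire hydrant", "yellow")
graph.add_relation("man", "jumping over", "fire hydrant")

print(sorted(graph.objects))
print(graph.attributes_of("fire hydrant"))
print(graph.relations())
```

Storing attributes and relations in the same triplet list mirrors the definition above, where both are triplets; a real system would more likely attach bounding boxes and confidence scores to each object and relation.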
[Figure: Pasted_image_20250414142333.png, from "Part-level Scene Understanding for Robots"]








[Figure: Pasted_image_20250318162533.png, from "Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding"]
[Figure: Pasted_image_20250318160643.png, from "Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation"]
[Figure: Pasted_image_20250318155431.png, from "Visual Translation Embedding Network for Visual Relation Detection"]
[Figure: Pasted_image_20250318155444.png, from "Visual Translation Embedding Network for Visual Relation Detection"]


