(Mindmap) Part-level Scene Understanding for Robots
Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation
RelTR: Relation Transformer for Scene Graph Generation
Fully Convolutional Scene Graph Generation
Visual Relationship Detection with Language Priors