 Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding/Pasted_image_20250318162533.png)
Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding
Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation
My idea is to apply panoptic segmentation to the scene first, then run hierarchical part relation detection on each segmented object; it is a different route to the same end.