Chen Yulin's Blog

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 0 words)

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 21 words)

Building an efficient structured representation that captures comprehensive semantic knowledge is a crucial step towards a deeper understanding of visual scenes

#Scene-graph Visual-Relation Research-paper CV

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 6 words)

From Pixels to Graphs= Open-Vocabulary Scene Graph Generation with Vision-Language Models

#Scene-graph Visual-Relation Research-paper Multi-modal VLM CV Open-Vocabulary

Posted 2025-03-13Updated 2026-03-08Reviewa minute read (About 180 words)

SGTR+= End-to-end Scene Graph Generation with Transformer

SGTR 是一种自上而下的方法，该方法首先使用基于Transformer的生成器来生成一组可学习的triplet queries (subject–predicate–object)，然后使用级联的triplet detector逐步完善这些查询并生成最终场景图。它还提出了一种基于结构化发生器的实体感知关系表示方法，该方法利用了关系的组成属性。

#Scene-graph Visual-Relation Research-paper Transformer CV

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 0 words)

Neural Motifs= Scene graph parsing with global context

#Scene-graph Visual-Relation Research-paper CV

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 3 words)

GPS-Net= Graph Property Sensing Network for Scene Graph Generation

#Scene-graph Visual-Relation Research-paper CV Message-Passing

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 6 words)

Large-scale visual relationship understanding

#Scene-graph Visual-Relation Research-paper CV

Posted 2025-03-13Updated 2026-03-08Reviewa few seconds read (About 0 words)

Panoptic Segmentation

#Research-paper CV Semantic Segmentation Panoptic

Posted 2025-03-12Updated 2026-03-08Reviewa few seconds read (About 5 words)

Scene Graph Generation- A comprehensive survey

See [[Reconstruct-Anything Literature Review]]

#Scene-graph Visual-Relation Research-paper CV Survey

Posted 2025-03-12Updated 2026-03-08Reviewa few seconds read (About 94 words)

SceneGraphFusion- Incremental 3D Scene Graph Predictionfrom RGB-D Sequences

Overview of the proposed SceneGraphFusion framework. Our method takes a stream of RGB-D images a) as input to create an incremental geometric segmentation b). Then, the properties of each segment and a neighbor graph between segments are constructed. The properties d) and neighbor graph e) of the segments that have been updated in the current frame c) are used as the inputs to compute node and edge features f) and to predict a 3D scene graph g). Finally, the predictions are h) fused back into a globally consistent 3D graph.

#Scene-graph Research-paper CV Reconstruct 3D-Scene Semantic

Archives

Recents

Tags