Chen Yulin's Blog

Posted 2025-04-14 | Updated 2026-03-30 | Note | a few seconds read (About 95 words)

(Mindmap) Part-level Scene Understanding for Robots

A scene graph is a structured representation that captures detailed semantics by explicitly modeling:

Posted 2025-03-18 | Updated 2026-03-30 | Review | a few seconds read (About 3 words)

Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation

#Robotics Scene-graph Visual-Relation Research-paper 3D-Scene Embodied-AI Open-Vocabulary Hierarchical

Posted 2025-03-18 | Updated 2026-03-30 | Review | a few seconds read (About 31 words)

Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding

The architecture of RLSV is a three-layered hierarchical projection that projects a visual triple onto the attribute space, the relation space, and the visual space in order.

#Scene-graph Visual-Relation Research-paper CV Representation-Learning

Posted 2025-03-18 | Updated 2026-03-30 | Note | a few seconds read (About 83 words)

(UVtransE) Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

#Scene-graph Visual-Relation Research-paper Image2Text CV Translation-Embedding

Posted 2025-03-18 | Updated 2026-03-30 | Review | a few seconds read (About 7 words)

Visual Translation Embedding Network for Visual Relation Detection

VTransE
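As a rough illustration of the translation-embedding idea behind VTransE, the sketch below scores a predicate by how well it satisfies W_s·x_s + t_p ≈ W_o·x_o in a low-dimensional relation space. The function names, the toy vectors, and the identity projections are all hypothetical; the actual network learns the projections and predicate vectors from visual features with its own training loss, which this sketch does not reproduce.

```python
def project(matrix, features):
    """Multiply a matrix (given as a list of rows) by a feature vector."""
    return [sum(w * x for w, x in zip(row, features)) for row in matrix]


def translation_distance(w_s, w_o, t_p, x_s, x_o):
    """Squared L2 distance || W_s x_s + t_p - W_o x_o ||^2.

    A small distance means the predicate vector t_p explains the offset
    between the projected subject and the projected object.
    """
    s = project(w_s, x_s)
    o = project(w_o, x_o)
    return sum((si + ti - oi) ** 2 for si, ti, oi in zip(s, t_p, o))


def rank_predicates(w_s, w_o, predicates, x_s, x_o):
    """Return predicate names sorted from smallest to largest distance."""
    return sorted(
        predicates,
        key=lambda name: translation_distance(w_s, w_o, predicates[name], x_s, x_o),
    )


# Toy 2-D example; every value here is made up for illustration.
identity = [[1.0, 0.0], [0.0, 1.0]]
predicates = {"ride": [0.0, -1.0], "next to": [1.0, 0.0]}
person = [0.0, 1.0]
bike = [0.0, 0.0]

ranking = rank_predicates(identity, identity, predicates, person, bike)
print(ranking)
```

In this toy setup "ride" ranks first because person + ride lands exactly on bike, while "next to" leaves a nonzero residual.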

#Scene-graph Visual-Relation Research-paper Image2Text CV Translation-Embedding

Posted 2025-03-16 | Updated 2026-03-30 | Review | a minute read (About 112 words)

Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation

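My idea is to run panoptic segmentation on the scene first and then perform hierarchical part relation detection on each object; the two approaches reach the same goal by different routes.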

#Scene-graph Visual-Relation Research-paper CV Subgraph

Posted 2025-03-16 | Updated 2026-03-30 | Review | a few seconds read (About 3 words)

Scene Graph Generation by Iterative Message Passing

#Scene-graph Visual-Relation Research-paper CV Message-Passing

Posted 2025-03-14 | Updated 2026-03-30 | Review | a few seconds read (About 86 words)

RelTR: Relation Transformer for Scene Graph Generation

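RelTR is a bottom-up method: it uses a Transformer-based object detector (e.g., DETR) to generate object candidates, then uses a relation transformer to predict the relations between object pairs. It also designs an integral-based relation representation that encodes relations as 2D vector fields.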

#Scene-graph Visual-Relation Research-paper Transformer CV Object-Detection

Posted 2025-03-14 | Updated 2026-03-30 | Review | a few seconds read (About 93 words)

Fully Convolutional Scene Graph Generation

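This model is inspired by [[CenterNet]] and [[OpenPose Using Part Affinity Fields]]; it captures the relations between objects by adding a new convolutional head that generates RAFs (relation affinity fields).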
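To make the affinity-field idea concrete, here is a minimal sketch of how a 2D vector field could be turned into a relation score, following the line-integral scoring used for part affinity fields in OpenPose. It assumes a single field for one predicate, stored as a grid of (fx, fy) vectors, and two object centers in grid coordinates; the function name, the grid layout, and the sampling scheme are assumptions for illustration, not the paper's implementation.

```python
import math


def line_integral_score(field, start, end, samples=10):
    """Average dot product between the field and the unit direction start -> end.

    field[y][x] holds an (fx, fy) vector; start and end are (x, y) centers.
    A score near 1 means the field points from start toward end along the
    whole segment; a score near -1 means it points the opposite way.
    """
    dx, dy = end[0] - start[0], end[1] - start[1]
    length = math.hypot(dx, dy)
    if length == 0 or samples < 2:
        return 0.0
    ux, uy = dx / length, dy / length
    total = 0.0
    for i in range(samples):
        t = i / (samples - 1)
        # Nearest grid cell to the sampled point on the segment.
        x = round(start[0] + t * dx)
        y = round(start[1] + t * dy)
        fx, fy = field[y][x]
        total += fx * ux + fy * uy
    return total / samples


# Toy 5x5 field: vectors point in +x along row 2 and are zero elsewhere.
field = [[(0.0, 0.0)] * 5 for _ in range(5)]
field[2] = [(1.0, 0.0)] * 5

forward = line_integral_score(field, (0, 2), (4, 2))    # subject -> object
backward = line_integral_score(field, (4, 2), (0, 2))   # reversed direction
unrelated = line_integral_score(field, (0, 0), (4, 0))  # no field support
print(forward, backward, unrelated)
```

Because the score is signed, the same field distinguishes subject-to-object from object-to-subject, which is what lets a dense field encode directed relations.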

#Scene-graph Visual-Relation Research-paper CV FPN

Posted 2025-03-14 | Updated 2026-03-30 | Review | a few seconds read (About 3 words)

Visual Relationship Detection with Language Priors

#Scene-graph Visual-Relation Research-paper CV NLP
