Chen Yulin's Blog

Posted 2025-03-18 | Updated 2026-03-08 | Review | 2 minutes read (About 355 words)

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning

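The main ideas are all in the pseudocode above: by expanding only part of the scene graph (which has a strict hierarchical structure), the method controls the size of the scene graph fed to the LLM.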
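The partial-expansion idea can be illustrated with a minimal Python sketch. This is not the paper's implementation: the `Node` class, the toy building layout, and the function names are all hypothetical, and the LLM's choice of which node to expand is replaced here by a fixed left-to-right order. It only shows how expanding and contracting nodes in a strict hierarchy keeps the visible graph small.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Node:
    """One node in a strictly hierarchical scene graph (hypothetical type)."""
    name: str
    children: list[Node] = field(default_factory=list)
    expanded: bool = False  # a collapsed node hides its whole subtree


def visible_names(node: Node) -> list[str]:
    """List the nodes that would be serialized and sent to the LLM."""
    names = [node.name]
    if node.expanded:
        for child in node.children:
            names.extend(visible_names(child))
    return names


def count_nodes(node: Node) -> int:
    """Size of the full graph, ignoring the expanded/collapsed state."""
    return 1 + sum(count_nodes(child) for child in node.children)


def search(node: Node, target: str) -> bool:
    """Expand nodes until `target` is visible; contract dead ends.

    Assumption: a real system would let the LLM pick the node to expand.
    Here the children are simply tried in order.
    """
    if node.name == target:
        return True
    if not node.children:
        return False
    node.expanded = True            # expand: reveal the children
    for child in node.children:
        if search(child, target):
            return True
    node.expanded = False           # contract: nothing relevant below
    return False


def build_demo_graph() -> Node:
    """Toy building -> floor -> room -> object hierarchy (made-up data)."""
    kitchen = Node("kitchen", [Node("fridge"), Node("mug")])
    office = Node("office", [Node("desk"), Node("laptop")])
    bedroom = Node("bedroom", [Node("bed")])
    return Node("building", [Node("floor1", [kitchen, office]),
                             Node("floor2", [bedroom])])


if __name__ == "__main__":
    root = build_demo_graph()
    search(root, "laptop")
    print(visible_names(root))
    # ['building', 'floor1', 'kitchen', 'office', 'desk', 'laptop', 'floor2']
    print(len(visible_names(root)), "of", count_nodes(root), "nodes visible")
```

In this toy run the kitchen is expanded, found to be irrelevant, and contracted again, so only 7 of the 11 nodes remain visible once the laptop is found.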

Posted 2025-03-18 | Updated 2026-03-08 | Review | a minute read (About 197 words)

Clio: Real-time Task-Driven Open-Set 3D Scene Graphs

Contributions:

#Robotics Scene-graph Research-paper 3D-Scene Embodied-AI CLIP Open-Vocabulary Task-Planning

Posted 2025-03-18 | Updated 2026-03-08 | Review | a few seconds read (About 3 words)

Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation

#Robotics Scene-graph Visual-Relation Research-paper 3D-Scene Embodied-AI Open-Vocabulary Hierarchical

Posted 2025-03-12 | Updated 2026-03-08 | Review | 10 minutes read (About 1524 words)

Reconstruct Anything Literature Review

Papers covered:

#Robotics Scene-graph Visual-Relation Research-paper Reconstruct Physical-Scene

Posted 2025-03-12 | Updated 2026-03-08 | Review | a few seconds read (About 3 words)

Reasoning with Scene Graphs for Robot Planning under Partial Observability

#Robotics Scene-graph Visual-Relation Research-paper RobotLearning Task-Planning

Posted 2025-03-11 | Updated 2026-03-08 | Review | a few seconds read (About 23 words)

Scene Reconstruction with Functional Objects for Robot Autonomy

The idea is similar to that of Fei-Fei Li's [[ACDC- Automated Creation of Digital Cousins for Robust Policy Learning]].

#Robotics Scene-graph Research-paper Reconstruct RobotLearning 3D-Scene Embodied-AI Physical-Scene

Posted 2025-03-11 | Updated 2026-03-08 | Review | a few seconds read (About 3 words)

Part-level Scene Reconstruction Affords Robot Interaction

#Robotics Scene-graph Visual-Relation Research-paper Reconstruct RobotLearning 3D-Scene Embodied-AI Physical-Scene

Posted 2025-02-15 | Updated 2026-03-08 | Review | 6 minutes read (About 919 words)

Scene-LLM

The model proposed in this paper mainly aims to address 3D dense captioning and interactive planning.
Combining

#Robotics Research-paper LLM Multi-modal VLM 3D-Scene Embodied-AI CLIP

Posted 2025-02-13 | Updated 2026-03-08 | Review | 3 minutes read (About 505 words)

3D-LLM

Recent works have explored aligning images and videos with LLM for a new generation of multi-modal LLMs that equip LLMs with the ability to understand and reason about 2D images.
However, models that can analyze the 3D physical world are still lacking; such analysis involves richer concepts such as spatial relationships, affordances, physics, interaction, and so on.

#Robotics Research-paper LLM Multi-modal 3D-Scene Embodied-AI

Posted 2025-02-13 | Updated 2026-03-08 | Review | a few seconds read (About 0 words)

ProgPrompt

#Robotics Research-paper LLM Task-Planning
