Tag: 3D-Scene - Chen Yulin's Blog

Scene-LLM

Posted 2025-02-15Updated 2026-03-30Review6 minutes read (About 919 words)

本文提出的模型主要想解决3D密集标注和交互式规划。
结合

#Robotics Research-paper LLM Multi-modal VLM 3D-Scene Embodied-AI CLIP

3D-LLM

Posted 2025-02-13Updated 2026-03-30Review3 minutes read (About 505 words)

Recent works have explored aligning images and videos with LLM for a new generation of multi-modal LLMs that equip LLMs with the ability to understand and reason about 2D images.
但是仍缺少对于3D物理空间进行分析的模型, which involves richer concepts such as spatial relationships, affordances, physics and interaction so on.

#Robotics Research-paper LLM Multi-modal 3D-Scene Embodied-AI

PointLLM

Posted 2025-02-13Updated 2026-03-30Reviewa few seconds read (About 0 words)

#Research-paper LLM Multi-modal 3D-Scene

LERF- Language Embedded Radiance Fields

Posted 2025-01-06Updated 2026-03-30Note5 minutes read (About 790 words)

LERF- Language Embedded Radiance Fields

NeRF+CLIP

#Research-paper LLM CV Reconstruct 3D-Scene Embodied-AI Semantic CLIP

CLIP-Fields- Weakly Supervised Semantic Fields for Robotic Memory

Posted 2025-01-06Updated 2026-03-30Note4 minutes read (About 541 words)

CLIP-Fields- Weakly Supervised Semantic Fields for Robotic Memory

疑问：

#Robotics Research-paper Reconstruct 3D-Scene Semantic CLIP

OK-Robot- What Really Matters in Integrating Open-Knowledge Models for Robotics

Posted 2025-01-06Updated 2026-03-30Note6 minutes read (About 959 words)

OK-Robot- What Really Matters in Integrating Open-Knowledge Models for Robotics

Creating a general-purpose robot has been a longstanding dream of the robotics community.

#Robotics Research-paper LLM CV Reconstruct RobotLearning 3D-Scene Embodied-AI Semantic CLIP Open-Vocabulary

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

Posted 2024-12-17Updated 2026-03-30Reviewa few seconds read (About 42 words)

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

和我的想法非常相近，完成度也很高啊喂。可以参考他的实现思路，引用的文章等等。

#Robotics Scene-graph Research-paper LLM CV Reconstruct 3D-Scene Embodied-AI Semantic Open-Vocabulary