Chen Yulin's Blog

Posted 2025-02-17Updated 2026-03-30Review2 minutes read (About 297 words)

ConceptFusion

将不同帧$X_t$中的特征集合在M中特征点的公式：

#Research-paper Multi-modal CV Reconstruct 3D-Scene Semantic CLIP

Posted 2025-02-15Updated 2026-03-30Review6 minutes read (About 919 words)

Scene-LLM

本文提出的模型主要想解决3D密集标注和交互式规划。
结合

#Robotics Research-paper LLM Multi-modal VLM 3D-Scene Embodied-AI CLIP

Posted 2025-02-13Updated 2026-03-30Review3 minutes read (About 505 words)

3D-LLM

Recent works have explored aligning images and videos with LLM for a new generation of multi-modal LLMs that equip LLMs with the ability to understand and reason about 2D images.
但是仍缺少对于3D物理空间进行分析的模型, which involves richer concepts such as spatial relationships, affordances, physics and interaction so on.

#Robotics Research-paper LLM Multi-modal 3D-Scene Embodied-AI

Posted 2025-02-13Updated 2026-03-30Reviewa few seconds read (About 0 words)

PointLLM

#Research-paper LLM Multi-modal 3D-Scene

Archives

Recents

Tags