3D-LLM

3D-LLM

Recent works have explored aligning images and videos with LLM for a new generation of multi-modal LLMs that equip LLMs with the ability to understand and reason about 2D images.
但是仍缺少对于3D物理空间进行分析的模型, which involves richer concepts such as spatial relationships, affordances, physics and interaction so on.

PointLLM
ProgPrompt

2025 Winter&Spring Schedule

让一切隐于晦朔,就在那月之暗面

让一切隐于晦朔,就在那月之暗面

Read more
Vision Transformers Need Registers