2025-01-09
DINOv2: Learning Robust Visual Features without Supervision
Note
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
2025-01-08
DINO
2025-01-06
CLIP
LERF: Language Embedded Radiance Fields
Some Thoughts Regarding "Reconstruct Anything"
CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory
Simple Open-Vocabulary Object Detection with Vision Transformers
OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics
2024-12-31
One-Shot Visual Imitation Learning via Meta-Learning
Chen Yulin
SJTU student
Manchester by the Sea
2025-10-10
Survey of Climbing Robot Structures
2025-10-09
2025 Summer Schedule
Schedule
2025-09-17
2025 Korea
2025-09-14
Lec1