2025-01-09
DINOv2- Learning Robust Visual Features without Supervision
Note
AN IMAGE IS WORTH 16X16 WORDS- TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE
2025-01-08
DINO
2025-01-06
CLIP
LERF- Language Embedded Radiance Fields
Some Thoughts Regarding -Reconstruct Anything-
CLIP-Fields- Weakly Supervised Semantic Fields for Robotic Memory
Simple Open-Vocabulary Object Detection with Vision Transformers
OK-Robot- What Really Matters in Integrating Open-Knowledge Models for Robotics
2024-12-31
One-Shot Visual Imitation Learning via Meta-Learning
Chen Yulin
SJTU student
Manchester by the Sea
Posts
256
Categories
8
Tags
186
2025-05-06
Deformable Convolutional Networks
Review
2025-05-01
2025 Summer Schedule
Schedule
2025-04-24
Associative Embedding= End-to-End Learning for Joint Detection and Grouping
CenterNet
2025-04-23
FCSGG Repo Explanation