VQ-VAE and Latent Action for Robotics

VQ-VAE and Latent Action for Robotics

Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding

Representation Learning for Scene Graph Completion via Jointly Structural and Visual Embedding

Momentum Contrast for Unsupervised Visual Representation Learning

Momentum Contrast for Unsupervised Visual Representation Learning

DINOv2- Learning Robust Visual Features without Supervision
DINO

DINO