Vision-Language Interpreter for Robot Task Planning

Vision-Language Interpreter for Robot Task Planning

Pixtral 12B
ConceptGraphs= Open-Vocabulary 3D Scene Graphs for Perception and Planning

ConceptGraphs= Open-Vocabulary 3D Scene Graphs for Perception and Planning

From Pixels to Graphs= Open-Vocabulary Scene Graph Generation with  Vision-Language Models
OMG-LLaVA

OMG-LLaVA