SG-VLA: Learning Spatially-Grounded Vision-Language-Action Models for Mobile Manipulation

返回详情 VLA / Vision-Language-Action 每日论文卡