SG-VLA: Learning Spatially-Grounded Vision-Language-Action Models for Mobile Manipulation
返回详情
VLA / Vision-Language-Action 每日论文卡