VAMOS: A Hierarchical Vision-Language-Action Model for Capability-Modulated and Steerable Navigation
VLA / Vision-Language-Action Daily Paper Card