Transforming Monolithic Foundation Models into Embodied Multi-Agent Architectures for Human-Robot Collaboration
