VLA-OPD: Bridging Offline SFT and Online RL for Vision-Language-Action Models via On-Policy Distillation

返回详情 VLA / Vision-Language-Action 每日论文卡