DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA

返回详情 VLA / Vision-Language-Action 每日论文卡