DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
返回详情
VLA / Vision-Language-Action 每日论文卡