2D or 3D: Who Governs Salience in VLA Models? -- Tri-Stage Token Pruning Framework with Modality Salience Awareness
