SparseOccVLA: Bridging Occupancy and Vision-Language Models via Sparse Queries for Unified 4D Scene Understanding and Planning

返回详情 VLA / Vision-Language-Action 每日论文卡