Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

返回详情 VLA / Vision-Language-Action 每日论文卡