I2E: From Image Pixels to Actionable Interactive Environments for Text-Guided Image Editing
VLA / Vision-Language-Action Daily Paper Card