[๋ ผ๋ฌธ๋ฆฌ๋ทฐ] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation
Yang Tian์ด [arXiv]์ ๊ฒ์ํ โInstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulationโ ๋ ผ๋ฌธ์ ๋ํ ์์ธํ ๋ฆฌ๋ทฐ์ ๋๋ค.