[Paper Review] Inverse-LLaVA: Eliminating Alignment Pre-training Through Text-to-Vision Mapping
A detailed review of the paper "Inverse-LLaVA: Eliminating Alignment Pre-training Through Text-to-Vision Mapping," posted to [arXiv] by Tyler Derr.