Ngo Dinh, Luan, Hieu Le Ngoc, and Long Quoc Phan. “OHYEAH AT VLSP2022-EVJVQA CHALLENGE: A JOINTLY LANGUAGE-IMAGE MODEL FOR MULTILINGUAL VISUAL QUESTION ANSWERING”. Journal of Computer Science and Cybernetics 39, no. 4 (December 25, 2023): 381–391. Accessed December 21, 2024. https://vjs.ac.vn/index.php/jcc/article/view/18122.