[1]
Ngo Dinh, L., Le Ngoc, H. and Quoc Phan, L. 2023. OHYEAH AT VLSP2022-EVJVQA CHALLENGE: A JOINTLY LANGUAGE-IMAGE MODEL FOR MULTILINGUAL VISUAL QUESTION ANSWERING. Journal of Computer Science and Cybernetics. 39, 4 (Dec. 2023), 381–391. DOI:https://doi.org/10.15625/1813-9663/18122.