Ngo Dinh, L., Le Ngoc, H., & Quoc Phan, L. (2023). OHYEAH AT VLSP2022-EVJVQA CHALLENGE: A JOINTLY LANGUAGE-IMAGE MODEL FOR MULTILINGUAL VISUAL QUESTION ANSWERING. Journal of Computer Science and Cybernetics, 39(4), 381–391. https://doi.org/10.15625/1813-9663/18122