Nguyen Van Thinh, Tran Van Lang, and Van The Thanh Van. “OD-VR-Cap: Image Captioning Based on Detecting and Predicting Relationships Between Objects”. Journal of Computer Science and Cybernetics 40, no. 4 (December 3, 2024): 327–346. Accessed April 12, 2025. https://vjs.ac.vn/index.php/jcc/article/view/20929.