Thinh, Nguyen Van, Tran Van Lang, and Van The Thanh. “RGTranCNet: Effective Image Captioning Model Using Cross-Attention and Semantic Knowledge.” Vietnam Journal of Science and Technology (July 15, 2025). Accessed January 10, 2026. https://vjs.ac.vn/jst/article/view/22381.