Addressing data imbalance in Vietnamese chest X-ray diagnosis using deep neural networks

Nguyen Trong Vinh, Pham Trung Hieu, Do Nang Toan, Lam Thanh Hien
Author affiliations

Authors

  • Nguyen Trong Vinh Faculty of Information Technology, Lac Hong University, Huynh Van Nghe Street, Bien Hoa Ward, Dong Nai Province, Viet Nam
  • Pham Trung Hieu Institute of Information Technology, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet Street, Nghia Do Ward, Ha Noi, Viet Nam https://orcid.org/0009-0002-5067-2545
  • Do Nang Toan Institute of Information Technology, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet Street, Nghia Do Ward, Ha Noi, Viet Nam
  • Lam Thanh Hien Faculty of Information Technology, Lac Hong University, Huynh Van Nghe Street, Bien Hoa Ward, Dong Nai Province, Viet Nam https://orcid.org/0000-0002-4539-3712

DOI:

https://doi.org/10.15625/1813-9663/23414

Keywords:

Chest X-ray diagnosis, imbalanced data, deep learning, convolutional neural networks, balanced accuracy.

Abstract

Pulmonary diseases such as pneumonia, tuberculosis, and particularly lung cancer represent serious public health concerns, necessitating early and accurate detection methods, in which chest X-ray classification plays a pivotal role. However, an inherent challenge in medical datasets is the issue of class imbalance, where rare but critical pathologies often have significantly fewer samples compared to normal cases or more common conditions. This study systematically proposes and evaluates a deep learning–based approach for automatic chest X-ray classification, with a focus on addressing data imbalance to improve the detection of minority classes. The approach involves data normalization, the application of appropriate data augmentation techniques, and loss function reweighting through class weighting. We conducted experiments and performance comparisons using state-of-the-art convolutional neural network (CNN) architectures, including DenseNet-121, ResNet-50, EfficientNet-B0, and MobileNet-V3 Small, on two chest X-ray datasets: a publicly available dataset from Kaggle and the Vietnam VRPACs dataset. Experimental results demonstrate that DenseNet-121, when combined with imbalance-handling techniques, achieved the highest balanced accuracy (BACC) of 0.85, indicating a substantial improvement in minority-class classification performance compared with methods without imbalance handling. This study provides a potential solution and a scientific foundation for the development and deployment of automated diagnostic support systems in healthcare facilities, particularly in Vietnam.

Downloads

Published

23-03-2026

How to Cite

[1]Nguyen Trong Vinh, Pham Trung Hieu, Do Nang Toan, and Lam Thanh Hien, “Addressing data imbalance in Vietnamese chest X-ray diagnosis using deep neural networks”, J. Comput. Sci. Cybern., vol. 42, no. 1, p. 88–102, Mar. 2026.

Issue

Section

Articles

Similar Articles

<< < 9 10 11 12 13 14 15 16 17 18 > >> 

You may also start an advanced similarity search for this article.