Diagnosis of the Diseases Using Resampling Methods with Machine Learning Algorithms

Authors

  • Ahmet Çelik Computer Technologies Department, Tavşanlı Vocational School, Kütahya Dumlupınar University, Turkey

DOI:

https://doi.org/10.7546/CRABS.2023.07.10

Keywords:

machine learning, pneumonia, medical imaging techniques, medical decision, chest X-ray images, classification

Abstract

The rapid diagnosis of diseases is very important for the early start of the treatment process. Pneumonia is a disease that affects the lungs and can cause death in advanced cases. Pneumonia is still today diagnosed by doctors while examining chest X-ray images. As diagnosis of the diseases using machine learning algorithms will be useful. In addition, high success of rate will be obtained while classifying balanced dataset by using machine learning algorithms. Resampling methods are used to balance the dataset by using under-sampling or over-sampling methods. In literature, there is no study comparing under-sampling and over-sampling methods. In the study, an open source dataset was used which included two classes published through the Kaggle data store. The data set includes 1341 healthy and 3875 pneumonia chest X-ray images. Two different resampling methods named Random under-sampling (RU) and ADASYN (Adaptive Synthetic) over-sampling were used while balancing healthy and pneumonia images. After this operation obtained data were used for training machine learning algorithms. In the study, first and second level attributes of the X-ray images in the dataset were used. Logistic Regression (LR) and Support Vector Machine (SVM) algorithms were used for classification of dataset. According to the results obtained, 93.109% accuracy rate of classification success was achieved of X-ray image dataset which balanced by ADASYN over-sampling method with SVM algorithms.

Author Biography

Ahmet Çelik, Computer Technologies Department, Tavşanlı Vocational School, Kütahya Dumlupınar University, Turkey

Mailing Address:
Computer Technologies Department,
Tavşanlı Vocational School,
Kütahya Dumlupınar University
Kütahya, Turkey

E-mail: ahmet.celik@dpu.edu.tr

Downloads

Published

31-07-2023

How to Cite

[1]
A. Çelik, “Diagnosis of the Diseases Using Resampling Methods with Machine Learning Algorithms”, C. R. Acad. Bulg. Sci. , vol. 76, no. 7, pp. 1065–1076, Jul. 2023.

Issue

Section

Engineering Sciences