Predicting diabetes in healthy population through machine learning

Hasan Abbas, Lejla Alic, Marelyn Rios, Muhammad Abdul-Ghani, Khalid Qaraqe

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1% with a recall rate of 81.1% averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages567-570
Number of pages4
ISBN (Electronic)9781728122861
DOIs
Publication statusPublished - 1 Jun 2019
Event32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019 - Cordoba, Spain
Duration: 5 Jun 20197 Jun 2019

Publication series

NameProceedings - IEEE Symposium on Computer-Based Medical Systems
Volume2019-June
ISSN (Print)1063-7125

Conference

Conference32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019
CountrySpain
CityCordoba
Period5/6/197/6/19

Fingerprint

Medical problems
Type 2 Diabetes Mellitus
Learning systems
Population
Outcome Assessment (Health Care)
Support vector machines
Labels
Machine Learning
Datasets
Support Vector Machine

Keywords

  • Disease Prediction
  • Support vector machine
  • Type 2 diabetes

ASJC Scopus subject areas

  • Radiology Nuclear Medicine and imaging
  • Computer Science Applications

Cite this

Abbas, H., Alic, L., Rios, M., Abdul-Ghani, M., & Qaraqe, K. (2019). Predicting diabetes in healthy population through machine learning. In Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019 (pp. 567-570). [8787404] (Proceedings - IEEE Symposium on Computer-Based Medical Systems; Vol. 2019-June). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CBMS.2019.00117

Predicting diabetes in healthy population through machine learning. / Abbas, Hasan; Alic, Lejla; Rios, Marelyn; Abdul-Ghani, Muhammad; Qaraqe, Khalid.

Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 567-570 8787404 (Proceedings - IEEE Symposium on Computer-Based Medical Systems; Vol. 2019-June).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abbas, H, Alic, L, Rios, M, Abdul-Ghani, M & Qaraqe, K 2019, Predicting diabetes in healthy population through machine learning. in Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019., 8787404, Proceedings - IEEE Symposium on Computer-Based Medical Systems, vol. 2019-June, Institute of Electrical and Electronics Engineers Inc., pp. 567-570, 32nd IEEE International Symposium on Computer-Based Medical Systems, CBMS 2019, Cordoba, Spain, 5/6/19. https://doi.org/10.1109/CBMS.2019.00117
Abbas H, Alic L, Rios M, Abdul-Ghani M, Qaraqe K. Predicting diabetes in healthy population through machine learning. In Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019. Institute of Electrical and Electronics Engineers Inc. 2019. p. 567-570. 8787404. (Proceedings - IEEE Symposium on Computer-Based Medical Systems). https://doi.org/10.1109/CBMS.2019.00117
Abbas, Hasan ; Alic, Lejla ; Rios, Marelyn ; Abdul-Ghani, Muhammad ; Qaraqe, Khalid. / Predicting diabetes in healthy population through machine learning. Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 567-570 (Proceedings - IEEE Symposium on Computer-Based Medical Systems).
@inproceedings{dbff0d772b9b4fbb8f64f759805447f8,
title = "Predicting diabetes in healthy population through machine learning",
abstract = "In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1{\%} with a recall rate of 81.1{\%} averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.",
keywords = "Disease Prediction, Support vector machine, Type 2 diabetes",
author = "Hasan Abbas and Lejla Alic and Marelyn Rios and Muhammad Abdul-Ghani and Khalid Qaraqe",
year = "2019",
month = "6",
day = "1",
doi = "10.1109/CBMS.2019.00117",
language = "English",
series = "Proceedings - IEEE Symposium on Computer-Based Medical Systems",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "567--570",
booktitle = "Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019",

}

TY - GEN

T1 - Predicting diabetes in healthy population through machine learning

AU - Abbas, Hasan

AU - Alic, Lejla

AU - Rios, Marelyn

AU - Abdul-Ghani, Muhammad

AU - Qaraqe, Khalid

PY - 2019/6/1

Y1 - 2019/6/1

N2 - In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1% with a recall rate of 81.1% averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.

AB - In this paper, we revisit the data of the San Antonio Heart Study, and employ machine learning to predict the future development of type-2 diabetes. To build the prediction model, we use the support vector machines and ten features that are wellknown in the literature as strong predictors of future diabetes. Due to the unbalanced nature of the dataset in terms of the class labels, we use 10-fold cross-validation to train the model and a hold-out set to validate it. The results of this study show a validation accuracy of 84.1% with a recall rate of 81.1% averaged over 100 iterations. The outcomes of this study can help in identifying the population that is at high risk of developing type-2 diabetes in the future.

KW - Disease Prediction

KW - Support vector machine

KW - Type 2 diabetes

UR - http://www.scopus.com/inward/record.url?scp=85070958362&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85070958362&partnerID=8YFLogxK

U2 - 10.1109/CBMS.2019.00117

DO - 10.1109/CBMS.2019.00117

M3 - Conference contribution

AN - SCOPUS:85070958362

T3 - Proceedings - IEEE Symposium on Computer-Based Medical Systems

SP - 567

EP - 570

BT - Proceedings - 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems, CBMS 2019

PB - Institute of Electrical and Electronics Engineers Inc.

ER -