Regression Imputation and Optimized Gaussian Naïve Bayes Algorithm for an Enhanced Diabetes Mellitus Prediction Model

Abstract Diabetes mellitus (DM) is a group of metabolic disorders characterized by high blood sugar. DM affects human metabolism and causes many complications, such as heart disease, neuropathy, diabetic retinopathy, kidney problems, skin disorders, and slow healing. It is therefore essential to predict the presence of DM with an automated diabetes diagnosis system, which can be implemented using machine learning algorithms. A variety of automated diabetes prediction systems have been proposed in previous studies, but their low prediction accuracy remains a major issue. This work develops a DM prediction system that improves prediction accuracy using an Optimized Gaussian Naïve Bayes algorithm. The proposed model uses the Pima Indians diabetes dataset as input to build the DM predictive model. Missing values in the input dataset are imputed using the regression imputation method. Sequential backward feature elimination is used to select the relevant risk factors for diabetes. The proposed machine learning classifier, Optimized Gaussian Naïve Bayes (OGNB), is applied to the selected risk factors to create an enhanced diabetes diagnostic system that predicts diabetes in an individual. The performance analysis shows that OGNB achieves a classification accuracy of 81.85%, outperforming other traditional machine learning classifiers. The proposed DM prediction system is effective compared with other diabetes prediction systems found in the literature. According to our experimental study, the OGNB-based prediction system is more appropriate for DM prediction.
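Two of the steps named in the abstract, regression imputation and Gaussian Naïve Bayes classification, can be illustrated with a minimal standard-library Python sketch. This is not the authors' implementation: the abstract does not say how the classifier is "optimized" or which regression model is used for imputation, so the code assumes a single-predictor least-squares line for imputation and a plain Gaussian Naïve Bayes with a small variance floor. The function names, the `var_smoothing` parameter, and the toy numbers are all hypothetical, and sequential backward feature elimination is omitted.

```python
import math
from statistics import mean, pvariance


def regression_impute(xs, ys):
    """Fill None entries in ys with predictions from a least-squares line
    y = intercept + slope * x fitted on the complete (x, y) pairs.
    Assumption: one predictor column; the paper may use a richer model."""
    pairs = [(x, y) for x, y in zip(xs, ys) if y is not None]
    mx = mean(x for x, _ in pairs)
    my = mean(y for _, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    slope = sxy / sxx
    intercept = my - slope * mx
    return [y if y is not None else intercept + slope * x
            for x, y in zip(xs, ys)]


def fit_gnb(rows, labels, var_smoothing=1e-9):
    """Estimate, per class, the prior and each feature's mean and variance.
    var_smoothing is a hypothetical floor that avoids zero variances."""
    model = {}
    for c in set(labels):
        members = [r for r, lab in zip(rows, labels) if lab == c]
        cols = list(zip(*members))  # transpose: one tuple per feature
        model[c] = (
            len(members) / len(rows),
            [mean(col) for col in cols],
            [pvariance(col) + var_smoothing for col in cols],
        )
    return model


def predict_gnb(model, row):
    """Return the class with the highest log posterior, treating the
    features as conditionally independent Gaussians given the class."""
    def log_posterior(c):
        prior, mus, variances = model[c]
        log_lik = sum(
            -0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
            for x, m, v in zip(row, mus, variances)
        )
        return math.log(prior) + log_lik
    return max(model, key=log_posterior)


# Toy data (invented for illustration, not taken from the Pima dataset).
predictor = [1.0, 2.0, 3.0, 4.0]
with_gap = [2.0, None, 6.0, 8.0]
imputed = regression_impute(predictor, with_gap)

rows = [[85, 22], [90, 24], [95, 23], [150, 33], [160, 35], [155, 31]]
labels = [0, 0, 0, 1, 1, 1]
model = fit_gnb(rows, labels)
low_risk = predict_gnb(model, [88, 23])
high_risk = predict_gnb(model, [158, 34])
```

Working in log space avoids numerical underflow when many per-feature likelihoods are multiplied, which is the usual design choice for Naïve Bayes classifiers.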

Bibliographic Details
Main Authors: Mohideen, Dhilsath Fathima Mohammed; Raj, Justin Samuel Savari; Raj, Raja Soosaimarian Peter
Format: Digital journal
Language: English
Published: Instituto de Tecnologia do Paraná - Tecpar 2021
Online Access: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S1516-89132021000100624
id oai:scielo:S1516-89132021000100624
record_format ojs
keywords Optimized Gaussian Naïve Bayes classifier; Regression imputation; Sequential backward feature elimination; Diabetes mellitus diagnosis
journal Brazilian Archives of Biology and Technology v.64 2021
date 2021-01-01
datestamp 2022-02-18
type info:eu-repo/semantics/article
rights info:eu-repo/semantics/openAccess
doi 10.1590/1678-4324-2021210181
institution SCIELO
collection OJS
country Brazil
countrycode BR
component Journal
access Online
databasecode rev-scielo-br
tag journal
region South America
libraryname SciELO
language English
format Digital