Machine learning based efficient prediction of positive cases of waterborne diseases

dc.authoridMohammad, Fida/0009-0001-2760-7211
dc.authorid, Mushtaq Hussain/0000-0002-7238-7924
dc.authoridIbrahim, Muhammad/0000-0001-8729-8759
dc.contributor.authorHussain, Mushtaq
dc.contributor.authorCifci, Mehmet Akif
dc.contributor.authorSehar, Tayyaba
dc.contributor.authorNabi, Said
dc.contributor.authorCheikhrouhou, Omar
dc.contributor.authorMaqsood, Hasaan
dc.contributor.authorIbrahim, Muhammad
dc.date.accessioned2025-07-03T21:25:42Z
dc.date.issued2023
dc.departmentBalıkesir Üniversitesi
dc.description.abstractBackgroundWater quality has been compromised and endangered by different contaminants due to Pakistan's rapid population development, which has resulted in a dramatic rise in waterborne infections and afflicted many regions of Pakistan. Because of this, modeling and predicting waterborne diseases has become a hot topic for researchers and is very important for controlling waterborne disease pollution.MethodsIn our study, first, we collected typhoid and malaria patient data for the years 2017-2020 from Ayub Medical Hospital. The collected data set has seven important input features. In the current study, different ML models were first trained and tested on the current study dataset using the tenfold cross-validation method. Second, we investigated the importance of input features in waterborne disease-positive case detection. The experiment results showed that Random Forest correctly predicted malaria-positive cases 60% of the time and typhoid-positive cases 77% of the time, which is better than other machine-learning models. In this research, we have also investigated the input features that are more important in the prediction and will help analyze positive cases of waterborne disease. The random forest feature selection technique has been used, and experimental results have shown that age, history, and test results play an important role in predicting waterborne disease-positive cases. In the end, we concluded that this interesting study could help health departments in different areas reduce the number of people who get sick from the water.
dc.description.sponsorshipHigher Education Commission (HEC) Pakistan under Start-Up Research Grant Program (SRGP) [162/IPFP-II(Batch-I)/SRGP/NAHE/HEC/2020/183]
dc.description.sponsorshipAcknowledgementsThis work was supported by the Higher Education Commission (HEC) Pakistan under Start-Up Research Grant Program (SRGP) (Ref. No. 162/IPFP-II(Batch-I)/SRGP/NAHE/HEC/2020/183).
dc.identifier.doi10.1186/s12911-022-02092-1
dc.identifier.issn1472-6947
dc.identifier.issue1
dc.identifier.pmid36653779
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1186/s12911-022-02092-1
dc.identifier.urihttps://hdl.handle.net/20.500.12462/21613
dc.identifier.volume23
dc.identifier.wosWOS:000913470800001
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakPubMed
dc.language.isoen
dc.publisherBmc
dc.relation.ispartofBmc Medical Informatics and Decision Making
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WOS_20250703
dc.subjectMachine learning
dc.subjectPatient information
dc.subjectMalaria
dc.subjectTyphoid
dc.subjectWaterborne disease
dc.titleMachine learning based efficient prediction of positive cases of waterborne diseases
dc.typeArticle

Dosyalar