Okagbue, H. I., Ijezie, O. A., Ugwoke, P. O., Adeyemi-Kayode, T. M. and Jonathan, O., 2023. Single-label machine learning classification revealed some hidden but inter-related causes of five psychotic disorder diseases. Heliyon, 9 (9), e19422.
Full text available as:
|
PDF (OPEN ACCESS ARTICLE)
1-s2.0-S2405844023066306-main.pdf - Published Version Available under License Creative Commons Attribution Non-commercial No Derivatives. 950kB | |
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
DOI: 10.1016/j.heliyon.2023.e19422
Abstract
Psychotic disorder diseases (PDD) or mental illnesses are group of illnesses that affect the minds and impair the cognitive ability, retard emotional ability and obstruct the process of communication and relationship with others and are characterized by delusions, hallucinations and disoriented or disordered pattern of thinking. Prognosis of PDD is not sufficient because of the nature of the diseases and as such adequate form of diagnosis is required to detect, manage and treat the illness. This paper applied the single-label classification (SLC) machine learning approach in mining of electronic health records of people with PDD in Nigeria using eleven independent (demographic) variables and five PDD as target variables. The five PDDs are Insomnia, Schizophrenia, Minimal Brain dysfunction (MBD), which is also known as Attention-Deficit/Hyperactivity Disorder (ADHD), Vascular Dementia (VD) and Bipolar Disorder (BD). The aim of using SLC is that it would be easier to detect some PDDs that are related to each other without the loss of information, which is a plus over multi-label classification (MLC). ReliefF algorithm was used at each experiment to precipitate the order of importance of the independent variables and redundant variables were excluded from the analysis. The order of the variables in feature selection was matched with feature importance after the classifications and quantified using the Spearman rank correlation coefficient. The data was divided into: 70% for training and 30% for testing. Four new performance metrics adapted from the root mean square (RMSE) were proposed and used to measure the differences between the performance results of the 10 Machine learning models in terms of the training and testing and secondly, feature and without feature selection. The new metrics are close to zero which is an indication that the use of feature selection and cross validation may not greatly affects the accuracy of the SLC. When the PDDs are included as predictors for classifying others, there was a tremendous improvement as revealed by the four new metrics for classification accuracy (CA), precision and recall. Analysis of variance showed the four different metrics differs significantly for classification accuracy (CA) and precision. However, there were no significant difference between the CA and precision when the duo are compared together across the four evaluation metrics at p value less than 0.05.
Item Type: | Article |
---|---|
ISSN: | 2405-8440 |
Uncontrolled Keywords: | Classification; Diagnosis; Feature importance; Machine learning; Psychotic disorder; Single label approach |
Group: | Faculty of Science & Technology |
ID Code: | 38957 |
Deposited By: | Symplectic RT2 |
Deposited On: | 06 Sep 2023 09:35 |
Last Modified: | 06 Sep 2023 09:35 |
Downloads
Downloads per month over past year
Repository Staff Only - |