Arora, P., Mishra, A. and Malhi, A., 2021. N-semble-based method for identifying Parkinson's disease genes. Neural Computing and Applications, 35, 23829-23839.
Full text available as:
|
PDF (OPEN ACCESS ARTICLE)
s00521-021-05974-z (1).pdf - Published Version Available under License Creative Commons Attribution. 1MB | |
PDF (OPEN ACCESS ARTICLE)
Arora2021_Article_N-semble-basedMethodForIdentif.pdf - Published Version Restricted to Repository staff only Available under License Creative Commons Attribution. 1MB | ||
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
DOI: 10.1007/s00521-021-05974-z
Abstract
Parkinson’s disease (PD) genes identification plays an important role in improving the diagnosis and treatment of the disease. A number of machine learning methods have been proposed to identify disease-related genes, but only few of these methods are adopted for PD. This work puts forth a novel neural network-based ensemble (n-semble) method to identify Parkinson’s disease genes. The artificial neural network is trained in a unique way to ensemble the multiple model predictions. The proposed n-semble method is composed of four parts: (1) protein sequences are used to construct feature vectors using physicochemical properties of amino acid; (2) dimensionality reduction is achieved using the t-Distributed Stochastic Neighbor Embedding (t-SNE) method, (3) the Jaccard method is applied to find likely negative samples from unknown (candidate) genes, and (4) gene prediction is performed with n-semble method. The proposed n-semble method has been compared with Smalter’s, ProDiGe, PUDI and EPU methods using various evaluation metrics. It has been concluded that the proposed n-semble method outperforms the existing gene identification methods over the other methods and achieves significantly higher precision, recall and F Score of 88.9%, 90.9% and 89.8%, respectively. The obtained results confirm the effectiveness and validity of the proposed framework.
Item Type: | Article |
---|---|
ISSN: | 0941-0643 |
Uncontrolled Keywords: | Parkinson’s disease; Machine learning methods; Healthcare; Physicochemical properties of amino acid; Neural networks |
Group: | Faculty of Science & Technology |
ID Code: | 35614 |
Deposited By: | Symplectic RT2 |
Deposited On: | 08 Jun 2021 09:41 |
Last Modified: | 17 May 2024 15:35 |
Downloads
Downloads per month over past year
Repository Staff Only - |