Rafati-Afshar, A. A., 2017. Unified processing framework of high-dimensional and overly imbalanced chemical datasets for virtual screening. Doctoral Thesis (Doctoral). Bournemouth University.
Full text available as:
|
PDF
RAFATI AFSHAR, Amir Ali_Ph.D._2017.pdf 5MB | |
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material. |
Abstract
Virtual screening in drug discovery involves processing large datasets containing unknown molecules in order to find the ones that are likely to have the desired effects on a biological target, typically a protein receptor or an enzyme. Molecules are thereby classified into active or non-active in relation to the target. Misclassification of molecules in cases such as drug discovery and medical diagnosis is costly, both in time and finances. In the process of discovering a drug, it is mainly the inactive molecules classified as active towards the biological target i.e. false positives that cause a delay in the progress and high late-stage attrition. However, despite the pool of techniques available, the selection of the suitable approach in each situation is still a major challenge. This PhD thesis is designed to develop a pioneering framework which enables the analysis of the virtual screening of chemical compounds datasets in a wide range of settings in a unified fashion. The proposed method provides a better understanding of the dynamics of innovatively combining data processing and classification methods in order to screen massive, potentially high dimensional and overly imbalanced datasets more efficiently.
Item Type: | Thesis (Doctoral) |
---|---|
Additional Information: | If you feel that this work infringes your copyright please contact the BURO Manager. |
Uncontrolled Keywords: | virtual screening; data imbalance; classification; SMOTE |
Group: | Faculty of Science & Technology |
ID Code: | 29248 |
Deposited By: | Symplectic RT2 |
Deposited On: | 22 May 2017 11:15 |
Last Modified: | 09 Aug 2022 16:04 |
Downloads
Downloads per month over past year
Repository Staff Only - |