Improving protein structure prediction through data purification.

Fan. Zhang

Back

Improving protein structure prediction through data purification.

Doctoral Thesis

Open access

Improving protein structure prediction through data purification.

Fan. Zhang

Doctor of Philosophy (PhD), University of Surrey (United Kingdom).

2007

Abstract

In this thesis, the author pursues the target of improving accuracy of protein structural prediction through the procedure of data purification. A Protein Attributes Microtuning System (PAMS) is developed to prepare a variety of new datasets as and when required. Furthermore, a Protein Structural Accuracy Reckoner (PSAR) framework is used to recommend procedures that might lead to high prediction accuracy. By using the PSAR, it is shown that using a refined dataset generated by the PAMS, and implementing an appropriate window mechanism considerably improves the accuracy of protein structure prediction by 12%, giving a best accuracy of 90.97%. On average, almost all classifiers that are applied in the experiments result in accuracy increases of 10%-15%. A list of classifiers is categorized according to their prediction performances and classification efficiencies. A few refined datasets are proposed as benchmark datasets. Apart from the aforementioned achievements, examination of a total of 3,135,393 predictions tasks, which carried out by the PSAR framework, yielded 139 'best' and 73 'worst' combinations of amino acid features descriptors. In this analysis, the 'best' prediction gave 82.34%, and the 'worst' prediction gave 73.65%. To achieve a greater computational capacity the PSAR infrastructure is hosted on the Condor platform in the Department of Computing, University of Surrey. (Abstract shortened by ProQuest.).

Files and links (1)

pdf

101485594.37 MBDownload View

TextCC BY-NC-SA V4.0, Open Access

Metrics

35 File views/ downloads

54 Record Views

Details

Title: Improving protein structure prediction through data purification.
Creators: Fan. Zhang
Contributors: University of Surrey (United Kingdom). (Institution)
Awarding Institution: University of Surrey (United Kingdom).; Doctor of Philosophy (PhD)
Theses and Dissertations: Doctor of Philosophy (PhD), University of Surrey (United Kingdom).
Number of pages: 147
Date published: 2007
Date submitted: 25/10/2017
Identifiers: 99511691802346
Academic Unit: Surrey research (other units)
Resource Type: Doctoral Thesis

Improving protein structure prediction through data purification.

Abstract

Files and links (1)

Metrics

Details

Usage Policy