Automated Protein Classification Using Rigidity Analysis

dc.contributorDobosh, Paul
dc.contributorLerner, Barbara
dc.contributorWoodard, Craig
dc.contributor.advisorSt. John, Audrey
dc.contributor.authorSchirf, Courtney Lenna
dc.date.accessioned2011-06-01T15:46:37Z
dc.date.available2011-06-01T15:46:37Z
dc.date.gradyear2011en_US
dc.date.issued2011-06-01
dc.description.abstractProteins are one of the most important biological structures found in nature. Consequently, the ability to determine a protein’s function quickly and accurately is of considerable importance to the scientific community. Many modern computational techniques seek to solve this problem by using available structural information to classify proteins in order to infer protein function. Most current protein classification systems use a combination of automated techniques and manual curation. Even so, some levels for certain systems rely solely on expert analysis. Generally, in these circumstances, the characteristics of the group- ing are considered too broad or vague for automated techniques. The architecture level of CATH is one such example. This thesis explores the application of rigidity theory to the physical structure of proteins in automated protein classification by augmenting standard protein data (in the form of secondary structure information) with rigidity data. Machine learning algorithms are used to perform the classification. We evaluate the effect of the rigidity data on the accuracy of these algorithms. Specifically, we focus on three architectures within the Mainly Alpha class: orthogonal bundle, up-down bundle, and alpha/alpha barrel.en_US
dc.description.sponsorshipComputer Scienceen_US
dc.identifier.urihttp://hdl.handle.net/10166/875
dc.language.isoenen_US
dc.rights.holderAttribution-NonCommercial-NoDerivs 3.0 Unported
dc.rights.restrictedpublic
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/
dc.subjectautomated protein classificationen_US
dc.subjectrigidity theoryen_US
dc.subjectproteinen_US
dc.subjectbioinformaticsen_US
dc.subjectcomputational biologyen_US
dc.subjectmachine learningen_US
dc.titleAutomated Protein Classification Using Rigidity Analysisen_US
dc.typeThesisen_US
mhc.degreeUndergraduateen_US
mhc.institutionMount Holyoke College

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis.pdf
Size:
6.89 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.82 KB
Format:
Item-specific license agreed upon to submission
Description: