A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach
Abstract
We have introduced a new method of protein secondary structure prediction which is based on the theory of support vector machine (SVM). SVM represents a new approach to supervised pattern classification which has been successfully applied to a wide range of pattern recognition problems, including object recognition, speaker identification, gene function prediction with microarray expression profile, etc. In these cases, the performance of SVM either matches or is significantly better than that of traditional machine learning approaches, including neural networks.The first use of the SVM approach to predict protein secondary structure is described here. Unlike the previous studies, we first constructed several binary classifiers, then assembled a tertiary classifier for three secondary structure states (helix, sheet and coil) based on these binary classifiers. The SVM method achieved a good performance of segment overlap accuracy SOV=76.2 % through sevenfold cross validation on a database of 513 non-homologous protein chains with multiple sequence alignments, which out-performs existing methods. Meanwhile three-state overall per-residue accuracy Q(3) achieved 73.5 %, which is at least comparable to existing single prediction met...Continue Reading
Citations
Related Concepts
Related Feeds
Cajal Bodies & Gems
Cajal bodies or coiled bodies are dense foci of coilin protein. Gemini of Cajal bodies, or gems, are microscopically similar to Cajal bodies. It is believed that Cajal bodies play important roles in RNA processing while gems assist the Cajal bodies. Find the latest research on Cajal bodies and gems here.