Mechanisms classification for glycoside hydrolases by sequence and structure features using computational methods

International Journal of Data Mining and Bioinformatics
Fan Yang, Lin Wang

Abstract

Glycoside Hydrolases (GHs) have played key roles in the development of biofuels as well as many other industries. Research aimed at accurate classification of catalytic mechanisms to increase the catalytic activity of GHs is receiving extensive attention. The traditional theories or methods used in the study of catalytic mechanisms of GHs are limited by reaction conditions. They are not suitable for the study of various GHs because different enzymes would show devious physicochemical properties. In this paper, a new method is proposed to classify and predict the catalytic mechanism of a certain glycoside hydrolase according to their sequence and structure features using k-Nearest Neighbor (kNN) classifier, Support Vector Machine (SVM), Naive Bayes (NB) Classifier and the Multilayer Perception (MLP) Classifier. The classification performance of the four computational methods used were evaluated and compared. Experimental results show that each classifier has its own advantages, but the kNN classifier is more accurate at the overall level. This research also helps us to gain a better understanding of the catalytic mechanisms in different GHs.

Related Concepts

Related Feeds

Bioinformatics in Biomedicine

Bioinformatics in biomedicine incorporates computer science, biology, chemistry, medicine, mathematics and statistics. Discover the latest research on bioinformatics in biomedicine here.

Biofuels (ASM)

Biofuels are produced through contemporary processes from biomass rather than geological processes involved in fossil fuel formation. Examples include biodiesel, green diesel, biogas, etc. Discover the latest research on biofuels in this feed.