论文标题
布达佩斯淀粉样蛋白预测因子及其应用
The Budapest Amyloid Predictor and its Applications
论文作者
论文摘要
蛋白质的淀粉样蛋白状态在神经病学,生物化学和生物技术方面进行了广泛研究。与无定形聚集相反,淀粉样蛋白状态具有明确的结构,由平行和反平行的$β$ - 表中的平行和反平行的$β$ sheet组成。对淀粉样蛋白状态的理解正在随着新型分子成像工具(如低温电子显微镜)的发展而增长。基于序列的淀粉样蛋白预测因子是通过将主要人工神经网络(ANN)作为基础计算技术开发的。从良好的基于神经网络的预测器中,确定输入氨基酸序列的那些属性是一项非常困难的任务,这意味着网络的决策。在这里,我们提供了一个基于支持向量机(SVM)的基于六肽的预测指标,其正确性高于84 \%,即至少与已发布的基于ANN的工具一样好。与人工神经网络不同,SVM的决定更容易分析,并且从良好的预测因素中,我们可以推断出丰富的生化知识。 可用性和实现:布达佩斯淀粉样蛋白预测器Web服务器可以在https://pitgroup.org/bap上免费获得。
The amyloid state of proteins is widely studied with relevancy in neurology, biochemistry, and biotechnology. In contrast with amorphous aggregation, the amyloid state has a well-defined structure, consisting of parallel and anti-parallel $β$-sheets in a periodically repeated formation. The understanding of the amyloid state is growing with the development of novel molecular imaging tools, like cryogenic electron microscopy. Sequence-based amyloid predictors were developed by using mostly artificial neural networks (ANNs) as the underlying computational techniques. From a good neural network-based predictor, it is a very difficult task to identify those attributes of the input amino acid sequence, which implied the decision of the network. Here we present a Support Vector Machine (SVM)-based predictor for hexapeptides with correctness higher than 84\%, i.e., it is at least as good as the published ANN-based tools. Unlike the artificial neural networks, the decision of the SVMs are much easier to analyze, and from a good predictor, we can infer rich biochemical knowledge. Availability and Implementation: The Budapest Amyloid Predictor webserver is freely available at https://pitgroup.org/bap.