Construction of Large Library of Protein Fragments Using Inter Alpha-carbon Distance and Binet-Cauchy Distance

내부 알파탄소간 거리와 비네-코시 거리를 사용한 대규모 단백질 조각 라이브러리 구성

  • Chi, Sang-mun (Department of Computer Science and Engineering, Kyungsung University)
  • Received : 2015.08.21
  • Accepted : 2015.09.22
  • Published : 2015.12.31


Representing protein three-dimensional structure by concatenating a sequence of protein fragments gives an efficient application in analysis, modeling, search, and prediction of protein structures. This paper investigated the effective combination of distance measures, which can exploit large protein structure database, in order to construct a protein fragment library representing native protein structures accurately. Clustering method was used to construct a protein fragment library. Initial clustering stage used inter alpha-carbon distance having low time complexity, and cluster extension stage used the combination of inter alpha-carbon distance, Binet-Cauchy distance, and root mean square deviation. Protein fragment library was constructed by leveraging large protein structure database using the proposed combination of distance measures. This library gives low root mean square deviation in the experiments representing protein structures with protein fragments.


  1. A. G. de Brevern, C. Etchebest, and S. Hazout, "Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks," Proteins, vol. 41, pp. 271-287, 2000.<271::AID-PROT10>3.0.CO;2-Z
  2. R. Kolodny, P. Koehl, L. Guibas, and M. Levitt, "Small libraries of protein fragments model native protein structures accurately," Journal of Molecular Biology, vol. 323, pp. 297-307, 2002.
  3. A. C. Camproux, R. Gautier, and P. Tuffery, "A hidden markov model derived structural alphabet for proteins," Journal of Molecular Biology, vol. 339, pp. 591-605, 2004.
  4. T. Hamelryck, J. T. Kent, and A. Krogh, "Sampling realistic protein conformations using local structural bias," PLoS Comput. Biol. vol. 2, e131, pp. 1121-1133, 2006.
  5. S. C. Li, D. Bu, J. Xu, and M. Li, "Fragment-HMM: A new approach to protein structure prediction,", Protein Science, vol. 17, pp. 1025-1934, 2008.
  6. I. Kalev and M. Habeck, "HHfrag: HMM-based fragment detection using HHpred," Bioinformatics, vol. 27, no. 22, pp. 3110-3116, 2011.
  7. A. P. Joseph, et al., "A short survey on protein blocks," Biophys. Rev. vol. 2, pp. 137-145, 2010.
  8. W. Kapsch, "A discussion of the solution for the best rotation to relate two sets of vectors" Acta Crystallog. sect., vol. 34, pp. 827-828, 1978.
  9. F. Guyon and P. Tuffery, "Fast protein fragment similarity scoring using a Binet-Cauchy kernel," Bioinformatics, vol. 30, no. 6, pp. 784-791, 2014.
  10. N. K. Fox, S. E. Brenner J. M. Chandonia, "SCOPe: Structural Classification of Proteins-extended, integrating SCOP and ASTRAL data and classification of new structures," Nucl. Acids Res. 42(Database issue), D304-309, 2014.