2-D Graphical Representation for Characteristic Sequences of DNA and its Application

  • Li, Chun (Department of Mathematics, Bohai University)
  • Hu, Ji (Faculty of Chemistry and Chemical Engineering, Bohai University)
  • Received : 2005.12.03
  • Accepted : 2006.02.14
  • Published : 2006.05.31

Abstract

DNA sequencing has produced an abundance of sequence data for various species, so characterizing and comparing sequences has become increasingly important yet remains difficult. In this paper, we first give a 2-D ladderlike graphical representation for each of the characteristic sequences of a DNA sequence. We then construct a 3-component vector to characterize the DNA sequence; its components are the normalized ALE-indices extracted from the three resulting 2-D graphs via their D/D matrices. An examination of the similarities and dissimilarities among the $\beta$-globin gene sequences of different species illustrates the utility of the approach.
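The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions, because the abstract does not specify them: the 0/1 coding of the three base classifications (purine/pyrimidine, amino/keto, weak/strong hydrogen bond), the ladder geometry (a unit step up for 1, a unit step right for 0), the D/D entry taken as Euclidean distance divided by the number of steps along the curve, and the ALE-index taken as the closed-form estimate $\frac{1}{2}\left(\frac{1}{n}\|M\|_{m_1} + \sqrt{\frac{n-1}{n}}\,\|M\|_F\right)$ divided by $n$ for normalization. All function names are hypothetical.

```python
import math

# ASSUMPTION: bases mapped to 1 in each characteristic sequence.
# R/Y: purines (A, G) -> 1; M/K: amino (A, C) -> 1; W/S: weak H-bond (A, T) -> 1.
CLASSES = {"RY": set("AG"), "MK": set("AC"), "WS": set("AT")}


def characteristic_sequence(dna, ones):
    """Turn a DNA string into a 0/1 sequence for one base classification."""
    return [1 if base in ones else 0 for base in dna.upper()]


def ladder_points(bits):
    """ASSUMED ladder geometry: start at (0, 0); 1 steps up, 0 steps right."""
    x = y = 0
    points = [(0, 0)]
    for bit in bits:
        if bit:
            y += 1
        else:
            x += 1
        points.append((x, y))
    return points


def dd_matrix(points):
    """D/D matrix: Euclidean distance over number of steps along the curve."""
    n = len(points)
    m = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            m[i][j] = m[j][i] = math.dist(points[i], points[j]) / (j - i)
    return m


def normalized_ale(m):
    """ASSUMED ALE-index formula (see lead-in), divided by the matrix order n."""
    n = len(m)
    m1_norm = sum(abs(v) for row in m for v in row)
    frobenius = math.sqrt(sum(v * v for row in m for v in row))
    ale = 0.5 * (m1_norm / n + math.sqrt((n - 1) / n) * frobenius)
    return ale / n


def descriptor(dna):
    """3-component vector: one normalized ALE-index per characteristic sequence."""
    return tuple(
        normalized_ale(dd_matrix(ladder_points(characteristic_sequence(dna, ones))))
        for ones in CLASSES.values()
    )


def dissimilarity(dna_a, dna_b):
    """Euclidean distance between two descriptors (one possible comparison)."""
    return math.dist(descriptor(dna_a), descriptor(dna_b))
```

Under these assumptions, two sequences would be compared by calling `dissimilarity(seq_a, seq_b)`, with smaller values suggesting more similar sequences. Comparing the vectors by Euclidean distance is itself only one option; the abstract does not say which measure the paper uses.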
