Alignment of Noncoding Ribonucleic Acids with Pseudoknots Using Context-Sensitive Hidden Markov Model

Nayyer Mostaghim Bakhshayesh, Mousa Shamsi, Mohammad Hossein Sedaaghi, Hossein Ebrahimnezhad

DOI: 10.4103/jmss.JMSS_11_19

Abstract


Up to now, various signal processing techniques have been used to predict protein-coding genes that are unsuitable for predicting ribonucleic acids (RNAs). Modeling a gene network can be employed in various fields, such as the discovery of new drugs, reducing the side effects of treatment methods, further identifying genetic diseases and treatments for genetic disorders by influencing the activity of effectual genes, preventing the growth of unwanted tissues via growth weakening and cell reproduction, and also for many other applications in the fields of medicine and agriculture. The main purpose of this study was to design a suitable algorithm based on context-sensitive hidden Markov models (csHMMs) for the alignment of secondary structures of RNAs, which can identify noncoding RNAs. In this model, several RNA families are compared, and their existing similarities are measured. An expectation-maximization algorithm is used to estimate the model's parameters. This algorithm is the standard algorithm to maximize HMM parameters. The alignment results for RNAs belonging to the hepatitis delta virus family showed an accuracy of 83.33%, a specificity of 89%, and a sensitivity of 97%, and RNAs belonging to the purine family showed an accuracy of 65%, a specificity of 76%, and a sensitivity of 76%. The results show that csHMMs, in addition to aligning the primary sequences of RNAs, would align the secondary structures of RNAs with high accuracy.


Keywords


Context-sensitive hidden Markov models, expectation-maximization algorithm, noncoding ribonucleic acids, structural alignment

Full Text:

PDF

References


Miranda-Rios J, Navarro M, Soberón M. A conserved RNA structure (thi box) is involved in regulation of thiamin biosynthetic gene expression in bacteria. Proc Natl Acad Sci U S A 2001;98:9736-41.

Moulton V. Tracking down noncoding RNAs. Proc Natl Acad Sci U S A 2005;102:2269-70.

Fiannaca A, La Rosa M, La Paglia L, Rizzo R, Urso A. NRC: Non-coding RNA classifier based on structural features. BioData Min 2017;10:27.

Schneider HW, Raiol T, Brigido MM, Walter ME, Stadler PF. A support vector machine based method to distinguish long non-coding RNAs from protein coding transcripts. BMC Genomics 2017;18:804.

Byung JY, Vaidyanathan PP. Structural alignment of RNAs using profile-csHMMs and its application to RNA homology search: Overview and new results. Automat Control IEEE Trans 2008;53:10-25.

Byung JY. Signal Processing Methods for Genomic Sequence Analysis. Thesis Presented to the California Institute of Technology for the Degree of Doctor of Philosophy. Pasadena, California; 2007.

Eddy SR, Durbin R. RNA sequence analysis using covariance models. Nucleic Acids Res 1994;22:2079-88.

Hiroshi M, Kengo S, Yasubumi S. Pair stochastic tree adjoining grammars for aligning and predicting pseudoknot RNA structure. Bioinformatics 2005;21:2611-7.

Lari K, Young SJ. The estimation of stochastic context-free grammars using the inside-outside algorithm. Comput Speech Lang 1990;4:35-56.

Ruan J, Stormo GD, Zhang W. An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots. Bioinformatics 2004;20:58-66.

Shubhra SR, Sankar KP. RNA secondary structure prediction using soft computing. IEEE/ACM Trans Comput Biol Bioinform 2013;10.

Christopher N. Hidden Markov Models with Applications to DNA Sequence Analysis, STOR-i; May, 2011.

Available from: http://rfam.sanger.ac.uk/; http://rfam.janelia.org/. [Last accessed on 2015 Jul 14; Last accessed on 2019 Jun 22].

Richard D, Sean E, Anders K, Graeme M. Biological Sequence Analysis. Cambridge, UK: Cambridge University Press; 1998.

Byung JY. Comparative analysis of biological networks: Hidden Markov model and Markov chain-based approach. Signal Process Mag IEEE 2012;29:22-34.


Refbacks

  • There are currently no refbacks.


 

  https://e-rasaneh.ir/Certificate/22728

https://e-rasaneh.ir/

ISSN : 2228-7477