Performance Improvement of the Goertzel Algorithm in Estimating of Protein Coding Regions Using Modified Antinotch Filter and Linear Predictive Coding Model

Mahsa Saffari Farsani, Masoud Reza Aghabozorgi Sahhaf, Vahid Abootalebi



The aim of this paper is to improve the performance of the conventional Goertzel algorithm in determining the protein coding regions in deoxyribonucleic acid (DNA) sequences. First, the symbolic DNA sequences are converted into numerical signals using the electron-ion interaction potential (EIIP) method. Then, by combining a modified anti-notch filter with a linear predictive coding model, we propose an efficient algorithm that improves the performance of the Goertzel algorithm in estimating genetic regions. Finally, a thresholding method is applied to precisely identify the exon and intron regions. The proposed algorithm is applied to several genes, including genes available in the BG570 and HMR195 databases, and the results are compared with those of other methods using nucleotide-level evaluation criteria. The results demonstrate that the proposed method reduces the number of nucleotides incorrectly estimated to be in noncoding regions. In addition, the area under the receiver operating characteristic curve improves by factors of 1.35 and 1.12 on the HMR195 and BG570 datasets, respectively, in comparison with the conventional Goertzel algorithm.


Anti-notch filter, deoxyribonucleic acid, Goertzel, linear predictive coding, thresholding
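
As a rough illustration of the baseline that the abstract refers to, the following Python sketch maps a DNA string to an EIIP signal, evaluates the period-3 spectral component with the conventional Goertzel recursion over a sliding window, and applies a simple threshold. It does not include the modified anti-notch filter or the linear predictive coding stage, which are the contribution of the paper. The function names, the window length of 351, and the "fraction of the maximum" threshold rule are assumptions made for illustration only; the EIIP values are those commonly quoted in the literature.

```python
import math

# EIIP values commonly quoted for the four nucleotides.
EIIP = {"A": 0.1260, "C": 0.1340, "G": 0.0806, "T": 0.1335}


def eiip_encode(seq):
    """Convert a symbolic DNA sequence into a numerical EIIP signal."""
    return [EIIP[base] for base in seq.upper()]


def goertzel_power(samples, k):
    """Squared magnitude of DFT bin k of `samples` via the Goertzel recursion."""
    n = len(samples)
    coeff = 2.0 * math.cos(2.0 * math.pi * k / n)
    s1 = s2 = 0.0
    for x in samples:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
    return s1 * s1 + s2 * s2 - coeff * s1 * s2


def period3_profile(seq, window=351):
    """Period-3 power for each sliding window (window assumed a multiple of 3)."""
    x = eiip_encode(seq)
    k = window // 3  # DFT bin corresponding to the period-3 component
    return [goertzel_power(x[i:i + window], k)
            for i in range(len(x) - window + 1)]


def threshold(profile, fraction=0.5):
    """Hypothetical rule: flag positions whose power reaches a fraction of the peak."""
    cutoff = fraction * max(profile)
    return [p >= cutoff for p in profile]
```

Positions flagged by `threshold` would be read as candidate coding (exon) positions and the rest as noncoding (intron) positions; in practice the window length and the threshold rule would need to be tuned on annotated data.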

