Measuring Similarity among Protein Sequences Using a New Descriptor

Abo-Elkhier, Mervat M.; Elwahaab, Marwa A Abd; Maaty, M.I Abo el

doi:10.1155/2019/2796971

Cited by 7 publications

(4 citation statements)

References 38 publications

“…To perform quantitative measurement of homology [51], the parameters Identity, Similarity, and Alignment Score of the HELIOS outputs are calculated through simulation studies, as reported in Tables 1–3, respectively, assuming the “Nine ND5 protein sequences dataset” [53]. While the Identity reports the number of exactly matched characters of two sequences (in percentage), the Similarity measures the resemblance of two compared sequences.…”

Section: Discussion and Results (mentioning)

confidence: 99%
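
The Identity and Similarity parameters described in the statement above can be illustrated with a minimal sketch. It assumes the two sequences are already aligned to equal length with `-` as the gap character, and that both percentages are taken over the full alignment length; the citing paper may normalize differently. The `SIMILAR_GROUPS` table and both function names are hypothetical, chosen only for illustration, and are not taken from HELIOS or from the cited paper.

```python
# Hypothetical groups of physico-chemically similar amino acids.
# This grouping is an assumption for illustration, not the cited scoring scheme.
SIMILAR_GROUPS = [set("ILVM"), set("FWY"), set("KRH"), set("DE"), set("ST"), set("NQ")]


def _check(a: str, b: str) -> None:
    """Require two pre-aligned sequences of equal, non-zero length."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must not be empty")


def percent_identity(a: str, b: str) -> float:
    """Percentage of alignment columns holding the same residue (gaps never match)."""
    _check(a, b)
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def percent_similarity(a: str, b: str) -> float:
    """Percentage of columns that are identical or fall in the same residue group."""
    _check(a, b)

    def similar(x: str, y: str) -> bool:
        if x == "-" or y == "-":
            return False
        return x == y or any(x in g and y in g for g in SIMILAR_GROUPS)

    hits = sum(1 for x, y in zip(a, b) if similar(x, y))
    return 100.0 * hits / len(a)


if __name__ == "__main__":
    s1, s2 = "MKV-L", "MRVAL"
    print(percent_identity(s1, s2), percent_similarity(s1, s2))
```

In this toy alignment, K and R count toward Similarity but not Identity, which is why Similarity can never be lower than Identity under these assumptions.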

“…Additionally, we consider twelve different datasets for this evaluation, represented in S1 Text – S12 Text; while the input sequences of each dataset are represented in Table A3 in its corresponding file. Moreover, the quantitative measurement of homology of all aforementioned algorithms are reported in Tables A4-A33 in the S1 Text – S12 Text for twelve different datasets [53, 55–61]. By the way, as a brief report, the average value of each parameter, achieved by the aforementioned algorithms, are reported in Table 7 for the twelve datasets.…”

Section: Discussion and Results (mentioning)

confidence: 99%

“…See S1 Text for the detailed results. 2 Dataset 2: Nine beta globin protein sequences dataset [53]. See S2 Text for the detailed results.…”

Section: Discussion and Results (mentioning)

confidence: 99%

HELIOS: High-speed sequence alignment in optics

2022

In response to the imperfections of current sequence alignment methods, originated from the inherent serialism within their corresponding electrical systems, a few optical approaches for biological data comparison have been proposed recently. However, due to their low performance, raised from their inefficient coding scheme, this paper presents a novel all-optical high-throughput method for aligning DNA, RNA, and protein sequences, named HELIOS. The HELIOS method employs highly sophisticated operations to locate character matches, single or multiple mutations, and single or multiple indels within various biological sequences. On the other hand, the HELIOS optical architecture exploits high-speed processing and operational parallelism in optics, by adopting wavelength and polarization of optical beams. For evaluation, the functionality and accuracy of the HELIOS method are approved through behavioral and optical simulation studies, while its complexity and performance are estimated through analytical computation. The accuracy evaluations indicate that the HELIOS method achieves a precise pairwise alignment of two sequences, highly similar to those of Smith-Waterman, Needleman-Wunsch, BLAST, MUSCLE, ClustalW, ClustalΩ, T-Coffee, Kalign, and MAFFT. According to our performance evaluations, the HELIOS optical architecture outperforms all alternative electrical and optical algorithms in terms of processing time and memory requirement, relying on its highly sophisticated method and optical architecture. Moreover, the employed compact coding scheme highly escalates the number of input characters, and hence, it offers reduced time and space complexities, compared to the electrical and optical alternatives. It makes the HELIOS method and optical architecture highly applicable for biomedical applications.

“…The construction of these graphic curves is based on the allocation of individual bases of four different sine (or tangent) functions. In 2019, Abo-Elkhier et al numerically represented each amino acid in the protein sequence and proposed a new 2-D graphical representation method [17]. They introduced a new descriptor that consisted of a vector (Ā_t, SA_t) consisting of the mean and standard deviation from the total number of protein sequences.…”

Section: Pattern Similarity Analysis Of Biological Sequences (mentioning)

confidence: 99%
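
The idea of a mean and standard deviation descriptor mentioned in the statement above can be sketched roughly as follows. This is not the method of Abo-Elkhier et al.: the numeric code assigned to each amino acid (`AA_CODE`, the residue's 1-based position in an alphabetical list) is a placeholder assumption, the function names are hypothetical, and comparing descriptors by Euclidean distance is one plausible choice rather than the published one.

```python
import math
import statistics

# Placeholder numeric encoding: 1-based index in an alphabetical residue list.
# The actual numerical representation in the cited paper is not given here.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
AA_CODE = {aa: i + 1 for i, aa in enumerate(AMINO_ACIDS)}


def descriptor(seq: str) -> tuple[float, float]:
    """Return (mean, population standard deviation) of the encoded sequence."""
    if not seq:
        raise ValueError("sequence must not be empty")
    values = [AA_CODE[aa] for aa in seq]  # raises KeyError on unknown residues
    return statistics.fmean(values), statistics.pstdev(values)


def descriptor_distance(seq_a: str, seq_b: str) -> float:
    """Euclidean distance between two descriptors; smaller means more similar."""
    return math.dist(descriptor(seq_a), descriptor(seq_b))


if __name__ == "__main__":
    print(descriptor("ACDE"))
    print(descriptor_distance("ACDE", "ACDF"))
```

A two-number summary like this ignores residue order, so two different sequences can share a descriptor; any real use would depend on the encoding and graphical construction defined in the original paper.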

Genetic Similarity Analysis Based on Positive and Negative Sequence Patterns of DNA

Zhao et al. 2020

Symmetry

Similarity analysis of DNA sequences can clarify the homology between sequences and predict the structure of, and relationship between, them. At the same time, the frequent patterns of biological sequences explain not only the genetic characteristics of the organism, but they also serve as relevant markers for certain events of biological sequences. However, most of the aforementioned biological sequence similarity analysis methods are targeted at the entire sequential pattern, which ignores the missing gene fragment that may induce potential disease. The similarity analysis of such sequences containing a missing gene item is a blank. Consequently, some sequences with missing bases are ignored or not effectively analyzed. Thus, this paper presents a new method for DNA sequence similarity analysis. Using this method, we first mined not only positive sequential patterns, but also sequential patterns that were missing some of the base terms (collectively referred to as negative sequential patterns). Subsequently, we used these frequent patterns for similarity analysis on a two-dimensional plane. Several experiments were conducted in order to verify the effectiveness of this algorithm. The experimental results demonstrated that the algorithm can obtain various results through the selection of frequent sequential patterns and that accuracy and time efficiency was improved.