IGLOSS: iterative gapless local similarity search

Rabar, Braslav; Zagorščak, Maja; Ristov, Strahil; Rosenzweig, Martin; Goldstein, Pavle

doi:10.1093/bioinformatics/btz086

Cited by 1 publication

(3 citation statements)

References 12 publications

(6 reference statements)


“…In order to test the method, we applied it to responses from three iterative motif scanners - PSI-BLAST (PB) ([1]), JackHMMER (JH) ([6]) and IGLOSS (IG) ([9]) - and compared the maximal clique with the original response. As in ([9]), scanners were applied to five plant proteomes - Arabidopsis thaliana (AT, v. TAIR9), Oryza sativa (OS, v. MSU v7), Solanum tuberosum (ST, v. ITAG1), Solanum lycopersicum (SL, v. ITAG2.3) and Beta vulgaris (BV, v. KWS2320) - where we searched for members of an extensively studied, motif characterized protein family - GDSL lipases. GDSL lipases belong to lipid hydrolyzing enzymes that exhibit a GDSL motif.…”

Section: Tests and Results (mentioning)

confidence: 99%

“…Block I contains the main characteristic motif (PROSITE:PS01098) ([10]) from which the main search query of 10 amino acids was constructed. As in [9], the condition positive set was determined by processing the information from GoMapMan resource [12].…”

Section: Tests and Results (mentioning)

confidence: 99%

“…Both steps can be a source of errors - an inaccurate ranking scheme, with a low threshold, will generate a huge response, causing a large type I-like error, whereas a high threshold, while testing positive on strongest examples, might miss the candidates with a slightly weaker signal. Various applications deal with these problems in various ways - for example, JackHMMer ([6]) uses - effectively - several scoring functions, each with its own ranking and threshold, while IGLOSS ([9]) uses detailed parameter estimation to minimize both issues.…”

Section: Introduction (mentioning)

confidence: 99%


A Clique-Based Method for Improving Motif Scanning Accuracy

Rabar, Nižetić, Goldstein

2022

Preprint

Self Cite


Background: Motif scanning is a very common method in bioinformatics. Its objective is to detect motifs of sufficient similarity to the query, which is then used to determine family membership, or structural or functional features or assignments. Given this variety of uses, the accuracy of motif scanning procedures is of great importance.

Results: We present a new approach for improving motif scanning accuracy, based on analysis of in-between similarity. Given a set of motifs obtained from a scanning process, we construct an associated weighted graph. We also compute the expected weight of an edge in such a graph. It turns out that restricting results to the maximal clique in the graph, computed with respect to the expected weight, greatly increases precision and hence improves the accuracy of the scan. We tested the method on an ungapped motif-characterized protein family from five plant proteomes. The method was applied to three iterative motif scanners (PSI-BLAST, JackHMMer and IGLOSS) with very good results.

Conclusions: We presented a method for improving protein motif scanning accuracy, and have successfully applied it in several situations. The method has wider implications for general pattern recognition and feature extraction strategies, as long as one can determine the expected similarity between objects under consideration.
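The clique-based filtering described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the edge weight, how the expected weight is computed, or how the clique is taken "with respect to" it, so everything concrete here is an assumption: `similarity` is a hypothetical weight (count of identical positions), `expected_weight` is simply passed in as a number, edges are kept only when their weight exceeds it, and the largest clique is found with the standard Bron-Kerbosch algorithm. All function names are illustrative and not taken from the paper.

```python
from itertools import combinations


def similarity(a: str, b: str) -> int:
    """Hypothetical edge weight: identical positions in two equal-length ungapped motifs."""
    return sum(x == y for x, y in zip(a, b))


def build_graph(motifs, expected_weight):
    """Assumed reading: keep an edge only if its weight exceeds the expected weight."""
    adj = {i: set() for i in range(len(motifs))}
    for i, j in combinations(range(len(motifs)), 2):
        if similarity(motifs[i], motifs[j]) > expected_weight:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def largest_clique(adj):
    """Bron-Kerbosch with pivoting; returns the vertices of one maximum-size clique."""
    best = []

    def expand(r, p, x):
        nonlocal best
        if not p and not x:
            # r is a maximal clique; keep it if it is the largest seen so far
            if len(r) > len(best):
                best = list(r)
            return
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r + [v], p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand([], set(adj), set())
    return sorted(best)


def filter_hits(motifs, expected_weight):
    """Restrict a scanner response to the motifs in the largest clique."""
    clique = largest_clique(build_graph(motifs, expected_weight))
    return [motifs[i] for i in clique]


# Made-up example response: three similar motifs and one unrelated hit.
hits = ["GDSLSDTGNN", "GDSLVDTGNN", "GDSLSDAGNN", "AKRTWPQYHC"]
kept = filter_hits(hits, expected_weight=5)
```

In this toy input the three mutually similar motifs form the clique and the unrelated hit is discarded, which mirrors the precision gain the abstract reports. A real implementation would need the authors' actual weight and expected-weight definitions.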
