2010
DOI: 10.1093/bioinformatics/btq270
Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics

Abstract: Motivation: Since database retrieval is a fundamental operation, the measurement of retrieval efficacy is critical to progress in bioinformatics. This article points out some issues with current methods of measuring retrieval efficacy and suggests some improvements. In particular, many studies have used the pooled receiver operating characteristic for n irrelevant records (ROCn) score, the area under the ROC curve (AUC) of a ‘pooled’ ROC curve, truncated at n irrelevant records. Unfortunately, the pooled ROCn …

Cited by 28 publications (22 citation statements)
References 27 publications
“…In this study, we utilize the Threshold Average Precision (TAP) [14] method as the evaluation criterion for retrieval efficacy. The TAP method calculates the median Average Precision-Recall with a moderate adjustment for irrelevant sequences just before the threshold.…”
Section: Methods
confidence: 99%
“…Usually, the area under the receiver operating characteristic curve (AUC) score is the most popular criterion for the task. However, it was shown that AUC may fail to faithfully reflect the actual quality when the AUC scores are pooled together to evaluate a retrieval system for multiple independent retrieval tasks [16]. AUC is not robust against outlier results.…”
Section: Predicting Top Highly Cited Articles
confidence: 99%
“…Finally, AUC does not always decrease as the threshold is relaxed to include the entire retrieval list. To address these issues, a new evaluation method called the threshold average precision (TAP-k) was proposed [16]. We will adopt this new method to evaluate the metrics on their performance for predicting the top 10% of highly cited articles.…”
Section: Predicting Top Highly Cited Articles
confidence: 99%
“…The implicit assumption is that a curator could use the ranking to decide where to stop looking at the results, therefore a better ranking provides a better user experience. A recently proposed alternative measure of the ranking of the results is the "Threshold Average Precision" (TAP-k) [9], which (in slightly simplified terms) averages precision for the results above a given error threshold. The TAP-k metric is easier to interpret and directly relevant for the end user, who in most cases would not be willing to inspect a long list of results containing many false positives.…”
Section: A Evaluation Measures
confidence: 99%
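The "slightly simplified" description in the last citation statement — averaging precision at each relevant record above an error threshold, with a terminal-precision penalty for irrelevant records just before the cutoff — can be sketched in Python. This is an illustrative simplification, not the authors' reference implementation: the function name, the rank-based cutoff (rather than an E-value threshold chosen from a quantile over queries), and the exact terminal-precision term are assumptions for the sake of the example.

```python
def tap_simplified(relevance, cutoff):
    """Simplified threshold average precision for a single ranked list.

    relevance: booleans in rank order (True = relevant record).
    cutoff: number of top-ranked records retained by the threshold.

    Averages the precision at each relevant record within the cutoff,
    adds the precision at the cutoff itself (penalizing trailing
    irrelevant records just before the threshold), and divides by
    (total relevant + 1), so a perfect ranking scores 1.0.
    """
    retained = relevance[:cutoff]
    total_relevant = sum(relevance)
    if total_relevant == 0:
        return 0.0

    precisions = []
    hits = 0
    for rank, rel in enumerate(retained, start=1):
        if rel:
            hits += 1
            precisions.append(hits / rank)

    # Terminal precision at the threshold: the penalty term for
    # irrelevant records retrieved just before the cutoff.
    terminal = hits / cutoff if cutoff else 0.0
    return (sum(precisions) + terminal) / (total_relevant + 1)


# A perfect ranking scores 1.0; interleaved irrelevant records lower it.
print(tap_simplified([True, True], cutoff=2))
print(tap_simplified([True, False, True, False], cutoff=4))
```

Note the contrast with pooled AUC discussed in the earlier citation statements: this per-list score is bounded by the user-visible cutoff, so it cannot be inflated by records the curator would never inspect.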