Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
DOI: 10.18653/v1/2022.naacl-main.226
Boosted Dense Retriever

Abstract: We propose DrBoost, a dense retrieval ensemble inspired by boosting. DrBoost is trained in stages: each component model is learned sequentially and specialized by focusing only on retrieval mistakes made by the current ensemble. The final representation is the concatenation of the output vectors of all the component models, making it a drop-in replacement for standard dense retrievers at test time. DrBoost enjoys several advantages compared to standard dense retrieval models. It produces representations which…
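The staged training described in the abstract can be sketched as a simple boosting loop: each round trains a new component only on the queries the current ensemble still retrieves incorrectly, and the test-time embedding is the concatenation of all component outputs. The sketch below is illustrative only, not the authors' implementation: `train_component` stands in for training a real weak encoder (here it is just a random projection), and all names and dimensions are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

SUB_DIM = 4  # per-component embedding size (hypothetical)

def encode(W, x):
    """Project raw features into one component's sub-embedding."""
    return x @ W

def train_component(q_feats, mistake_idx):
    """Stand-in for learning one weak encoder on the current mistakes.

    A real DrBoost component would be a trained dense encoder; a random
    projection is used here purely to show the boosting control flow.
    """
    return rng.normal(size=(q_feats.shape[1], SUB_DIM))

def drboost_train(q_feats, p_feats, gold, n_rounds=3):
    components = []
    mistakes = np.arange(len(q_feats))  # round 1 sees every query
    for _ in range(n_rounds):
        components.append(train_component(q_feats, mistakes))
        # Ensemble representation = concatenation of all sub-embeddings,
        # so the result is a drop-in replacement for a single retriever.
        q_emb = np.concatenate([encode(W, q_feats) for W in components], axis=1)
        p_emb = np.concatenate([encode(W, p_feats) for W in components], axis=1)
        preds = (q_emb @ p_emb.T).argmax(axis=1)  # nearest passage by dot product
        mistakes = np.where(preds != gold)[0]     # next round focuses here
        if len(mistakes) == 0:
            break
    return components

# Toy data: query i should retrieve passage i.
q = rng.normal(size=(10, 16))
p = q + 0.1 * rng.normal(size=(10, 16))
comps = drboost_train(q, p, gold=np.arange(10))
final_dim = SUB_DIM * len(comps)  # concatenated test-time embedding size
```

Because the final index stores only the concatenated vectors, downstream search code needs no awareness that the embedding came from an ensemble.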

Cited by 1 publication (3 citation statements). References 34 publications.
“…Currently, each component model in our architecture naively samples a subset of negative edges, whereas DrBoost could intelligently adjust the sampling distributions, potentially leading to improved results. 8 Finally, we plan to further enhance the model by incorporating user feedback through an application hosting this model. User interactions provide multiple avenues for model improvement.…”
Section: Results (confidence: 99%)
“…Recent works have been designed to tackle the question-answering task and require specific questions, answers, and passages to be organized within the dataset. 8,14 While our document retrieval task is related, the concern of sensitive organizational data prevents us from using automatic labeling techniques and, in general, leveraging the learned priors from internet-scale models. 15,16 The alternative is a labor-intensive data annotation effort engaging many subject matter experts.…”
Section: Results (confidence: 99%)