Pat Verga scite author profile

Pat Verga

5Publications

71Citation Statements Received

102Citation Statements Given

How they've been cited

How they cite others

102

Affiliations

Google (United States)

Publications

Order By: Most citations

Facts as Experts: Adaptable and Interpretable Neural Memory over Symbolic Knowledge

Verga¹,

Sun²,

Soares³

et al. 2020

Preprint

View full text Add to dashboard Cite

Massive language models are the core of modern NLP modeling and have been shown to encode impressive amounts of commonsense and factual information. However, that knowledge exists only within the latent parameters of the model, inaccessible to inspection and interpretation, and even worse, factual information memorized from the training corpora is likely to become stale as the world changes. Knowledge stored as parameters will also inevitably exhibit all of the biases inherent in the source materials. To address these problems, we develop a neural language model that includes an explicit interface between symbolically interpretable factual information and subsymbolic neural knowledge. We show that this model dramatically improves performance on two knowledge-intensive question-answering tasks. More interestingly, the model can be updated without re-training by manipulating its symbolic representations. In particular this model allows us to add new facts and overwrite existing ones in ways that are not possible for earlier models.

show abstract

Adaptable and Interpretable Neural MemoryOver Symbolic Knowledge

Verga¹,

Sun²,

Soares³

et al. 2021

View full text Add to dashboard Cite

Past research has demonstrated that large neural language models (LMs) encode surprising amounts of factual information: however, augmenting or modifying this information requires modifying a corpus and retraining, which is computationally expensive. To address this problem, we develop a neural LM that includes an interpretable neuro-symbolic KB in the form of a "fact memory". Each element of the fact memory is formed from a triple of vectors, where each vector corresponds to a KB entity or relation. Our LM improves performance on knowledge-intensive question-answering tasks, sometimes dramatically, including a 27 point increase in one setting of WebQuestionsSP over a state-of-the-art open-book model, despite using 5% of the parameters. Most interestingly, we demonstrate that the model can be modified, without any re-training, by updating the fact memory.

show abstract

Unsupervised Latent Tree Induction with Deep Inside-Outside Recursive Autoencoders

Drozdov¹,

Verga²,

Yadav³

et al. 2019

Preprint

View full text Add to dashboard Cite

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

Bohnet¹,

Trần²,

Verga³

et al. 2022

Preprint

View full text Add to dashboard Cite

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering

Chen¹,

Verga²,

Jong³

et al. 2022

Preprint

View full text Add to dashboard Cite

Existing state-of-the-art methods for opendomain question-answering (ODQA) generally used a open book approach, in which information is retrieved from a large text corpus or knowledge base (KB), and then reasoned with to produce an answer. A recent alternative is to retrieve from a collection of previously-generated question-answer pairs. This has several practical advantages, including being more memory-and computeefficient. Question-answer pairs are also appealing in that they seem to be an intermediate between text and KB triples: like KB triples, they usually concisely express a single relationship, but like text, they have good coverage. We describe a new QA system which augments a text-to-text model with a large memory of question-answer pairs, and a new pretraining task for the latent step of question retrieval. The pre-training task substantially simplifies training, and greatly improves performance on smaller QA benchmarks. Unlike prior systems of this sort, our QA system can also answer multi-hop questions that do not explicitly appear in the collection of stored question-answer pairs.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Pat Verga

Facts as Experts: Adaptable and Interpretable Neural Memory over Symbolic Knowledge

Adaptable and Interpretable Neural MemoryOver Symbolic Knowledge

Unsupervised Latent Tree Induction with Deep Inside-Outside Recursive Autoencoders

Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering

Contact Info

Product

Resources

About