Yuxiang Wu scite author profile

Recent progress in pretraining language models on large textual corpora led to a surge of improvements for downstream NLP tasks. Whilst learning linguistic knowledge, these models may also be storing relational knowledge present in the training data, and may be able to answer queries structured as "fillin-the-blank" cloze statements. Language models have many advantages over structured knowledge bases: they require no schema engineering, allow practitioners to query about an open class of relations, are easy to extend to more data, and require no human supervision to train. We present an in-depth analysis of the relational knowledge already present (without fine-tuning) in a wide range of state-of-theart pretrained language models. We find that (i) without fine-tuning, BERT contains relational knowledge competitive with traditional NLP methods that have some access to oracle knowledge, (ii) BERT also does remarkably well on open-domain question answering against a supervised baseline, and (iii) certain types of factual knowledge are learned much more readily than others by standard language model pretraining approaches. The surprisingly strong ability of these models to recall factual knowledge without any fine-tuning demonstrates their potential as unsupervised open-domain QA systems. The code to reproduce our analysis is available at https: //github.com/facebookresearch/LAMA.

show abstract

End-to-End Adversarial Memory Network for Cross-domain Sentiment Classification

Zhang

Wei

et al. 2017

211

121

View full text Add to dashboard Cite

Domain adaptation tasks such as cross-domain sentiment classification have raised much attention in recent years. Due to the domain discrepancy, a sentiment classifier trained in a source domain may not work well when directly applied to a target domain. Traditional methods need to manually select pivots, which behave in the same way for discriminative learning in both domains. Recently, deep learning methods have been proposed to learn a representation shared by domains. However, they lack the interpretability to directly identify the pivots. To address the problem, we introduce an endto-end Adversarial Memory Network (AMN) for cross-domain sentiment classification. Unlike existing methods, the proposed AMN can automatically capture the pivots using an attention mechanism. Our framework consists of two parametershared memory networks with one for sentiment classification and the other for domain classification. The two networks are jointly trained so that the selected features minimize the sentiment classification error and at the same time make the domain classifier indiscriminative between the representations from the source or target domains. Moreover, unlike deep learning methods that cannot tell which words are the pivots, AMN can offer a direct visualization of them. Experiments on the Amazon review dataset demonstrate that AMN can significantly outperform state-of-the-art methods.

show abstract

Electron transport layer-free planar perovskite solar cells: Further performance enhancement perspective from device simulation

Huang

Sun

Chang

et al. 2016

Solar Energy Materials and Solar Cells

202

View full text Add to dashboard Cite

PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them

Lewis

Liu

et al. 2021

View full text Add to dashboard Cite

Open-domain Question Answering models that directly leverage question-answer (QA) pairs, such as closed-book QA (CBQA) models and QA-pair retrievers, show promise in terms of speed and memory compared with conventional models which retrieve and read from text corpora. QA-pair retrievers also offer interpretable answers, a high degree of control, and are trivial to update at test time with new knowledge. However, these models fall short of the accuracy of retrieve-and-read systems, as substantially less knowledge is covered by the available QA-pairs relative to text corpora like Wikipedia. To facilitate improved QA-pair models, we introduce Probably Asked Questions (PAQ), a very large resource of 65M automatically generated QA-pairs. We introduce a new QA-pair retriever, RePAQ, to complement PAQ. We find that PAQ preempts and caches test questions, enabling RePAQ to match the accuracy of recent retrieve-and-read models, whilst being significantly faster. Using PAQ, we train CBQA models which outperform comparable baselines by 5%, but trail RePAQ by over 15%, indicating the effectiveness of explicit retrieval. RePAQ can be configured for size (under 500MB) or speed (over 1K questions per second) while retaining high accuracy. Lastly, we demonstrate RePAQ’s strength at selective QA, abstaining from answering when it is likely to be incorrect. This enables RePAQ to “back-off” to a more expensive state-of-the-art model, leading to a combined system which is both more accurate and 2x faster than the state-of-the-art model alone.

show abstract

Organic–inorganic hybrid CH₃NH₃PbI₃ perovskite materials as channels in thin-film field-effect transistors

et al. 2016

RSC Adv.

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yuxiang Wu

Language Models as Knowledge Bases?

End-to-End Adversarial Memory Network for Cross-domain Sentiment Classification

Electron transport layer-free planar perovskite solar cells: Further performance enhancement perspective from device simulation

PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them

Organic–inorganic hybrid CH₃NH₃PbI₃ perovskite materials as channels in thin-film field-effect transistors

Contact Info

Product

Resources

About

Yuxiang Wu

Language Models as Knowledge Bases?

End-to-End Adversarial Memory Network for Cross-domain Sentiment Classification

Electron transport layer-free planar perovskite solar cells: Further performance enhancement perspective from device simulation

PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them

Organic–inorganic hybrid CH3NH3PbI3 perovskite materials as channels in thin-film field-effect transistors

Contact Info

Product

Resources

About

Organic–inorganic hybrid CH₃NH₃PbI₃ perovskite materials as channels in thin-film field-effect transistors