Zero-shot Entity Linking with Efficient Long Range Sequence Modeling

This paper considers the problem of zero-shot entity linking, in which an entity linked at test time may not be present in training. Following prevailing BERT-based research efforts, we find that a simple yet effective approach is to extend long-range sequence modeling. Unlike many previous methods, our method does not require expensive pre-training of BERT with long position embeddings. Instead, we propose an efficient position embedding initialization method called Embedding-repeat, which initializes larger position embeddings based on BERT-Base. On Wikia's zero-shot EL dataset, our method improves the SOTA from 76.06% to 79.08%, and on its long data, the corresponding improvement is from 74.57% to 82.14%. Our experiments suggest the effectiveness of long-range sequence modeling without retraining the BERT model.
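The abstract does not spell out how Embedding-repeat builds the larger table. One plausible reading of the name, offered here only as an assumption, is that the pretrained position-embedding rows are repeated cyclically until the longer sequence length is covered. The function name `repeat_position_embeddings` and the toy table below are hypothetical and are not taken from the paper.

```python
def repeat_position_embeddings(base, target_len):
    """Build a longer position-embedding table by cycling through the rows of `base`.

    Assumption: this is one possible interpretation of "Embedding-repeat",
    not the authors' confirmed procedure.
    """
    if not base:
        raise ValueError("base table must be non-empty")
    if target_len < len(base):
        raise ValueError("target_len must be at least len(base)")
    # Row i of the new table is a copy of row (i mod len(base)) of the original.
    return [list(base[i % len(base)]) for i in range(target_len)]


# Toy stand-in for a pretrained table: 4 positions, 3 dimensions.
# (A BERT-Base table would have 512 positions and 768 dimensions.)
base = [[float(p), p + 0.1, p + 0.2] for p in range(4)]
extended = repeat_position_embeddings(base, 8)
```

Under this reading, the first block of positions keeps its pretrained values unchanged, and later positions start from values the model has already learned rather than from random initialization.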


Improving Formality Style Transfer with Context-Aware Rule Injection

Yao, Yu. 2021


Models pre-trained on large-scale regular text corpora often do not work well for user-generated data, where the language styles differ significantly from mainstream text. Here we present Context-Aware Rule Injection (CARI), an innovative method for formality style transfer (FST). CARI injects multiple rules into an end-to-end BERT-based encoder and decoder model and learns to select optimal rules based on context. The intrinsic evaluation showed that CARI achieved a new highest performance on the FST benchmark dataset. Our extrinsic evaluation showed that CARI can greatly improve regular pretrained models' performance on several tweet sentiment analysis tasks.


The impact of preprint servers in the formation of novel ideas

Satish, Yao, Drozdov et al. 2020. Preprint


We study whether novel ideas in the biomedical literature appear first in preprints or in traditional journals. We develop a Bayesian method to estimate the time of appearance of a phrase in the literature and apply it to a number of phrases, both automatically extracted and suggested by experts. We find that at present most phrases appear first in traditional journals, but a number of phrases first appear on preprint servers. A comparison of the general composition of texts from bioRxiv and traditional journals shows a growing trend of bioRxiv being predictive of traditional journals. We discuss the application of the method to related problems.


Named Entity Location Prediction Combining Twitter and Web

Liu, Shen, Yao et al. 2021. IEEE Trans. Knowl. Data Eng.


Automated Identification of Eviction Status from Electronic Health Record Notes

Yao, Tsai, Liu et al. 2022. Preprint

Zonghai Yao

Zero-shot Entity Linking with Efficient Long Range Sequence Modeling
