Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
DOI: 10.18653/v1/2020.emnlp-main.407

Multilevel Text Alignment with Cross-Document Attention

Abstract: Text alignment finds application in tasks such as citation recommendation and plagiarism detection. Existing alignment methods operate at a single, predefined level and cannot learn to align texts at, for example, sentence and document levels. We propose a new learning approach that equips previously established hierarchical attention encoders for representing documents with a cross-document attention component, enabling structural comparisons across different levels (document-to-document and sentence-to-document). […]
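To make the abstract's idea concrete, here is a minimal sketch of one cross-document attention step in Python/PyTorch: sentence vectors from one document attend over the sentence vectors of another, which is the kind of cross-level comparison the paper describes. The function name, tensor shapes, and scaled dot-product form are illustrative assumptions, not the authors' implementation.

```python
# Minimal sketch of cross-document attention between two encoded documents.
# Names, shapes, and the scaled dot-product form are illustrative assumptions.
import torch
import torch.nn.functional as F


def cross_document_attention(queries, keys, values):
    """Attend from units of one document (e.g., its sentence vectors) over the
    units of another document, returning contextualized query vectors.

    queries: (n_query_units, d)  e.g., sentence vectors of document A
    keys:    (n_key_units, d)    e.g., sentence vectors of document B
    values:  (n_key_units, d)
    """
    d = queries.size(-1)
    scores = queries @ keys.transpose(0, 1) / d ** 0.5  # (n_q, n_k) similarity
    weights = F.softmax(scores, dim=-1)                 # attention over document B
    return weights @ values                             # (n_q, d)


# Toy usage: align 3 sentences of document A against 5 sentences of document B.
doc_a_sentences = torch.randn(3, 64)
doc_b_sentences = torch.randn(5, 64)
aligned = cross_document_attention(doc_a_sentences, doc_b_sentences, doc_b_sentences)
print(aligned.shape)  # torch.Size([3, 64])
```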

Cited by 13 publications (51 citation statements). References 34 publications.
“…Despite these factors, BERT-HAN's large performance drop on PAN is still surprising. However, we emphasize that even when using Zhou et al (2020)'s original numbers, BERT-HAN still lags behind both our lexical overlap baselines and fine-tuned BERT models, so our overall takeaways from §7 still stand. For the S2D task, our results are not directly comparable to the original numbers of Zhou et al (2020) for two reasons: […] positively-labeled target sentences. When there are fewer than k positively-labeled target sentences in an example, a perfect system will still have a P@k < 1.…”

Section: B Implementation of BERT-HAN and GRU-HAN (mentioning)
Confidence: 90%
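The P@k ceiling noted in the excerpt is easy to verify directly: if an example contains fewer than k positively-labeled target sentences, even a perfect ranking cannot reach P@k = 1. The helper below is a generic sketch of precision-at-k, not code from either paper.

```python
# Illustration of the precision-at-k ceiling: with fewer than k positive
# target sentences, even a perfect ranking scores P@k < 1.

def precision_at_k(ranked_labels, k):
    """Fraction of the top-k ranked items that are positively labeled."""
    return sum(ranked_labels[:k]) / k

# A document with only 2 positive target sentences, ranked perfectly (positives first).
perfect_ranking = [1, 1, 0, 0, 0]
print(precision_at_k(perfect_ranking, k=5))  # 0.4, even though the ranking is perfect
```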