This study uses word embeddings to investigate the semantic changes underlying the creation of two adversative connectives in Portuguese, porém and mas ‘but, however’. For porém, we chart its development from an original PP formed by a preposition with a causal meaning (por) and a demonstrative pronoun that referred anaphorically to a previous proposition (en(de)). For mas, we trace its change from an adverb meaning ‘more’. Adopting a distributional semantics approach, we use word embedding models trained on two corpora, the CIPM (Corpus Informatizado do Português Medieval, containing texts from the 12th–16th centuries) and COLONIA (containing texts from the 16th–20th centuries). We produce a measure of change based on the similarity scores of porém and mas with respect to words in relevant semantic categories in each corpus, representing the source and the target meanings. This paper, which constitutes the first computational study of semantic change in Portuguese, also discusses challenges and outlines steps to be taken into consideration when choosing embedding algorithms for small historical corpora.
High quality distributional models can capture lexical and semantic relations between words. Hence, researchers design various intrinsic tasks to test whether such relations are captured. However, most of the intrinsic tasks are designed for modern languages, and there is a lack of evaluation methods for distributional models of historical corpora. In this paper, we conducted BAHP: a benchmark of assessing word embeddings in Historical Portuguese, which contains four types of tests: analogy, similarity, outlier detection, and coherence. We examined word2vec models generated from two historical Portuguese corpora in these four test sets. The results demonstrate that our test sets are capable of measuring the quality of vector space models and can provide a holistic view of the model's ability to capture syntactic and semantic information. Furthermore, the methodology for the creation of our test sets can be easily extended to other historical languages.
We analyze two constructions expressing the negation of a causal relation in contemporary European Portuguese, nem por isso (lit. ‘not for this [reason]’) and não porque…mas porque (‘not because…but rather because’). We propose that the PA/SN distinction, established for adversative connectives (e.g., Spanish pero and German aber as PA, Spanish sino and German sondern as SN), can be extended to the domain of causality and provides insight on the properties of these two causal constructions. We show that their sensitivity to different types of conversational implicature as well as their interaction with negation are reminiscent of the distinction between restrictive and exclusive adversatives. Our investigation, which is the first semantic-pragmatic analysis of negated causality in European Portuguese, allows us to formulate historical predictions concerning the relation between types of negated causality and the possible development of adversative markers. Our findings provide synchronic evidence for a diachronic semantic path from negated causality to adversative meaning that has been posited for several Romance languages (e.g., Italian però, Spanish pero). Specifically, using contemporary corpus data, we show that the properties of nem por isso provide insight into the bridging contexts for semantic change from causal to adversative meanings. In addition, we identify a type of negated causality that fits within strategies of corrective contrast not exclusive to the domain of causal relations.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.