Clause-Level Tense, Mood, Voice and Modality Tagging for German

Dönicke, Tillmann

doi:10.18653/v1/2020.tlt-1.1

Cited by 3 publications

(5 citation statements)

References 19 publications

(27 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Furthermore, a lot of the datasets in the shared task incorporate automatically created dependency trees (created by models trained on UD treebanks), which may lead to follow-up errors in the clause-splitting step. Dönicke (2020) reports an F1 of 81% for predicting clauses in a German text after preprocessing it with a spaCy model trained on the German UD treebanks. Even though this number only gives a rough estimate on how well our system identifies clauses, there is clearly room for improvement.…”

Section: Resultsmentioning

confidence: 99%

“…5 Algorithm 1 shows an updated version of the original algorithm that has been modified to work with a broader range of languages, specifically the languages in the shared task. In the following, the algorithm is briefly described, with a focus on the adaptions made for multiple languages (numbers in parentheses refer to lines in the pseudocode); for further explanations see Dönicke (2020).…”

Section: Feature Vectorsmentioning

confidence: 99%

“…35-37). In Dönicke (2020), the lemmas of the verbs are used but in our multilingual implementation, we map the lemmas to three categories of modal verbs (cf. Biber et al, 2002, p. 176): permission/possibility/ability (POS), obligation/necessity (OBL), and volition/prediction (VOL).…”

Section: Feature Vectorsmentioning

confidence: 99%

See 2 more Smart Citations

Delexicalised Multilingual Discourse Segmentation for DISRPT 2021 and Tense, Mood, Voice and Modality Tagging for 11 Languages

Dönicke¹

2021

Proceedings of the 2nd Shared Task on Discourse Relation Parsing and Treebanking (DISRPT 2021)

Self Cite

View full text Add to dashboard Cite

This paper describes our participating system for the Shared Task on Discourse Segmentation and Connective Identification across Formalisms and Languages. Key features of the presented approach are the formulation as a clause-level classification task, a languageindependent feature inventory based on Universal Dependencies grammar, and compositeverb-form analysis. The achieved F1 is 92% for German and English and lower for other languages. The paper also presents a clauselevel tagger for grammatical tense, aspect, mood, voice and modality in 11 languages.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Feature Vectorsmentioning

confidence: 99%

See 1 more Smart Citation

Delexicalised Multilingual Discourse Segmentation for DISRPT 2021 and Tense, Mood, Voice and Modality Tagging for 11 Languages

Dönicke¹

2021

Proceedings of the 2nd Shared Task on Discourse Relation Parsing and Treebanking (DISRPT 2021)

Self Cite

View full text Add to dashboard Cite

show abstract

“…The Python package spaCy offers a solution, but reports question its reliability for German text [23]. In fact, determining the tense in German text is almost a research project in itself [22]. Given the limited scope and resources of this research project, tense as an explanatory aspect is not further considered.…”

Section: ) Psycholinguistic Word Propertiesmentioning

confidence: 99%

Exploring owner-based brand personality on Facebook and LinkedIn with Natural-Language-Processing (NLP)

Griesser

2022

2022 9th Swiss Conference on Data Science (SDS)

View full text Add to dashboard Cite

The current study on brand personality based on natural language focuses on large companies with a corpus of English terms. Other Natural-Language-Processing (NLP) techniques and psycholinguistic word properties are not considered. This study uncovers additional factors that explain owner-based brand personality of micro-companies by using Part-of-Speech (PoS) tagging and psycholinguistic corpora. The analysis of German-language posts of three micro-companies on LinkedIn and Facebook shows that the word type "verb" and language sentiment, emotional intensity, and concreteness are stronger explanatory factors compared to an owner-based brand personality corpus. Researchers and professionals should thus not limit their brand personality analysis to keyword semantics, but also consider word types and psycholinguistic word properties. Further research is needed to improve the explanatory power of the brand personality corpus.

show abstract

“…Clause segmentation is performed with the clausizer presented inDönicke (2020). The manually created clause-level annotations are then automatically mapped to the detected clauses 4.…”

mentioning

confidence: 99%

Modelling Speaker Attribution in Narrative Texts With Biased and Bias-Adjustable Neural Networks

Dönicke

Varachkina

Weimer

et al. 2022

Front. Artif. Intell.

Self Cite

View full text Add to dashboard Cite

Literary narratives regularly contain passages that different readers attribute to different speakers: a character, the narrator, or the author. Since literary narratives are highly ambiguous constructs, it is often impossible to decide between diverging attributions of a specific passage by hermeneutic means. Instead, we hypothesise that attribution decisions are often influenced by annotator bias, in particular an annotator's literary preferences and beliefs. We present first results on the correlation between the literary attitudes of an annotator and their attribution choices. In a second set of experiments, we present a neural classifier that is capable of imitating individual annotators as well as a common-sense annotator, and reaches accuracies of up to 88% (which improves the majority baseline by 23%).

show abstract

Clause-Level Tense, Mood, Voice and Modality Tagging for German

Cited by 3 publications

References 19 publications

Delexicalised Multilingual Discourse Segmentation for DISRPT 2021 and Tense, Mood, Voice and Modality Tagging for 11 Languages

Delexicalised Multilingual Discourse Segmentation for DISRPT 2021 and Tense, Mood, Voice and Modality Tagging for 11 Languages

Exploring owner-based brand personality on Facebook and LinkedIn with Natural-Language-Processing (NLP)

Modelling Speaker Attribution in Narrative Texts With Biased and Bias-Adjustable Neural Networks

Contact Info

Product

Resources

About