“…Compared to fully fine-tuned models, adapter models incorporate only a few task-specific parameters for each new task. The BERT-based experiments were conducted on three pretrained language models capable of handling Hebrew text: (a) XLM-RoBERTa (Conneau et al., 2019), a multilingual model based on the RoBERTa architecture (Liu et al., 2019); (b) HeBERT (Chriqui & Yahav, 2021), a monolingual BERT model trained on Hebrew data; and (c) AlephBERT (Seker et al., 2022), another monolingual Hebrew BERT model with a 52K-token vocabulary, trained via masked-token prediction. Adapter variants of these models were built with lightweight solutions that train only a small number of task-specific parameters, using bottleneck adapters (Houlsby et al., 2019) and mix-and-match (MAM) adapters (He et al., 2021).…”
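
To make the adapter idea concrete, the sketch below is a minimal PyTorch illustration of a Houlsby-style bottleneck adapter: a small down-projection/up-projection pair with a residual connection that would be inserted into each transformer layer, so that only these few parameters are trained for a new task while the pretrained encoder (e.g., AlephBERT or HeBERT) stays frozen. The module name, hidden size, and bottleneck size are illustrative assumptions, not the papers' exact configurations.

```python
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Illustrative Houlsby-style bottleneck adapter: down-project,
    non-linearity, up-project, and a residual connection. In adapter
    training, only these parameters are updated per task."""
    def __init__(self, hidden_size: int = 768, bottleneck_size: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck_size)  # down-projection
        self.up = nn.Linear(bottleneck_size, hidden_size)    # up-projection
        self.act = nn.GELU()

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # The residual connection preserves the frozen pretrained
        # representation; the adapter learns a small task-specific correction.
        return hidden_states + self.up(self.act(self.down(hidden_states)))

# Usage sketch: shapes only, with an assumed hidden size of 768.
adapter = BottleneckAdapter(hidden_size=768, bottleneck_size=64)
x = torch.randn(2, 16, 768)   # (batch, sequence length, hidden size)
print(adapter(x).shape)       # torch.Size([2, 16, 768])
```

With a 768-dimensional hidden state and a 64-dimensional bottleneck, each adapter adds roughly 100K parameters per layer, which is how these variants keep the per-task footprint small compared to full fine-tuning.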