2020 IEEE International Conference on Data Mining (ICDM) 2020
DOI: 10.1109/icdm50108.2020.00146
|View full text |Cite
|
Sign up to set email alerts
|

Bottom-Up and Top-Down: Predicting Personality with Psycholinguistic and Language Model Features

Abstract: State-of-the-art personality prediction with text data mostly relies on bottom up, automated feature generation as part of the deep learning process. More traditional models rely on hand-crafted, theory-based text-feature categories. We propose a novel deep learning-based model which integrates traditional psycholinguistic features with language model embeddings to predict personality from the Essays dataset for Big-Five and Kaggle dataset for MBTI. With this approach we achieve stateof-the-art model performan… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
39
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 76 publications
(51 citation statements)
references
References 40 publications
1
39
0
Order By: Relevance
“…The dataset only contained two columns, one is the user MBTI type, and another is their respective posts on Personality Cafe forum. Five researches on the same dataset were identified, namely Cui & Qi (2017), Bharadwaj et al (2018), Li et al (2018), Mehta et al (2020) and Amirhosseini & Kazemian (2020). The justifications on the reasons of choosing this dataset are as follows:…”
Section: Datasetmentioning
confidence: 99%
See 2 more Smart Citations
“…The dataset only contained two columns, one is the user MBTI type, and another is their respective posts on Personality Cafe forum. Five researches on the same dataset were identified, namely Cui & Qi (2017), Bharadwaj et al (2018), Li et al (2018), Mehta et al (2020) and Amirhosseini & Kazemian (2020). The justifications on the reasons of choosing this dataset are as follows:…”
Section: Datasetmentioning
confidence: 99%
“…However, that cannot be reproduced with Kaggle-Filtered dataset with MBTI keyword removed. Four researches, Cui & Qi (2017), Bharadwaj et al (2018), Li et al (2018) and Mehta et al (2020) did not mention explicitly about removal of MBTI keyword in their preprocessing step. Only Amirhosseini & Kazemian (2020) mentioned removal of MBTI keywords in their preprocessing steps.…”
Section: Benchmarking With Previous Researchesmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, this is justified by the fact that trendy modern pedagogical concepts such as 'self-directed', 'inquiry-based', and 'student agency' are all integrated within bottom-up education as an umbrella term. Mehta et al (2020) consider the bottom-up approach to be the basis of the strategy for personal development and a part of the deep learning process.…”
Section: General Insights Of the Problem Articulationmentioning
confidence: 99%
“…Third, cloud computing is subject to a weak and still under-development regulation. Accordingly, privacy concerns arise especially when one recalls that emotion recognition and sentiment analysis allow for user profiling [ 6 , 7 ].…”
Section: Introductionmentioning
confidence: 99%