Proceedings of the 24th Conference on Computational Natural Language Learning 2020
DOI: 10.18653/v1/2020.conll-1.11
Bridging Information-Seeking Human Gaze and Machine Reading Comprehension

Abstract: In this work, we analyze how human gaze during reading comprehension is conditioned on the given reading comprehension question, and whether this signal can be beneficial for machine reading comprehension. To this end, we collect a new eye-tracking dataset with a large number of participants engaging in a multiple choice reading comprehension task. Our analysis of this data reveals increased fixation times over parts of the text that are most relevant for answering the question. Motivated by this finding, we p…

Cited by 24 publications (20 citation statements); references 27 publications (24 reference statements).
“…In terms of limitations, we did not investigate the breadth or depth of influence of our method of [CLS]-based aggregate attention supervision on the model attentions across layers and heads, nor the supervision of specific layers or heads as done by Strubell et al (2018). We did not explore trade-off coefficients on the multiple losses, such as the convex combination used by Malmaud et al (2020). We used a relatively small English dataset, which limited generalizability and robustness.…”

Section: Discussion
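For reference, a convex combination of two training losses weights them with coefficients that sum to one. Below is a minimal PyTorch sketch of that idea; the function name and the trade-off coefficient `lam` are illustrative assumptions, not the implementation from either cited paper.

```python
import torch

def combined_loss(task_loss: torch.Tensor,
                  gaze_loss: torch.Tensor,
                  lam: float = 0.5) -> torch.Tensor:
    """Convex combination of two losses: lam * L_task + (1 - lam) * L_gaze.

    `lam` is a hypothetical trade-off coefficient in [0, 1]; sweeping it is
    the kind of exploration the quoted discussion says was not performed.
    """
    assert 0.0 <= lam <= 1.0, "convexity requires lam in [0, 1]"
    return lam * task_loss + (1.0 - lam) * gaze_loss
```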
“…Because BERT uses subword tokenization, to allow matching entries to be found in the ZuCo word-level data we split the ZuCo words into BERT tokens, evenly dividing values between each subword piece (e.g., "delicacy" → "del", "##ica", "##cy", each piece allotted a third of the ZuCo value), a technique used by Malmaud et al (2020). We preserve entity markers "<e>" and "</e>" in each sample by adding them as special tokens to the BERT tokenizer so their embeddings are learned with other tokens during fine-tuning.…”

Section: Methods
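A minimal sketch of the even-division scheme described in that quote, using the Hugging Face `transformers` tokenizer; the function name and inputs are illustrative assumptions, not code from the cited papers.

```python
from transformers import BertTokenizer

def split_word_values(words, values, tokenizer):
    """Distribute word-level ZuCo values evenly across BERT subword pieces.

    If a word splits into k subword pieces, each piece is allotted value / k,
    e.g. a word split as ["del", "##ica", "##cy"] gets a third apiece.
    """
    pieces, piece_values = [], []
    for word, value in zip(words, values):
        subwords = tokenizer.tokenize(word) or [tokenizer.unk_token]
        share = value / len(subwords)  # even division across pieces
        pieces.extend(subwords)
        piece_values.extend([share] * len(subwords))
    return pieces, piece_values

# Illustrative usage with made-up gaze values.
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
pieces, piece_values = split_word_values(["a", "delicacy"], [0.1, 0.9], tokenizer)
```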
“…Obtaining human scores for sequence tokens requires preprocessing. As ZuCo uses a word-level lexicon but BERT uses subword tokenization, in order to find matching entries we first split the ZuCo words into BERT tokens, evenly dividing values between each subword piece (e.g., when tokenizing "delicacy" → ["del", "##ica", "##cy"] in sentence j, each piece is allotted a third of the ZuCo value), a technique previously used by [26]. We pass the human ET and EEG token values z_ET and z_EEG through a softmax layer to obtain two distributions over sentences, vectors α″_ET and α′_EEG.…”

Section: Methods
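The softmax step in that quote turns raw per-token values into a probability distribution over a sentence's tokens. A minimal sketch with made-up values (the tensors below are illustrative, not actual ZuCo data):

```python
import torch
import torch.nn.functional as F

# Hypothetical per-token human signals for one four-token sentence.
z_et = torch.tensor([0.2, 1.5, 0.7, 0.1])   # eye-tracking (ET) values
z_eeg = torch.tensor([0.4, 0.9, 1.1, 0.3])  # EEG values

# Softmax maps each raw vector to a distribution over the tokens,
# playing the role of the alpha''_ET and alpha'_EEG vectors in the quote.
alpha_et = F.softmax(z_et, dim=0)
alpha_eeg = F.softmax(z_eeg, dim=0)

assert torch.allclose(alpha_et.sum(), torch.tensor(1.0))
```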
“…Cognitive NLP tasks that have been studied in recent years include sentiment analysis [23], part-of-speech (POS) tagging [24], and named entity recognition (NER) [25]. Typically for neural network-based approaches, a recurrent architecture such as a bidirectional Long Short-Term Memory (biLSTM) network has been used; more recently a variant of BERT was used for question answering with ET prediction as the auxiliary task [26]. MTL has been used for sentiment analysis and NER with learning gaze behavior as the auxiliary task [27,28], and a combination of gaze and brain data has been applied to a suite of NLP tasks [11], including sentiment analysis, using approaches such as predicting cognitive data or using those data to augment input embeddings.…”

Section: Related Work