This study set out to analyze questions about type 2 diabetes mellitus (T2DM) from patients and the public. The aim was to better understand people’s information needs by starting with what they do not know, discovered through their own questions, rather than starting with what we know about T2DM and subsequently finding ways to communicate that information to people affected by or at risk of the disease. One hundred and sixty-four questions were collected from 120 patients attending outpatient diabetes clinics and 300 questions from 100 members of the public through the Amazon Mechanical Turk crowdsourcing platform. Twenty-three general and diabetes-specific topics and five phases of disease progression were identified; these were used to manually categorize the questions. Analyses were performed to determine which topics, if any, were significant predictors of a question’s being asked by a patient or the public, and similarly for questions from a woman or a man. Further analysis identified the individual topics that were assigned significantly more often to the crowdsourced or clinic questions. These were Causes (CI: [-0.07, -0.03], p < .001), Risk Factors ([-0.08, -0.03], p < .001), Prevention ([-0.06, -0.02], p < .001), Diagnosis ([-0.05, -0.02], p < .001), and Distribution of a Disease in a Population ([-0.05,-0.01], p = .0016) for the crowdsourced questions and Treatment ([0.03, 0.01], p = .0019), Disease Complications ([0.02, 0.07], p < .001), and Psychosocial ([0.05, 0.1], p < .001) for the clinic questions. No highly significant gender-specific topics emerged in our study, but questions about Weight were more likely to come from women and Psychosocial questions from men. There were significantly more crowdsourced questions about the time Prior to any Diagnosis ([(-0.11, -0.04], p = .0013) and significantly more clinic questions about Health Maintenance and Prevention after diagnosis ([0.07. 0.17], p < .001). A descriptive analysis pointed to the value provided by the specificity of questions, their potential to disclose emotions behind questions, and the as-yet unrecognized information needs they can reveal. Large-scale collection of questions from patients across the spectrum of T2DM progression and from the public–a significant percentage of whom are likely to be as yet undiagnosed–is expected to yield further valuable insights.
The annotation files for S.cerevisiae are available from ftp://genome-ftp.stanford.edu/pub/yeast/data_download/literature_curation/gene_association.sgd.gz. The draft protocol vocabulary is available by request from the first author.
This paper presents a new method of analysis by which structural similarities between brain data and linguistic data can be assessed at the semantic level. It shows how to measure the strength of these structural similarities and so determine the relatively better fit of the brain data with one semantic model over another. The first model is derived from WordNet, a lexical database of English compiled by language experts. The second is given by the corpus-based statistical technique of latent semantic analysis (LSA), which detects relations between words that are latent or hidden in text. The brain data are drawn from experiments in which statements about the geography of Europe were presented auditorily to participants who were asked to determine their truth or falsity while electroencephalographic (EEG) recordings were made. The theoretical framework for the analysis of the brain and semantic data derives from axiomatizations of theories such as the theory of differences in utility preference. Using brain-data samples from individual trials time-locked to the presentation of each word, ordinal relations of similarity differences are computed for the brain data and for the linguistic data. In each case those relations that are invariant with respect to the brain and linguistic data, and are correlated with sufficient statistical strength, amount to structural similarities between the brain and linguistic data. Results show that many more statistically significant structural similarities can be found between the brain data and the WordNet-derived data than the LSA-derived data. The work reported here is placed within the context of other recent studies of semantics and the brain. The main contribution of this paper is the new method it presents for the study of semantics and the brain and the focus it permits on networks of relations detected in brain data and represented by a semantic model.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.