2018
DOI: 10.2196/10779
|View full text |Cite
|
Sign up to set email alerts
|

Improving Consumer Understanding of Medical Text: Development and Validation of a New SubSimplify Algorithm to Automatically Generate Term Explanations in English and Spanish

Abstract: BackgroundWhile health literacy is important for people to maintain good health and manage diseases, medical educational texts are often written beyond the reading level of the average individual. To mitigate this disconnect, text simplification research provides methods to increase readability and, therefore, comprehension. One method of text simplification is to isolate particularly difficult terms within a document and replace them with easier synonyms (lexical simplification) or an explanation in plain lan… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
17
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(18 citation statements)
references
References 16 publications
0
17
0
Order By: Relevance
“…Notice that in (Alfano et al, 2019c) we have evaluated the language familiarity of Web pages targeted to different audience types. This has been done by computing the "term familiarity index" of a word (i.e., number of results provided by the Google search engine, Kloehn, et al, 2018;Leroy, et al, 2012) and then computing the language familiarity of a Web page as the average of the term familiarity indexes of its words. The results clearly show that, on average, the Web pages targeted to patients have a much higher language familiarity, and thus a simpler terminology than the Web pages targeted to clinicians or medical researchers.…”
Section: Experimental Usementioning
confidence: 99%
“…Notice that in (Alfano et al, 2019c) we have evaluated the language familiarity of Web pages targeted to different audience types. This has been done by computing the "term familiarity index" of a word (i.e., number of results provided by the Google search engine, Kloehn, et al, 2018;Leroy, et al, 2012) and then computing the language familiarity of a Web page as the average of the term familiarity indexes of its words. The results clearly show that, on average, the Web pages targeted to patients have a much higher language familiarity, and thus a simpler terminology than the Web pages targeted to clinicians or medical researchers.…”
Section: Experimental Usementioning
confidence: 99%
“…Yet, medical educational texts are often written beyond the reading level of the average individual. One work proposes to isolate particularly difficult terms within documents and to replace them with easier synonyms or explanations in plain language 11 . The main accent is put on the automatic generation of explanations for difficult terms in English and Spanish.…”
Section: Resultsmentioning
confidence: 99%
“…Among the pre-selected publications, we can for instance mention: Chinese, addressed through the manually annotated dataset containing 540 breast radiology reports 5 ; Italian, addressed through 5,432 non-annotated medical reports belonging to patients with rare arrhythmias and the manually curated hospital database 6 ; French, addressed for detecting medical events as an epidemiological purpose 7 , or through the clinical data warehouse for thousands of patients 8 ; Japanese, addressed through the data set of 5,000 patient pharmacovigilance complaints 9 ; and Korean, addressed through 30 rheumatic patients discharge summaries annotated with temporal information 10 . Besides, non-clinical data have been used in Spanish and English for the building of vocabulary useful for patients to better understand the medical and health information in these languages 11 . In almost all these works, the corpora built and annotated are proprietary and can only be used by the teams that built them in collaboration with their clinical colleagues.…”
Section: Introductionmentioning
confidence: 99%
“…We have taken the three subsets presented in the previous section, related to Patient, Clinician, and MedicalResearcher audience types, and, for each quadruple, we have analysed the related Web page in order to estimate its language complexity. To this end, we have evaluated the 'term familiarity index', as described in [6], [24], [25] of the English and non-empty Web pages (around 50% of the total). In particular, for each Web page, we have computed the term familiarity of each word by using the number of results provided by the Google search engine and we have then computed the page familiarity index by averaging all the term familiarity indexes.…”
Section: Mapping Language Complexity User Requirements To Audience Typesmentioning
confidence: 99%