2016
DOI: 10.1371/journal.pone.0144717
|View full text |Cite
|
Sign up to set email alerts
|

Text Mining the History of Medicine

Abstract: Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concept… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

3
47
0
1

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
3
2

Relationship

1
7

Authors

Journals

citations
Cited by 54 publications
(51 citation statements)
references
References 51 publications
3
47
0
1
Order By: Relevance
“…Some PDF files without texts are scans of the original article (point 1). We did not attempt to make an optical character recognition conversion (OCR) as the old typesetting fonts often are less compatible with present day OCR programs, and this can lead to text recognition errors [ 28 , 29 ]. For any discarded document, we still used the meta-data to calculate summary statistics.…”
Section: Methodsmentioning
confidence: 99%
“…Some PDF files without texts are scans of the original article (point 1). We did not attempt to make an optical character recognition conversion (OCR) as the old typesetting fonts often are less compatible with present day OCR programs, and this can lead to text recognition errors [ 28 , 29 ]. For any discarded document, we still used the meta-data to calculate summary statistics.…”
Section: Methodsmentioning
confidence: 99%
“…All data must be digital before it can be processed; but not all data that requires processing is in a usable digital format. While it is rare for data scientists to interact with non-digital data, many clinicians [1], historians [2], educators and field researchers [3] still regularly capture or must work with historical archives of paper-based spreadsheet data. For small datasets, manual transcription of these records is feasible, but as data requirements grow, researchers and professionals are required to invest considerable resources to transcribe their paper-based data into digital form [4], [5].…”
Section: Introductionmentioning
confidence: 99%
“…Data in the educational and healthcare domains, for instance, often contain sensitive personal information requiring specialized authorization to share with third parties. These constraints make transcription arduous and costly [1].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…A recent study by Thompson et al (115) may serve as an example of combining all of the elements of text mining within a single project. The goal was to analyse medical vocabulary from a historical perspective, observing how certain terms and concepts appear, transform and wither across the years.…”
Section: Discussionmentioning
confidence: 99%