2023
DOI: 10.2196/39077
|View full text |Cite
|
Sign up to set email alerts
|

German Medical Named Entity Recognition Model and Data Set Creation Using Machine Translation and Word Alignment: Algorithm Development and Validation

Abstract: Background Data mining in the field of medical data analysis often needs to rely solely on the processing of unstructured data to retrieve relevant data. For German natural language processing, few open medical neural named entity recognition (NER) models have been published before this work. A major issue can be attributed to the lack of German training data. Objective We developed a synthetic data set and a novel German medical NER model for public ac… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 42 publications
0
0
0
Order By: Relevance
“…Most models are not easily transferable and reusable in non-English medical NLP models. Instead, retraining with new data in the target language is typically necessary [20]. Accurately and meaningfully extracting information from raw ultrasound reports poses a significant challenge due to the complex and unstructured nature of the text.…”
Section: Introductionmentioning
confidence: 99%
“…Most models are not easily transferable and reusable in non-English medical NLP models. Instead, retraining with new data in the target language is typically necessary [20]. Accurately and meaningfully extracting information from raw ultrasound reports poses a significant challenge due to the complex and unstructured nature of the text.…”
Section: Introductionmentioning
confidence: 99%