2021
DOI: 10.1186/s13244-021-01018-1

T-staging pulmonary oncology from radiological reports using natural language processing: translating into a multi-language setting

Abstract: Background In the era of datafication, it is important that medical data are accurate and structured for multiple applications. Data for oncological staging in particular need to be accurate for staging and treating a patient, as well as for population-level surveillance and outcome assessment. To support data extraction from free-text radiological reports, a Dutch natural language processing (NLP) algorithm was built to quantify the T-stage of pulmonary tumors according to the tumor node metastasis (TNM) classif…

Cited by 12 publications (11 citation statements); references 19 publications.
“…A pre-trained language model (PLM) is trained on a large corpus and uses these data to learn semantic representations of the knowledge contained in large volumes of text, which can then be applied to downstream tasks. These downstream tasks include natural language processing tasks such as classification (Li et al, 2019b; Maltoudoglou et al, 2022; Ni et al, 2020a, 2020b), sequence labeling (Dai et al, 2019; Li et al, 2020b), summarization (Chintagunta et al, 2021; Lacson et al, 2006; Yuan et al, 2021), translation (Névéol et al, 2018; Nobel et al, 2021; Wang et al, 2019), and generation (Melamud & Shivade, 2019; Peng et al, 2019; Xiong et al, 2019). For translation, one of the newer downstream tasks, Zhu et al (2020) found that using the pre-trained language model as a contextual embedding, rather than fine-tuning it directly, produces better results.…”
Section: Related Work
confidence: 99%
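The distinction this statement draws — a pre-trained model used as a frozen contextual embedder rather than fine-tuned directly — can be made concrete with a minimal sketch. The model name, linear head, and example sentence below are illustrative assumptions, not code from any of the cited works.

```python
# Minimal sketch: a frozen PLM supplies contextual embeddings;
# only a small task head on top is trained (no direct fine-tuning).
import torch
from torch import nn
from transformers import AutoModel, AutoTokenizer

MODEL = "bert-base-multilingual-cased"  # assumed model, for illustration
tokenizer = AutoTokenizer.from_pretrained(MODEL)
encoder = AutoModel.from_pretrained(MODEL)
encoder.eval()  # frozen: used only to produce embeddings

classifier = nn.Linear(encoder.config.hidden_size, 2)  # trainable head

def embed(texts):
    """Return one contextual [CLS] vector per text, without gradients."""
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():  # encoder weights are never updated
        hidden = encoder(**batch).last_hidden_state
    return hidden[:, 0]  # [CLS] embedding per input

logits = classifier(embed(["The tumor measures 2.4 cm."]))
```

In the fine-tuning alternative, the encoder's parameters would be updated together with the head; the quoted finding is that, for translation, keeping the PLM as an embedder worked better.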
“…For both the training and the validation sets, the substage accuracy scores were calculated separately for the T-stage and the N-stage. A T-substage is a subdivision of the T-stage that provides more detail; for example, stage T1 (≤3 cm) contains substage T1c (2 to ≤3 cm) [1, 18]. In addition to the T- and N-stage, the combined accuracy score (TN-stage) was calculated for the training and validation sets.…”
Section: Methods
confidence: 99%
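A hedged sketch of the scoring this excerpt describes: per-label accuracy for the T- and N-(sub)stages, plus a combined TN score that counts a report correct only when both stages match. The list-of-pairs format and the example labels are assumptions made for illustration, not the study's data format.

```python
# Accuracy per stage, plus a combined TN-stage accuracy in which a
# report counts as correct only if both T and N labels match.
def accuracy(gold, pred):
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)

gold = [("T1c", "N0"), ("T2a", "N1"), ("T1c", "N0")]  # invented examples
pred = [("T1c", "N0"), ("T2b", "N1"), ("T1c", "N1")]

t_acc = accuracy([g[0] for g in gold], [p[0] for p in pred])  # 2/3
n_acc = accuracy([g[1] for g in gold], [p[1] for p in pred])  # 2/3
tn_acc = accuracy(gold, pred)                                 # 1/3
print(t_acc, n_acc, tn_acc)
```

Note that the combined TN score can never exceed either individual score, since both labels must be correct at once.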
“…NLP has also been used in a recent and ongoing transnational project to extract the pulmonary oncology stage from free-text radiological chest CT reports [17, 18]. The overall goal is to build a language-independent algorithm that can extract pulmonary oncology staging according to the TNM classification.…”
Section: Introduction
confidence: 99%
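To make the task concrete, below is a hypothetical sketch of the kind of size-based rule such an extraction pipeline could apply: find a tumor diameter in a free-text report and map it to a T-(sub)stage using TNM 8th edition size thresholds (consistent with the Methods excerpt above, where T1 is ≤3 cm and T1c is 2 to ≤3 cm). The regex, the simplified thresholds (T2–T4 substages and non-size criteria omitted), and the sample report are assumptions, not the project's actual rules.

```python
# Hypothetical rule: map an extracted tumor size (cm) to a T-(sub)stage.
import re

def t_stage_from_size(cm: float) -> str:
    if cm <= 1:
        return "T1a"
    if cm <= 2:
        return "T1b"
    if cm <= 3:
        return "T1c"
    if cm <= 5:
        return "T2"   # T2a/T2b subdivision omitted for brevity
    if cm <= 7:
        return "T3"
    return "T4"

report = "Spiculated mass in the right upper lobe measuring 2.4 cm."
match = re.search(r"(\d+(?:[.,]\d+)?)\s*cm", report)
if match:
    size = float(match.group(1).replace(",", "."))
    print(t_stage_from_size(size))  # -> T1c
```

A multi-language version of such a rule would mainly need language-specific trigger terms and unit patterns, in line with the project's stated goal of a language-independent staging algorithm.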
“…We can observe that, in 2021, researchers mainly concentrated on studying English-language data. Indeed, compared to previous years, fewer languages were covered: Chinese [3][4][5][6][7][8][9][10], Dutch [11], French [12,13], Italian [14][15][16], Japanese [17], Korean [18,19], Norwegian [20], and Spanish. Moreover, except for Chinese, very few works addressed the languages represented in these publications.…”
Section: Languages Addressed
confidence: 99%