Cláudia Maria Cabral Moro scite author profile

With the growing number of electronic health record data, clinical NLP tasks have become increasingly relevant to unlock valuable information from unstructured clinical text. Although the performance of downstream NLP tasks, such as named-entity recognition (NER), in English corpus has recently improved by contextualised language models, less research is available for clinical texts in low resource languages. Our goal is to assess a deep contextual embedding model for Portuguese, so called BioBERTpt, to support clinical and biomedical NER. We transfer learned information encoded in a multilingual-BERT model to a corpora of clinical narratives and biomedical-scientific papers in Brazilian Portuguese. To evaluate the performance of BioBERTpt, we ran NER experiments on two annotated corpora containing clinical narratives and compared the results with existing BERT models. Our in-domain model outperformed the baseline model in F1-score by 2.72%, achieving higher performance in 11 out of 13 assessed entities. We demonstrate that enriching contextual embedding models with domain literature can play an important role in improving performance for specific NLP tasks. The transfer learning process enhanced the Portuguese biomedical NER model by reducing the necessity of labeled data and the demand for retraining a whole new model.

show abstract

ISO/TR 12300:2016 for clinical cross-terminology mapping: contribution to nursing

Torres

Gomes

Ronnau

et al. 2020

Rev. esc. enferm. USP

View full text Add to dashboard Cite

This theoretical and reflective study aimed to assess the contribution of the ISO/TR 12300:2016 document for the mapping of nursing terminology. The referred document and related articles were used as an empirical framework. The study analyzed the content of the document, highlighting cardinality and equivalence principles. The standard presents conceptual and operational basis for mapping, with cardinality and equivalence as the support for the categorization of cross-terminology mapping in the area of nursing. Cardinality verifies candidate target terms to represent the source term, while the equivalence degree scale checks semantic correspondence. Among the principles included in the ISO/TR 12300:2016, cardinality and equivalence contribute to the accurate representation of the results of the cross-terminology mapping process and its use should decrease inconsistencies.

show abstract

Applying process mining and semantic reasoning for process model customisation in healthcare

Detro

Santos

Panetto

et al. 2019

Enterprise Information Systems

View full text Add to dashboard Cite

Process flexibility plays a key role in high variability environments, such as healthcare. In this type of environment, the process model needs to change some elements to adjust to specific sets of requirements. Thus, this paper proposes a process model customizing method based on ontology and process mining. The method proposed is applied in customizing process models for acute ischemic stroke treatment. During process model customization, the method provides decision-making support for users, thereby ensuring a structurally correct process customization and enabling improves patient treatment by means of recommendations.

show abstract

Supervised learning for the detection of negation and of its scope in French and Brazilian Portuguese biomedical corpora

Dalloux¹,

Claveau²,

Grabar

et al. 2020

Nat. Lang. Eng.

View full text Add to dashboard Cite

Automatic detection of negated content is often a prerequisite in information extraction systems in various domains. In the biomedical domain especially, this task is important because negation plays an important role. In this work, two main contributions are proposed. First, we work with languages which have been poorly addressed up to now: Brazilian Portuguese and French. Thus, we developed new corpora for these two languages which have been manually annotated for marking up the negation cues and their scope. Second, we propose automatic methods based on supervised machine learning approaches for the automatic detection of negation marks and of their scopes. The methods show to be robust in both languages (Brazilian Portuguese and French) and in cross-domain (general and biomedical languages) contexts. The approach is also validated on English data from the state of the art: it yields very good results and outperforms other existing approaches. Besides, the application is accessible and usable online. We assume that, through these issues (new annotated corpora, application accessible online, and cross-domain robustness), the reproducibility of the results and the robustness of the NLP applications will be augmented.

show abstract

A GPT-2 Language Model for Biomedical Texts in Portuguese

Schneider

Souza

Gumiel

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.