Detecting negative and speculative information is essential in most biomedical text-mining tasks, where these language forms are used to express impressions, hypotheses, or explanations of experimental results. Our research focuses on developing a machine-learning-based system that identifies negation and speculation signals and their scope in clinical texts. The proposed system works in two consecutive phases: first, a classifier decides whether each token in a sentence is a negation/speculation signal; then another classifier determines, at sentence level, which tokens are affected by the previously identified signals. The system was trained and evaluated on the clinical texts of the BioScope corpus, a freely available resource consisting of medical and biological texts: full-length articles, scientific abstracts, and clinical reports. The results obtained by our system were compared with those of two other systems, one based on regular expressions and the other on machine learning; our system outperformed both. In the signal-detection task, the F-score was 97.3% for negation and 94.9% for speculation. In the scope-finding task, a token was considered correctly classified if it had been properly identified as being inside or outside the scope of all the negation signals present in the sentence; our proposal achieved an F-score of 93.2% for negation and 80.9% for speculation. Additionally, the percentage of correct scopes (those with all their tokens correctly classified) was evaluated, yielding F-scores of 90.9% for negation and 71.9% for speculation.
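The two-phase architecture described above can be illustrated with a minimal sketch. Note this is not the paper's trained system: the lexicon, the right-branching scope heuristic, and the example sentence are all toy assumptions standing in for the two machine-learning classifiers.

```python
# Hedged sketch of a two-phase negation pipeline (toy stand-ins, not the
# paper's classifiers). Phase 1 labels each token as a signal or not;
# phase 2 labels each token as inside/outside the scope of found signals.

NEGATION_SIGNALS = {"no", "not", "without", "absence"}  # illustrative lexicon

def detect_signals(tokens):
    """Phase 1: per-token binary decision -- is this token a negation signal?"""
    return [tok.lower() in NEGATION_SIGNALS for tok in tokens]

def resolve_scope(tokens, signal_flags):
    """Phase 2: per-token decision -- is this token inside the scope of a
    signal found in phase 1? Toy heuristic: the scope runs from the signal
    to the end of the sentence."""
    in_scope = [False] * len(tokens)
    for i, is_signal in enumerate(signal_flags):
        if is_signal:
            for j in range(i + 1, len(tokens)):
                in_scope[j] = True
    return in_scope

tokens = "The chest X-ray shows no evidence of pneumonia .".split()
signals = detect_signals(tokens)          # "no" flagged as a signal
scope = resolve_scope(tokens, signals)    # "evidence of pneumonia ." in scope
```

In the real system each phase would be a trained classifier over token features rather than a lexicon lookup and a positional heuristic, but the data flow between the two phases is the same.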
Choosing the right tokenizer is a non-trivial task, especially in the biomedical domain, where it poses additional challenges that, if not resolved, propagate errors through the subsequent Natural Language Processing analysis pipeline. This paper aims to identify these problematic cases and analyze the output that a representative and widely used set of tokenizers produces on them. This work will aid the decision-making process of choosing the right strategy according to the downstream application. In addition, it will help developers create accurate tokenization tools or improve existing ones. A total of 14 problematic cases are described, with biomedical samples for each of them. The outputs of 12 tokenizers are provided and discussed in relation to the level of agreement among the tools.
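The kind of disagreement the paper catalogues can be seen even between two trivial strategies. The sketch below is a hypothetical illustration (it does not reproduce any of the 12 tools compared in the paper): a whitespace tokenizer and a punctuation-splitting regex tokenizer disagree on a typical problematic biomedical string combining internal hyphens, parentheses, and a p-value.

```python
import re

def whitespace_tokenize(text):
    """Naive strategy: split on whitespace only, keeping punctuation attached."""
    return text.split()

def punct_tokenize(text):
    """Aggressive strategy: split off every punctuation character as its own
    token (also breaks hyphenated gene/protein names and decimal numbers)."""
    return re.findall(r"\w+|[^\w\s]", text)

sample = "IL-10-producing cells (p<0.05)."
ws = whitespace_tokenize(sample)  # 3 tokens, "IL-10-producing" kept intact
pt = punct_tokenize(sample)       # 14 tokens, hyphens and "0.05" broken apart
```

Neither output is "correct" in the abstract: whether `IL-10-producing` should stay whole or split depends on the downstream application, which is exactly why the choice of tokenizer matters.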
The thesis proposed here aims to assist information retrieval and text-mining tasks through negation and speculation detection, focusing on two different domains. In the biomedical domain, the existence of a corpus annotated with this kind of information has made it possible to develop an effective system that automatically detects these language forms. In the review domain, we have annotated a set of reviews for negation, speculation, and their scope.
Keywords: information retrieval; negation and speculation detection; biomedical and review domains.