Yuka Tateisi scite author profile

Background: Associating literature with pathways poses new challenges to the Text Mining (TM) community. There are three main challenges to this task: (1) the identification of the mapping position of a specific entity or reaction in a given pathway, (2) the recognition of the causal relationships among multiple reactions, and (3) the formulation and implementation of required inferences based on biological domain knowledge.

An intelligent search engine and GUI-based efficient MEDLINE search tool based on deep syntactic parsing

Ohta

Miyao

Ninomiya

et al. 2006

Automatic construction of predicate-argument structure patterns for biomedical information extraction

Yakushiji

Miyao

Ohta

et al. 2006

This paper presents a method of automatically constructing information extraction patterns on predicate-argument structures (PASs) obtained by full parsing from a smaller training corpus. Because PASs represent generalized structures for syntactical variants, patterns on PASs are expected to be more generalized than those on surface words. In addition, patterns are divided into components to improve recall and we introduce a Support Vector Machine to learn a prediction model using pattern matching results. In this paper, we present experimental results and analyze them on how well protein-protein interactions were extracted from MEDLINE abstracts. The results demonstrated that our method improved accuracy compared to a machine learning approach using surface word/part-of-speech patterns.

The GENIA corpus

Ohta

Tateisi

Kim

2002

With the information overload in genome-related fields, there is an increasing need for natural language processing (NLP) technology to extract information from the literature, and various attempts at information extraction using NLP have been made. We are developing the necessary resources, including a domain ontology and an annotated corpus built from research abstracts in the MEDLINE database (the GENIA corpus). We are building the ontology and the corpus simultaneously, each informing the other. In this paper we report on our new corpus, its ontological basis, annotation scheme, and statistics of annotated objects. We also describe the tools used for corpus annotation and management.

A Lightweight Approach for Extracting Disease-Symptom Relation with MetaMap toward Automated Generation of Disease Knowledge Base

Okumura

Tateisi

2012

Yuka Tateisi

GENIA corpus—a semantically annotated corpus for bio-textmining

Introduction to the bio-entity recognition task at JNLPBA

Event Extraction From Biomedical Papers Using a Full Parser

New challenges for text mining: mapping between text and manually curated pathways
