Yang Li scite author profile

Entity Extraction is a process of identifying meaningful entities from text documents. In enterprises, extracting entities improves enterprise efficiency by facilitating numerous applications, including search, recommendation, etc. However, the problem is particularly challenging on enterprise domains due to several reasons. First, the lack of redundancy of enterprise entities makes previous web-based systems like NELL and OpenIE not effective, since using only high-precision/low-recall patterns like those systems would miss the majority of sparse enterprise entities, while using more low-precision patterns in sparse setting also introduces noise drastically. Second, semantic drift is common in enterprises ("Blue" refers to "Windows Blue"), such that public signals from the web cannot be directly applied on entities. Moreover, many internal entities never appear on the web. Sparse internal signals are the only source for discovering them. To address these challenges, we propose an end-to-end framework for extracting entities in enterprises, taking the input of enterprise corpus and limited seeds to generate a high-quality entity collection as output. We introduce the novel concept of Semantic Pattern Graph to leverage public signals to understand the underlying semantics of lexical patterns, reinforce pattern evaluation using mined semantics, and yield more accurate and complete entities. Experiments on Microsoft enterprise data show the effectiveness of our approach.

show abstract

Construction and Evaluation of an Oil Spill Semantic Relation Taxonomy for Supporting Knowledge Discovery

Wu¹,

Li²

2015

View full text Add to dashboard Cite

An Efficient Network Anomaly Detection Scheme Based on TCM-KNN Algorithm and Data Reduction Mechanism

Li¹,

Guo

2007

View full text Add to dashboard Cite

Design and Implementation of Barcode Management Information System

Weng¹,

Li²

2012

View full text Add to dashboard Cite

Creating a Taxonomy of Earthquake Disaster Response and Recovery for Online Earthquake Information Management

Li¹,

Wu²

2019

View full text Add to dashboard Cite

The goal of this study is to develop a taxonomy of earthquake response and recovery using online information resources for organizing and sharing earthquake-related online information resources. A constructivist/interpretivist research paradigm was used in the study. A combination of top-down and bottom-up approaches was used to build the taxonomy. Facet analysis of disaster management, the timeframe of disaster management, and modular design were performed when designing the taxonomy. Two case studies were done to demonstrate the usefulness of the taxonomy for organizing and sharing information. The facet-based taxonomy can be used to organize online information for browsing and navigation. It can also be used to index and tag online information resources to support searching. It creates a common language for earthquake management stakeholders to share knowledge. The top three level categories of the taxonomy can be applied to the management of other types of disasters. The taxonomy has implications for earthquake online information management, knowledge management and disaster management. The approach can be used to build taxonomies for managing online information resources on other topics (including various types of time-sensitive disaster responses). We propose a common language for sharing information on disasters, which has great social relevance.

show abstract

DeepXSS

Fang

Liu

et al. 2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yang Li

Reverse Engineering Variability from Natural Language Documents

Malphite: A convolutional neural network and ensemble learning based protein secondary structure predictor

Leveraging Pattern Semantics for Extracting Entities in Enterprises

Construction and Evaluation of an Oil Spill Semantic Relation Taxonomy for Supporting Knowledge Discovery

An Efficient Network Anomaly Detection Scheme Based on TCM-KNN Algorithm and Data Reduction Mechanism

Design and Implementation of Barcode Management Information System

Creating a Taxonomy of Earthquake Disaster Response and Recovery for Online Earthquake Information Management

DeepXSS

Contact Info

Product

Resources

About