Estimating the accuracy of a large-scale knowledge graph (KG) often requires humans to annotate samples from the graph. How to obtain statistically meaningful accuracy estimates while keeping human annotation costs low is a problem critical to the development cycle of a KG and its practical applications. Surprisingly, this challenging problem has largely been ignored in prior research. To address it, this paper proposes an efficient sampling and evaluation framework that aims to provide high-quality accuracy evaluation with strong statistical guarantees while minimizing human effort. Motivated by the properties of the annotation cost function observed in practice, we propose the use of cluster sampling to reduce the overall cost. We further apply weighted and two-stage sampling as well as stratification for better sampling designs. We also extend our framework to enable efficient incremental evaluation of an evolving KG, introducing two solutions based on stratified sampling and a weighted variant of reservoir sampling. Extensive experiments on real-world datasets demonstrate the effectiveness and efficiency of our proposed solutions. Compared to baseline approaches, our best solutions provide up to 60% cost reduction on static KG evaluation and up to 80% cost reduction on evolving KG evaluation, without loss of evaluation quality.
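To make the cluster-sampling idea concrete, below is a minimal Python sketch of an entity-clustered accuracy estimator: triples are grouped per entity (annotating several triples about the same entity amortizes verification cost in practice), a simple random sample of entities is drawn, and triple-level accuracy is estimated with a ratio estimator over the sampled clusters. The dict-based KG representation, the callable annotator, and the function name are illustrative assumptions, not the paper's actual interface.

```python
import random

def estimate_kg_accuracy(kg, n_clusters, annotate, seed=None):
    """Estimate KG accuracy via simple cluster sampling.

    kg:        dict mapping an entity id to its list of triples
               (one cluster per entity).
    annotate:  callable(triple) -> bool, standing in for the costly
               human judgment of whether a triple is correct.
    Returns the estimated fraction of correct triples.
    """
    rng = random.Random(seed)
    entities = list(kg)
    sampled = rng.sample(entities, n_clusters)

    correct = total = 0
    for entity in sampled:
        # annotate the whole cluster: all triples about this entity
        for triple in kg[entity]:
            correct += annotate(triple)
            total += 1

    # ratio estimator of triple-level accuracy over the sampled clusters
    return correct / total if total else 0.0
```

In this sketch the cost saving comes purely from clustering; the weighted, two-stage, and stratified designs described in the abstract would change how entities are drawn, not how the ratio estimate is formed.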
Recent research shows that copying is prevalent among Deep-Web data sources and that accounting for copying can significantly improve truth finding from conflicting values. However, existing copy detection techniques do not scale to large sizes and numbers of data sources, so truth finding can be slowed down by one to two orders of magnitude compared with the corresponding techniques that ignore copying. In this paper, we study how to improve the scalability of copy detection on structured data. Our algorithm builds an inverted index for each shared value and processes the index entries in decreasing order of how much the shared value contributes to the conclusion of copying. We show how we use the index to prune the data items considered for each pair of sources and to incrementally refine our results in iterative copy detection. We also apply a sampling strategy that further reduces copy-detection time while obtaining results very similar to those on the whole data set. Experiments on various real data sets show that our algorithm can reduce the time for copy detection by two to three orders of magnitude; in other words, truth finding can benefit from copy detection with very little overhead.
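As a rough illustration of the indexing idea (not the paper's exact algorithm), the sketch below builds an inverted index from each (data item, value) pair to the sources providing it, keeps only values shared by at least two sources, and orders entries so that rarer shared values, which are stronger evidence of copying, are processed first. The per-pair evidence weighting is an assumption made for illustration.

```python
from collections import defaultdict
from itertools import combinations

def build_shared_value_index(sources):
    """sources: dict source_id -> dict data_item -> value.
    Returns ((item, value), provider_ids) entries, ordered so that values
    shared by fewer sources (stronger copying evidence) come first."""
    index = defaultdict(list)
    for sid, claims in sources.items():
        for item, value in claims.items():
            index[(item, value)].append(sid)

    shared = [(key, sids) for key, sids in index.items() if len(sids) > 1]
    shared.sort(key=lambda entry: len(entry[1]))
    return shared

def accumulate_copy_evidence(shared):
    """Tally a crude per-source-pair evidence score: rarer shared values
    count more. A real detector would also weight by value popularity and
    correctness, and could stop early once a pair's verdict is settled."""
    evidence = defaultdict(float)
    for (_item, _value), sids in shared:
        weight = 1.0 / (len(sids) - 1)
        for a, b in combinations(sorted(sids), 2):
            evidence[(a, b)] += weight
    return evidence
```

Processing entries in this order is what enables the pruning mentioned in the abstract: once the remaining, lower-weight entries cannot change the verdict for a source pair, that pair can be skipped.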
Conducting fundamental analysis within subsets of comparable firms has been shown to yield more reliable inferences and improve prediction quality in equity research. However, incorporating and representing both firm-specific information and common economic determinants is widely recognized as the key challenge. This paper investigates how to leverage Semantic Web technologies to assist fundamental analysis by generating flexible and meaningful selections of comparable firms at low cost. We approach the problem by proposing Linked Open Financial Data as the data organization model and ontology modeling for knowledge representation. Results are verified for efficiency through examples of quick mashups, and for feasibility by adapting the approach to existing valuation models.
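As an illustration of the kind of mashup this enables, the sketch below uses rdflib (an assumed library choice; the paper does not prescribe one) to load a few firm triples under a made-up ex: namespace and then select peers of one firm by sector and market-cap band via SPARQL. The property names, firm names, and selection criteria are hypothetical, not the paper's ontology.

```python
from rdflib import Graph, Namespace, Literal
from rdflib.namespace import RDF

EX = Namespace("http://example.org/finance#")  # illustrative namespace

g = Graph()
for name, sector, cap_bn in [("AcmeCo", "Semiconductors", 12.5),
                             ("BetaChip", "Semiconductors", 9.8),
                             ("GammaSoft", "Software", 30.1)]:
    firm = EX[name]
    g.add((firm, RDF.type, EX.Firm))
    g.add((firm, EX.sector, Literal(sector)))
    g.add((firm, EX.marketCapBn, Literal(cap_bn)))

# Select comparable firms for AcmeCo: same sector, market cap in a band.
query = """
PREFIX ex: <http://example.org/finance#>
SELECT ?peer WHERE {
    ex:AcmeCo ex:sector ?s .
    ?peer ex:sector ?s ;
          ex:marketCapBn ?cap .
    FILTER (?peer != ex:AcmeCo && ?cap > 5 && ?cap < 20)
}
"""
for row in g.query(query):
    print(row.peer)
```

Because the comparability criteria live in the query rather than in code, analysts can redefine the peer group (different sector granularity, different size bands) without reorganizing the underlying data.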