Tien Duc Cao scite author profile

Tien Duc Cao

2Publications

9Citation Statements Received

15Citation Statements Given

How they've been cited

How they cite others

Affiliations

Inria Saclay - Île-de-France Research Centre, University of Paris-Saclay, Institut Polytechnique de Paris

Publications

Order By: Most citations

Extracting linked data from statistic spreadsheets

Cao

Manolescu

Tannier

2017

View full text Add to dashboard Cite

Statistic data is an important sub-category of open data; it is interesting for many applications, including but not limited to data journalism, as such data is typically of high quality, and reflects (under an aggregated form) important aspects of a society's life such as births, immigration, economic output etc. However, such open data is often not published as Linked Open Data (LOD) limiting its usability.We provide a conceptual model for the open data comprised in statistic files published by INSEE, the leading French economic and societal statistics institute. Then, we describe a novel method for extracting RDF LOD populating an instance of this conceptual model. Our method was used to produce RDF data out of 20k+ Excel spreadsheets, and our validation indicates a 91% rate of successful extraction.

show abstract

Extracting Statistical Mentions from Textual Claims to Provide Trusted Content

Cao

Manolescu

Tannier

2019

View full text Add to dashboard Cite

To cite this version:Tien Duc Cao, Ioana Manolescu, Xavier Tannier. Extracting statistical mentions from textual claims to provide trusted content.Abstract. Claims on statistic (numerical) data, e.g., immigrant populations, are often fact-checked. We present a novel approach to extract from text documents, e.g., online media articles, mentions of statistic entities from a reference source. A claim states that an entity has certain value, at a certain time. This completes a fact-checking pipeline from text, to the reference data closest to the claim. We evaluated our method on the INSEE dataset and show that it is efficient and effective.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tien Duc Cao

Extracting linked data from statistic spreadsheets

Extracting Statistical Mentions from Textual Claims to Provide Trusted Content

Contact Info

Product

Resources

About