Charese Smiley scite author profile

Charese Smiley

5 Publications

113 Citation Statements Received

44 Citation Statements Given

Affiliations

JPMorgan Chase & Co (United States), Thomson Reuters (United States), Indiana University Bloomington

Publications

FinQA: A Dataset of Numerical Reasoning over Financial Data

Chen, Chen, Smiley et al. 2021

The sheer volume of financial statements makes it difficult for humans to access and analyze a business's financials. Robust numerical reasoning likewise faces unique challenges in this domain. In this work, we focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents. In contrast to existing tasks in the general domain, the finance domain includes complex numerical reasoning and understanding of heterogeneous representations. To facilitate analytical progress, we propose a new large-scale dataset, FINQA, with Question-Answering pairs over Financial reports, written by financial experts. We also annotate the gold reasoning programs to ensure full explainability. We further introduce baselines and conduct comprehensive experiments on our dataset. The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge and in complex multi-step numerical reasoning on that knowledge. Our dataset, the first of its kind, should therefore enable significant, new community research into complex application domains. The dataset and code are publicly available.

TR Discover: A Natural Language Interface for Querying and Analyzing Interlinked Datasets

Song, Schilder, Smiley et al. 2015

Currently, the dominant technology for providing non-technical users with access to Linked Data is keyword-based search. This is problematic because keywords are often inadequate as a means for expressing user intent. In addition, while a structured query language can provide convenient access to the information needed by advanced analytics, unstructured keyword-based search cannot meet this extremely common need. This makes it harder than necessary for non-technical users to generate analytics. We address these difficulties by developing a natural language-based system that allows non-technical users to create well-formed questions. Our system, called TR Discover, maps from a fragment of English into an intermediate First Order Logic representation, which is in turn mapped into SPARQL or SQL. The mapping from natural language to logic makes crucial use of a feature-based grammar with full formal semantics. The fragment of English covered by the natural language grammar is domain specific and tuned to the kinds of questions that the system can handle. Because users will not necessarily know what the coverage of the system is, TR Discover offers a novel auto-suggest mechanism that can help users to construct well-formed and useful natural language questions. TR Discover was developed for future use with Thomson Reuters Cortellis, which is an existing product built on top of a linked data system targeting the pharmaceutical domain. Currently, users access it via a keyword-based query interface. We report results and performance measures for TR Discover on Cortellis and, to demonstrate the portability of the system, on the QALD-4 dataset, which is associated with a public shared task. We show that the system is usable and portable, and report on the relative performance of queries using SQL and SPARQL back ends.

Building and Querying an Enterprise Knowledge Graph

Song, Schilder, Hertz et al. 2019

IEEE Trans. Serv. Comput.

The E2E NLG Challenge: A Tale of Two Systems

Smiley, Davoodi, Song et al. 2018

This paper presents the two systems we entered into the 2017 E2E NLG Challenge: TemplGen, a template-based system, and SeqGen, a neural network-based system. In the automatic evaluation, SeqGen achieved competitive results compared to the template-based approach and to other participating systems. In addition to the automatic evaluation, we present and discuss the human evaluation results of our two systems.

Say the Right Thing Right: Ethics Issues in Natural Language Generation Systems

Smiley, Schilder, Plachouras et al. 2017

