Kevin Jiao scite author profile

Kevin Jiao

4Publications

6Citation Statements Received

33Citation Statements Given

How they've been cited

How they cite others

109

Affiliations

New York University, Management Sciences (United States)

Publications

Order By: Most citations

Data-Driven Investment Strategies for Peer-to-Peer Lending: A Case Study for Teaching Data Science

Cohen

Guetta²,

Jiao

et al. 2018

Big Data

View full text Add to dashboard Cite

We develop a number of data-driven investment strategies that demonstrate how machine learning and data analytics can be used to guide investments in peer-to-peer loans. We detail the process starting with the acquisition of (real) data from a peer-to-peer lending platform all the way to the development and evaluation of investment strategies based on a variety of approaches. We focus heavily on how to apply and evaluate the data science methods, and resulting strategies, in a real-world business setting. The material presented in this article can be used by instructors who teach data science courses, at the undergraduate or graduate levels. Importantly, we go beyond just evaluating predictive performance of models, to assess how well the strategies would actually perform, using real, publicly available data. Our treatment is comprehensive and ranges from qualitative to technical, but is also modular—which gives instructors the flexibility to focus on specific parts of the case, depending on the topics they want to cover. The learning concepts include the following: data cleaning and ingestion, classification/probability estimation modeling, regression modeling, analytical engineering, calibration curves, data leakage, evaluation of model performance, basic portfolio optimization, evaluation of investment strategies, and using Python for data science.

show abstract

Data Aggregation and Demand Prediction

Cohen

Zhang

Jiao

2022

Operations Research

View full text Add to dashboard Cite

High accuracy in demand prediction allows retailers to effectively manage their inventory and mitigate stock-outs and excess supply. A typical retail setting involves predicting the demand for hundreds of items simultaneously, some with abundant historical data and others with scarce data. In “Data Aggregation and Demand Prediction,” Cohen, Zhang, and Jiao propose a novel practical method, called data aggregation with clustering (DAC), which balances the tradeoff between data aggregation and model flexibility. DAC empowers retailers to predict demand while optimally identifying the features that should be estimated at the item, cluster, and aggregate levels. Theoretically, DAC yields a consistent estimate, along with improved prediction errors relative to the benchmark that estimates a different model for each item. Practically, DAC yields a higher demand prediction accuracy relative to many common benchmarks using a real data set from a large online retailer.

show abstract

Data Aggregation and Demand Prediction

2019

View full text Add to dashboard Cite

Bayesian Decision Process for Cost-Efficient Dynamic Ranking via Crowdsourcing

Chen¹,

Jiao²,

Lin³

2016

Preprint

View full text Add to dashboard Cite

Rank aggregation based on pairwise comparisons over a set of items has a wide range of applications. Although considerable research has been devoted to the development of rank aggregation algorithms, one basic question is how to efficiently collect a large amount of high-quality pairwise comparisons for the ranking purpose. Because of the advent of many crowdsourcing services, a crowd of workers are often hired to conduct pairwise comparisons with a small monetary reward for each pair they compare. Since different workers have different levels of reliability and different pairs have different levels of ambiguity, it is desirable to wisely allocate the limited budget for comparisons among the pairs of items and workers so that the global ranking can be accurately inferred from the comparison results. To this end, we model the active sampling problem in crowdsourced ranking as a Bayesian Markov decision process, which dynamically selects item pairs and workers to improve the ranking accuracy under a budget constraint. We further develop a computationally efficient sampling policy based on knowledge gradient as well as a moment matching technique for posterior approximation. Experimental evaluations on both synthetic and real data show that the proposed policy achieves high ranking accuracy with a lower labeling cost.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Kevin Jiao

Data-Driven Investment Strategies for Peer-to-Peer Lending: A Case Study for Teaching Data Science

Data Aggregation and Demand Prediction

Data Aggregation and Demand Prediction

Bayesian Decision Process for Cost-Efficient Dynamic Ranking via Crowdsourcing

Contact Info

Product

Resources

About