Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Sun, Chen; Shrivastava, Abhinav; Singh, Saurabh; Gupta, Abhinav

doi:10.1109/iccv.2017.97

Cited by 1,775 publications

(1,193 citation statements)

References 49 publications

Supporting

Mentioning: 1,153

Contrasting

Unclassified


“…Machine learning tools and improved ability to gather data provides the opportunity to learn more sophisticated WOC methods in a data-driven fashion (Rokach, 2010; Bachrach et al, 2012; Polikar, 2012; Sun et al, 2017). We are likely to learn much more about effective strategies of opinion aggregation through their widespread adoption.…”

Section: Discussion (mentioning)

confidence: 99%


Rescuing Collective Wisdom when the Average Group Opinion Is Wrong

2017


The total knowledge contained within a collective supersedes the knowledge of even its most intelligent member. Yet the collective knowledge will remain inaccessible to us unless we are able to find efficient knowledge aggregation methods that produce reliable decisions based on the behavior or opinions of the collective's members. It is often stated that simple averaging of a pool of opinions is a good and in many cases the optimal way to extract knowledge from a crowd. The method of averaging has been applied to analysis of decision-making in very different fields, such as forecasting, collective animal behavior, individual psychology, and machine learning. Two mathematical theorems, Condorcet's theorem and Jensen's inequality, provide a general theoretical justification for the averaging procedure. Yet the necessary conditions which guarantee the applicability of these theorems are often not met in practice. Under such circumstances, averaging can lead to suboptimal and sometimes very poor performance. Practitioners in many different fields have independently developed procedures to counteract the failures of averaging. We review such knowledge aggregation procedures and interpret the methods in the light of a statistical decision theory framework to explain when their application is justified. Our analysis indicates that in the ideal case, there should be a matching between the aggregation procedure and the nature of the knowledge distribution, correlations, and associated error costs. This leads us to explore how machine learning techniques can be used to extract near-optimal decision rules in a data-driven manner. We end with a discussion of open frontiers in the domain of knowledge aggregation and collective intelligence in general.
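The abstract's point that Condorcet's theorem only justifies aggregation under certain conditions can be illustrated with a minimal sketch. It assumes independent voters who each pick the correct one of two options with the same probability; the function name `majority_correct` and the example probabilities are hypothetical and are not taken from the cited paper.

```python
from math import comb


def majority_correct(n_voters: int, p_correct: float) -> float:
    """Probability that a strict majority of independent voters is correct.

    Assumes a binary choice and that every voter is correct with the
    same probability p_correct (the Condorcet jury theorem setting).
    """
    if n_voters % 2 == 0:
        raise ValueError("use an odd number of voters to avoid ties")
    needed = n_voters // 2 + 1  # smallest strict majority
    return sum(
        comb(n_voters, k) * p_correct**k * (1 - p_correct) ** (n_voters - k)
        for k in range(needed, n_voters + 1)
    )


# Compare a crowd that is slightly better than chance with one that is
# slightly worse, for growing crowd sizes.
for p in (0.6, 0.4):
    print(p, [round(majority_correct(n, p), 3) for n in (1, 11, 101)])
```

When individual accuracy is above 0.5, the majority becomes more reliable as the crowd grows; when it is below 0.5, the same aggregation rule makes the group worse than any single member, which is one way the conditions of the theorem can fail in practice.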



“…Recent technological advances have also opened up the possibility of gathering very large datasets from which collective wisdom can be extracted (Sun et al, 2017). Large datasets allow researchers to consider and reliably test increasingly complex methodologies of opinion aggregation.…”

Section: Introduction (mentioning)

confidence: 99%

Rescuing Collective Wisdom when the Average Group Opinion Is Wrong

2017


“…tissue image in this application) are very unique compared to those of the training data, then the trained machine learning models will not recognize the input. As suggested by a recent work on the relationship between deep learning and training data [50], this problem can be solved by using more training data and fine tuning the neural network in our future work. In case of having extremely unique samples, a rejection option [51, 52] can be incorporated to the machine learning models, and the enhanced models can refuse to make a decision if the patterns of input are very different from those of the training data.…”

Section: Discussion (mentioning)

confidence: 99%

A deep learning approach to estimate chemically-treated collagenous tissue nonlinear anisotropic stress-strain responses from microscopy images

2017


Biological collagenous tissues comprised of networks of collagen fibers are suitable for a broad spectrum of medical applications owing to their attractive mechanical properties. In this study, we developed a noninvasive approach to estimate collagenous tissue elastic properties directly from microscopy images using Machine Learning (ML) techniques. Glutaraldehyde-treated bovine pericardium (GLBP) tissue, widely used in the fabrication of bioprosthetic heart valves and vascular patches, was chosen to develop a representative application. A Deep Learning model was designed and trained to process second harmonic generation (SHG) images of collagen networks in GLBP tissue samples, and directly predict the tissue elastic mechanical properties. The trained model is capable of identifying the overall tissue stiffness with a classification accuracy of 84%, and predicting the nonlinear anisotropic stress-strain curves with average regression errors of 0.021 and 0.031. Thus, this study demonstrates the feasibility and great potential of using the Deep Learning approach for fast and noninvasive assessment of collagenous tissue elastic properties from microstructural images.
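The citation statement from this paper mentions a rejection option, where a model declines to decide when an input is very different from the training data. A minimal sketch of that idea follows. It swaps in a simple nearest-neighbor distance check, which is an assumption made for illustration and not the method of the paper or of its references [51, 52]; `predict_or_reject`, `max_distance`, and the toy feature vectors are all hypothetical.

```python
import math


def nearest_distance(x, training_inputs):
    """Euclidean distance from x to its closest training input."""
    return min(math.dist(x, t) for t in training_inputs)


def predict_or_reject(x, training_inputs, predict, max_distance):
    """Return predict(x), or None when x is too far from the training data.

    max_distance is a hypothetical threshold that would have to be tuned
    on held-out data; None signals "refuse to decide".
    """
    if nearest_distance(x, training_inputs) > max_distance:
        return None
    return predict(x)


# Toy usage: two-dimensional feature vectors and a stand-in model.
training_inputs = [(0.0, 0.0), (1.0, 1.0)]
stand_in_model = lambda x: "stiff" if sum(x) > 1.0 else "soft"

print(predict_or_reject((0.1, 0.1), training_inputs, stand_in_model, 0.5))
print(predict_or_reject((5.0, 5.0), training_inputs, stand_in_model, 0.5))
```

For image inputs such as the microscopy data described here, the distance would more plausibly be computed on learned features than on raw pixels; that choice is left open in this sketch.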


“…(Pratt, 2017) Simultaneously, research on technology improvement across industries suggests that there is a power-law relationship between production and performance: A doubling of production leads to a constant improvement in performance (as measured by cost or other characteristics) (Nagy et al, 2013). A similar relationship may exist for the performance of machine learning algorithms and data sets (Sun et al, 2017). For HAVs, this suggests that achieving gains that some might consider "near perfect" may take much more effort and time than reaching better-than-average human performance, which may itself be still out of reach.…”

Section: What Does the Evidence Suggest About The Conditions That Lea… (mentioning)

confidence: 99%

The Enemy of Good: Estimating the Cost of Waiting for Nearly Perfect Automated Vehicles

Kalra, Groves

2017


This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited. Permission is given to duplicate this document for personal use only, as long as it is unaltered and complete. Permission is required from RAND to reproduce, or reuse in another form, any of its research documents for commercial use. For information on reprint and linking permissions, please visit www.rand.org/pubs/permissions.

The RAND Corporation is a research organization that develops solutions to public policy challenges to help make communities throughout the world safer and more secure, healthier and more prosperous. RAND is nonprofit, nonpartisan, and committed to the public interest.

RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.

Support RAND: Make a tax-deductible charitable contribution at www.rand.org/giving/contribute

www.rand.org

For more information on this publication, visit www.rand.org/t/RR2150

Library of Congress Cataloging-in-Publication Data is available for this publication. ISBN: 978-0-8330-9937-2

Published by the RAND Corporation, Santa Monica, Calif. © Copyright 2017 RAND Corporation. R® is a registered trademark. Cover image: AP Photo/Gene J. Puskar

Preface

The RAND Corporation has a long history of research on intelligent systems. Since the 1950s, with work on chess-playing computers and the Logic Theory Machine, RAND has produced objective, evidence-based research to help inform how society can harness the benefits and manage the risks of intelligent, transformative technologies. RAND's work on autonomous and automated vehicles builds on this firm foundation.

RAND Science, Technology, and Policy

This research was conducted in the RAND Science, Technology, and Policy program, which focuses primarily on the role of scientific development and technological innovation in human behavior, global and regional decisionmaking as it relates to science and technology, and the concurrent effects that science and technology have on policy analysis and policy choices.

This program is part of RAND Justice, Infrastructure, and Environment, a division of the RAND Corporation dedicated to improving policy- and decisionmaking in a wide range of policy domains, including civil and criminal justice, infrastructure development and financing, environmental policy, transportation planning and technology, immigration and border protection, public and occupational safety, energy policy, science and innovation policy, space, and telecommunications.

During the development of this report and at the time of publication, co-author Nidhi Kalra's spouse served as co-founder and president of Nuro, a machine-learning and robotics start-up company engaged in autonomous vehicle development. He previously served as a principal engineer for Google's driverless car proje...
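The citation statement from this report describes a power-law relationship in which each doubling of production yields a constant improvement. A minimal numerical sketch of why that follows from a power law is given here. It assumes an error curve of the form error = scale * n^(-exponent); the function names and the values of `scale` and `exponent` are hypothetical and are not figures from the report, from Nagy et al, or from Sun et al.

```python
import math


def power_law_error(n_samples, scale=1.0, exponent=0.5):
    """Error under an assumed power law: scale * n_samples ** (-exponent)."""
    return scale * n_samples ** (-exponent)


def doubling_ratio(n_samples, scale=1.0, exponent=0.5):
    """Factor by which the error shrinks when n_samples is doubled."""
    return power_law_error(2 * n_samples, scale, exponent) / power_law_error(
        n_samples, scale, exponent
    )


def doublings_needed(current_error, target_error, exponent=0.5):
    """Number of doublings required to move from current to target error."""
    return math.log2(current_error / target_error) / exponent


# The ratio is the same at every scale: 2 ** (-exponent).
print(doubling_ratio(10), doubling_ratio(1_000_000))

# Each further reduction of the error costs the same number of doublings,
# so the absolute amount of data required grows very quickly.
print(doublings_needed(0.10, 0.05), doublings_needed(0.10, 0.01))
```

Under these assumptions the improvement per doubling is a constant proportion, which is consistent with the statement's point that "near perfect" performance may take far more effort than earlier gains.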

