Efficient object localization using Convolutional Networks

Tompson, Jonathan; Goroshin, Ross; Jain, Arjun; LeCun, Yann; Bregler, Christoph

doi:10.1109/cvpr.2015.7298664

Cited by 1,203 publications

(898 citation statements)

References 18 publications

Supporting

Mentioning

831

Contrasting

Unclassified

Order By: Relevance

“…This serves to increase efficency and reduce memory usage of their method while improving localization performance in the high precision range [16]. One consideration is that for many failure cases a refinement of position within a local window would not offer much improvement since error cases often consist of either occluded or misattributed limbs.…”

Section: Related Workmentioning

confidence: 99%

“…To improve performance at high precision thresholds the prediction is offset by a quarter of a pixel in the direction of its next highest neighbor before transforming back to the original coordinate space of the image. In MPII Human Pose, some joints do not have a corresponding [1] 76.5 59.1 Toshev et al [24] 92.3 82.0 Tompson et al [16] 93.1 89.0 Chen et al [25] 95.3 92.4 Wei et al [18] 97.6 95.0 Our model 99.0 97.0 ground truth annotation. In these cases the joint is either truncated or severely occluded, so for supervision a ground truth heatmap of all zeros is provided.…”

Section: Training Detailsmentioning

confidence: 99%

“…Recent pose estimation systems [15][16][17][18][19][20] have universally adopted ConvNets as their main building block, largely replacing hand-crafted features and graphical models; this strategy has yielded drastic improvements on standard benchmarks [1,21,22].…”

Section: Arxiv:160306937v2 [Cscv] 26 Jul 2016mentioning

confidence: 99%

See 2 more Smart Citations

Stacked Hourglass Networks for Human Pose Estimation

Newell

Yang

Deng

2016

Lecture Notes in Computer Science

4,115

4,081

View full text Add to dashboard Cite

Abstract. This work introduces a novel convolutional network architecture for the task of human pose estimation. Features are processed across all scales and consolidated to best capture the various spatial relationships associated with the body. We show how repeated bottom-up, top-down processing used in conjunction with intermediate supervision is critical to improving the performance of the network. We refer to the architecture as a "stacked hourglass" network based on the successive steps of pooling and upsampling that are done to produce a final set of predictions. State-of-the-art results are achieved on the FLIC and MPII benchmarks outcompeting all recent methods. Keywords: Human Pose Estimation

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Training Detailsmentioning

confidence: 99%

See 1 more Smart Citation

Stacked Hourglass Networks for Human Pose Estimation

Newell

Yang

Deng

2016

Lecture Notes in Computer Science

4,115

4,081

View full text Add to dashboard Cite

show abstract

“…While the tree-structured models provide efficient inference, they struggle to model long-range characteristics of the human body. With the progress in convolutional neural network architectures, more recent works adopt CNNs to obtain stronger part detectors but still use graphical models to obtain coherent pose estimates [4,6,7].…”

Section: Related Workmentioning

confidence: 99%

“…This is mainly due to the availability of deep learning based methods for detecting joints [1][2][3][4][5]. While earlier approaches in this direction [4,6,7] combine the body part detectors with tree structured graphical models, more recent methods [1][2][3][8][9][10] demonstrate that spatial relations between joints can be directly learned by a neural network without the need of an additional graphical model. These approaches, however, assume that only a single person is visible in the image and the location of the person is known a-priori.…”

Section: Introductionmentioning

confidence: 99%

Multi-person Pose Estimation with Local Joint-to-Person Associations

Iqbal

Gall

2016

Lecture Notes in Computer Science

127

View full text Add to dashboard Cite

Abstract. Despite of the recent success of neural networks for human pose estimation, current approaches are limited to pose estimation of a single person and cannot handle humans in groups or crowds. In this work, we propose a method that estimates the poses of multiple persons in an image in which a person can be occluded by another person or might be truncated. To this end, we consider multiperson pose estimation as a joint-to-person association problem. We construct a fully connected graph from a set of detected joint candidates in an image and resolve the joint-to-person association and outlier detection using integer linear programming. Since solving joint-to-person association jointly for all persons in an image is an NP-hard problem and even approximations are expensive, we solve the problem locally for each person. On the challenging MPII Human Pose Dataset for multiple persons, our approach achieves the accuracy of a state-of-the-art method, but it is 6,000 to 19,000 times faster.

show abstract

Analysis of Electrochemical Impedance Data: Use of Deep Neural Networks

Doonyapisut

Kannan

Kim

et al. 2023

Advanced Intelligent Systems

View full text Add to dashboard Cite

Technology advancements in energy storage, photocatalysis, and sensors have generated enormous impedimetric data. Electrochemical impedance spectroscopy (EIS) results play an essential role in analyzing the interfacial properties of materials. Nonetheless, in many situations, the data is misinterpreted due to the complexity of the electrochemical system or the compromise between the experimental result and the theoretical model, resulting in partiality in the interpretation process, especially for the impedimetric results. Typically, the experimenter interprets impedimetric results using a searching approach based on a theoretical model until the best‐fitting model is obtained, which is a time‐consuming process, and errors can occur. To reduce misinterpretation by the experimenter, herein, the machine‐learning strategy is demonstrated for the classification of an EIS circuit model and parameter prediction using a deep neural network (DNN). The DNN model shows a highly accurate classifier for the commonly used EIS circuit with an average area under the receiver operating characteristic curve of more than 0.95. Additionally, the model demonstrates high accuracy in the prediction of EIS parameters on a complex EIS system, with a maximum R2 of 0.999. These reveal that the machine‐learning strategy may open a new room for studying electrochemical systems.

show abstract

Efficient object localization using Convolutional Networks

Cited by 1,203 publications

References 18 publications

Stacked Hourglass Networks for Human Pose Estimation

Stacked Hourglass Networks for Human Pose Estimation

Multi-person Pose Estimation with Local Joint-to-Person Associations

Analysis of Electrochemical Impedance Data: Use of Deep Neural Networks

Contact Info

Product

Resources

About