2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017
DOI: 10.1109/cvpr.2017.635
|View full text |Cite
|
Sign up to set email alerts
|

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

Abstract: Multi-instance multi-label (MIML) learning has many interesting applications in computer visions, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding-boxes, image captions and descriptions is often available during training phrase, which is referred as privileged information (PI). However, as existing works on learning using PI only consider instance-level PI (privileged instances), they fail to make use of bag-level PI (privileged bags… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
35
0

Year Published

2018
2018
2020
2020

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 57 publications
(36 citation statements)
references
References 23 publications
1
35
0
Order By: Relevance
“…Information Dropout [1]: a regularization method that utilizes injection of multiplicative noise in the activations of a deep neural network (but as a function of the input x, not x ). MIML-FCN [44]: a CNN-based LUPI framework designed for multi-instance problems. Our problem is not multiple instance; however, we still make a comparison for the sake of completeness.…”
Section: Datasetsmentioning
confidence: 99%
“…Information Dropout [1]: a regularization method that utilizes injection of multiplicative noise in the activations of a deep neural network (but as a function of the input x, not x ). MIML-FCN [44]: a CNN-based LUPI framework designed for multi-instance problems. Our problem is not multiple instance; however, we still make a comparison for the sake of completeness.…”
Section: Datasetsmentioning
confidence: 99%
“…In the computer vision community, PI, which is interpreted as the auxiliary information about the training data, can be used for learning better recognition systems. Recently, extensive efforts are devoted to exploring PI cues for enhancing the model training in a variety of vision tasks [22,8,13,14,9,10,11,6,5,4].…”
Section: Learning Using Privileged Informationmentioning
confidence: 99%
“…During the past few years, many related algorithms have been investigated and developed for MIML problems [3,2,16]. The MIML formulation has also been applied in many practical vision domains, such as image annotation [36,25] and classification tasks [37,6,38,41].…”
Section: Multi-instance Multi-label Learningmentioning
confidence: 99%