2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018
DOI: 10.1109/cvpr.2018.00065
|View full text |Cite
|
Sign up to set email alerts
|

Deep Texture Manifold for Ground Terrain Recognition

Abstract: We present a texture network called Deep Encoding Pooling Network (DEP) for the task of ground terrain recognition. Recognition of ground terrain is an important task in establishing robot or vehicular control parameters, as well as for localization within an outdoor environment. The architecture of DEP integrates orderless texture details and local spatial information and the performance of DEP surpasses state-of-the-art methods for this task. The GTOS database (comprised of over 30,000 images of 40 classes o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
117
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 104 publications
(119 citation statements)
references
References 40 publications
1
117
1
Order By: Relevance
“…This makes sense since objects and scenes could serve as the basis to construct arbitrary action videos and the semantic representation can alleviate such visual gaps. The motivation can also be ascribed to the success of CNNs [49,51,48]. With the help of off-the-shelf object detectors, such methods [28] could even perform zero-shot spatio-temporal action localization.…”
Section: Related Workmentioning
confidence: 99%
“…This makes sense since objects and scenes could serve as the basis to construct arbitrary action videos and the semantic representation can alleviate such visual gaps. The motivation can also be ascribed to the success of CNNs [49,51,48]. With the help of off-the-shelf object detectors, such methods [28] could even perform zero-shot spatio-temporal action localization.…”
Section: Related Workmentioning
confidence: 99%
“…All algorithms will be evaluated on the computer based on the gathered data. Among the terrains listed in [ 46 ], we select six terrains on which a robot is most likely to traverse to do the experiment. As shown in Figure 3 , some of them are artificial terrains (e.g., asphalt road), while some are natural ones (e.g., natural grass).…”
Section: Experimental Verificationmentioning
confidence: 99%
“…However, this tandem method is to extract the texture features after the convolution features, and there is no effective fusion of the convolution features and the texture features. Later, Xue et al [34] proposed DEP that inputs the output features of the convolution layer into the coding layer and the pooling layer at the same time. It can not only extract the texture features of the image, but also preserve the convolution features of the CNN.…”
Section: Related Workmentioning
confidence: 99%
“…In [34], Xue et al expressed texture features by the orderless characteristic of texture, but the texture features can have different expressions in fact. In this paper, we want to extract the detailed information in the texture feature.…”
Section: Related Workmentioning
confidence: 99%