An End-to-End Deep Learning Network for 3D Object Detection From RGB-D Data Based on Hough Voting

Yan, Ming; Li, ZhongTong; Yu, Xinyan; Jin, Cong

doi:10.1109/access.2020.3012695

Cited by 25 publications

(18 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The target-domain detector improves the one-class universal detector by mining box-level pseudo ground facts in each iteration [20]. Yan et al [21] designed a deep learning network for object detection based on merging the geometric data (3D) and texture data of two-dimensional (2D). To solve the issue of one sensor, they used an inverse mapping level and a gathering level to merge the one or more input of RGB datum with the geometric input of point cloud data and designed a top gathering layer to transact with the data of multiple vision cameras.…”

Section: Literature Reviewmentioning

confidence: 99%

“…Deductively, the type of dataset differs from one research to another based on the purpose of the research, whether to improve results or to a specific application, for example in [24] used object detection for surveillance purposes, they used specific objects classification in airports, such as people, bags, trolleys. While in [21], [25], [26], the authors studied 3D object detection using RGB-D data scenes in outdoor [27] and indoor [28]- [30], or both. some of selected related studies will be described in details in the following.…”

Section: Literature Reviewmentioning

confidence: 99%

See 1 more Smart Citation

Hybrid features for object detection in RGB-D scenes

Awwad

Igried

Wedyan

et al. 2021

IJEECS

View full text Add to dashboard Cite

<div>Object detection is considered a hot research topic in applications of artificial intel-ligence and computer vision. Historically, object detection was widely used in var-ious fields like surveillance, fine-grained activities and robotics. All studies focus on improving accuracy for object detection using images, whether indoor or outdoor scenes. Therefore, this paper took a shot by improving the doable features extraction and proposing crossed sliding window approach using exiting classifiers for object de-tection. In this paper, the contribution includes two parts: First, improving local depth pattern feature along side SIFT and the second part explains a new technique presented by proposing crossed sliding window approach using two different types of images (colored and depth). Two types of features local depth patterns for detection (LDPD) and scale-invariant feature transform (SIFT) were merged as one feature vector. The RGB-D object dataset has been used and it consists of 300 different objects and in-cludes thousands of scenes. The proposed approach achieved high results comparing to other features or separated features that are used in this paper. All experiments and comparatives were applied on the same dataset for the same objective. Experimental results report a high accuracy in terms of detection rate, recall, precision and F1 scorein RGB-D scenes.</div>

show abstract

Section: Literature Reviewmentioning

confidence: 99%

Section: Literature Reviewmentioning

confidence: 99%

Hybrid features for object detection in RGB-D scenes

Awwad

Igried

Wedyan

et al. 2021

IJEECS

View full text Add to dashboard Cite

show abstract

“…Recently, with the breakthrough progress of artificial intelligence technology, intelligent vehicles with the advanced driving assistance system (ADAS) are vigorously launched on the market [1,2]. The ADAS of the intelligent vehicle collects the surrounding data from the sensors like radars and cameras and then performs road object detection and so on.…”

Section: Introductionmentioning

confidence: 99%

A Novel Vehicle Detection Framework Based on Parallel Vision

Zhuo

Lan

Zheng

et al. 2022

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

Autonomous driving has become a prevalent research topic in recent years, arousing the attention of many academic universities and commercial companies. As human drivers rely on visual information to discern road conditions and make driving decisions, autonomous driving calls for vision systems such as vehicle detection models. These vision models require a large amount of labeled data while collecting and annotating the real traffic data are time-consuming and costly. Therefore, we present a novel vehicle detection framework based on the parallel vision to tackle the above issue, using the specially designed virtual data to help train the vehicle detection model. We also propose a method to construct large-scale artificial scenes and generate the virtual data for the vision-based autonomous driving schemes. Experimental results verify the effectiveness of our proposed framework, demonstrating that the combination of virtual and real data has better performance for training the vehicle detection model than the only use of real data.

show abstract

“…With the increasing demand for large-scale training sets for deep learning models [12][13][14][15][16][17][18][19][20][21][22], some researchers propose self-supervised learning methods [23][24][25][26], which train CNNs by automatic generation of labels based on the structure or characteristics of the image itself. We observe that each attribute of the object corresponds to a particular part of the object region.…”

Section: Introductionmentioning

confidence: 99%

SAL‐Net: Self‐Supervised Attribute Learning for Object Recognition and Segmentation

Yang

Wang

Arif

et al. 2021

Wireless Communications and Mobile Computing

View full text Add to dashboard Cite

Existing attribute learning methods rely on predefined attributes, which require manual annotations. Due to the limitation of human experience, the predefined attributes are not capable enough of providing enough description. This paper proposes a self-supervised attribute learning (SAL) method, which automatically generates attribute descriptions by differentially occluding the object region to deal with the above problems. The relationship between attributes is formulated with triplet loss functions and is utilized to supervise the CNN. Attribute learning is used as an auxiliary task of a multitask image classification and segmentation network, in which self-supervision of attributes motivates the CNN to learn more discriminative features for the main semantic tasks. Experimental results on public benchmarks CUB-2011 and Pascal VOC show that the proposed SAL-Net can obtain more accurate classification and segmentation results without additional annotations. Moreover, the SAL-Net is embedded into a multiobject recognition and segmentation system, which realizes instance-aware semantic segmentation with the help of a region proposal algorithm and a fusion nonmaximum suppression algorithm.

show abstract

An End-to-End Deep Learning Network for 3D Object Detection From RGB-D Data Based on Hough Voting

Cited by 25 publications

References 34 publications

Hybrid features for object detection in RGB-D scenes

Hybrid features for object detection in RGB-D scenes

A Novel Vehicle Detection Framework Based on Parallel Vision

SAL‐Net: Self‐Supervised Attribute Learning for Object Recognition and Segmentation

Contact Info

Product

Resources

About