2018
DOI: 10.3390/jimaging4100116
|View full text |Cite
|
Sign up to set email alerts
|

ECRU: An Encoder-Decoder Based Convolution Neural Network (CNN) for Road-Scene Understanding

Abstract: This research presents the idea of a novel fully-Convolutional Neural Network (CNN)-based model for probabilistic pixel-wise segmentation, titled Encoder-decoder-based CNN for Road-Scene Understanding (ECRU). Lately, scene understanding has become an evolving research area, and semantic segmentation is the most recent method for visual recognition. Among vision-based smart systems, the driving assistance system turns out to be a much preferred research topic. The proposed model is an encoder-decoder that perfo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
7
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 19 publications
(10 citation statements)
references
References 50 publications
0
7
0
1
Order By: Relevance
“…5) está establecido de acuerdo con el número de filtros (8, 16, 32 y 64) aplicados por capa; así el encoder se compone por 5 capas convolucionales, con filtros de 3x3 que crean mapas de características que capturan patrones en la imagen, en la que el volumen de salida está determinado según el número de filtros. A cada capa le sigue una unidad lineal rectificada (ReLU) que anula los valores negativos y deja pasar a los positivos a la siguiente capa tal como entran (Yasrab, 2018). La operación de MaxPooling de 2x2 realiza un muestreo descendente (downsampling), en donde, en cada contracción las dimensiones son reducidas a la mitad.…”
Section: Creación Del Modelounclassified
“…5) está establecido de acuerdo con el número de filtros (8, 16, 32 y 64) aplicados por capa; así el encoder se compone por 5 capas convolucionales, con filtros de 3x3 que crean mapas de características que capturan patrones en la imagen, en la que el volumen de salida está determinado según el número de filtros. A cada capa le sigue una unidad lineal rectificada (ReLU) que anula los valores negativos y deja pasar a los positivos a la siguiente capa tal como entran (Yasrab, 2018). La operación de MaxPooling de 2x2 realiza un muestreo descendente (downsampling), en donde, en cada contracción las dimensiones son reducidas a la mitad.…”
Section: Creación Del Modelounclassified
“…To achieve this task, the classifiers existing in the literature, such as SoftMax, SVM with various kernels (Linear, RBF, and Cubic) [3,12,33,[35][36][37], DA (Linear, and Quadratic) [12,33] and KNN (Fine, and Cubic) [12,33] are employed to accomplish the task. The earlier research also presents the similar medical image assessment tasks which implemented classifiers [38][39][40][41].…”
Section: Selected Featurementioning
confidence: 99%
“…In the conventional SegNet, a rectified linear unit (ReLU) activation function is used in the convolution block; however, we replaced ReLU with an exponential linear unit (ELU) [19]. Many experiments have been performed with different activation functions, and ELU has been found to exhibit the best performance in this architecture [20].…”
Section: Encoder-decoder Architectures For Prostate Segmentationmentioning
confidence: 99%