2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
DOI: 10.1109/dsaa.2019.00017

Residual Networks Behave Like Boosting Algorithms

Abstract: We show that Residual Networks (ResNet) are equivalent to boosting feature representations, without any modification to the underlying ResNet training algorithm. A regret bound based on Online Gradient Boosting theory is proved, suggesting that ResNet can achieve Online Gradient Boosting regret bounds through architectural changes alone: adding a shrinkage parameter to the identity skip-connections and using residual modules with max-norm bounds. Through this relation between ResNet and …
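A minimal sketch (assumed PyTorch; not the authors' code) of the architectural change the abstract describes: a residual block whose identity skip-connection carries an explicit shrinkage parameter, with the residual module kept under a max-norm bound via weight clipping. The class name, the clipping scheme, and the default values of gamma and max_norm are illustrative assumptions.

```python
# Illustrative sketch only (assumed PyTorch, not the paper's code): a residual
# block with a shrinkage parameter "gamma" on the identity skip-connection and
# a max-norm bound on the residual module, approximated here by weight clipping.
import torch
import torch.nn as nn


class ShrinkageResidualBlock(nn.Module):
    def __init__(self, dim: int, gamma: float = 0.9, max_norm: float = 1.0):
        super().__init__()
        self.gamma = gamma        # shrinkage on the identity skip-connection
        self.max_norm = max_norm  # bound on the residual module's weight norms
        self.residual = nn.Sequential(
            nn.Linear(dim, dim),
            nn.ReLU(),
            nn.Linear(dim, dim),
        )

    def clip_weights(self) -> None:
        # One way to realize "residual modules with max-norm bounds":
        # rescale any weight matrix whose norm exceeds the bound.
        with torch.no_grad():
            for p in self.residual.parameters():
                if p.dim() > 1 and p.norm() > self.max_norm:
                    p.mul_(self.max_norm / p.norm())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Boosting view: each block adds a weak-learner correction to the
        # shrunken identity path.
        return self.gamma * x + self.residual(x)
```

Stacking L such blocks telescopes into gamma^L times the input plus a weighted sum of the residual modules' outputs, which is the ensemble-of-weak-learners reading of ResNet that the paper formalizes.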

Cited by 7 publications (5 citation statements); references 14 publications.
“…Literature shows that the boosting concept is the backbone behind well-known architectures like Deep Residual Networks (He et al., 2016; Siu, 2019) and AdaNet (Cortes et al., 2017). The theoretical background for the success of Deep Residual Networks (DeepResNet) (He et al., 2016) was explained in the context of boosting theory (Huang et al., 2018).…”
Section: Boosting (mentioning)
confidence: 99%
“…As shown in Figure 4, the residual connection is a simple shortcut connection structure that connects the inputs and outputs of the GRU layer. Existing studies show that residual networks behave like boosting algorithms, where the main idea is to combine different weak learners into a stronger learner. 30 We argue that the residual connection in GRRLN would help to boost the statement feature extraction ability of the basic GRU networks.…”
Section: Methodology Overview (mentioning)
confidence: 87%
“…Existing studies show that residual networks behave like boosting algorithms, where the main idea is to combine different weak learners into a stronger learner. 30 We argue that the residual connection in GRRLN would help to boost the statement feature extraction ability of the basic GRU networks. Intuitively, the residual connection will retain most features in the data, thus forcing the GRU model to focus more on the different features of the source code.…”
Section: GRU Layer With Residual Connection (mentioning)
confidence: 96%
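A minimal sketch (assumed PyTorch; not the GRRLN authors' implementation) of the shortcut structure these excerpts describe: the GRU layer's input is added back to its output, so the recurrent layer only has to learn a residual correction over the statement features. The class name and the matching input/hidden sizes are assumptions made for illustration.

```python
# Illustrative sketch (assumed PyTorch), not the GRRLN code: a GRU layer with a
# residual (shortcut) connection from its input to its output.
import torch
import torch.nn as nn


class ResidualGRU(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        # Input and hidden sizes match so the shortcut can be a plain addition.
        self.gru = nn.GRU(input_size=dim, hidden_size=dim, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        out, _ = self.gru(x)  # (batch, seq_len, dim)
        return x + out        # shortcut retains input features; GRU learns the residual
```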
“…To make BIER more robust, Hierarchical Boosted deep metric learning [69] … Literature shows that the boosting concept is the backbone behind well-known architectures like Deep Residual Networks [13, 72] and AdaNet [73]. The theoretical background for the success of Deep Residual Networks (DeepResNet) [13] was explained in the context of boosting theory [74].…”
Section: Boosting (mentioning)
confidence: 99%