“…The proposed architectures can be considered a further step towards more flexible neural network models for learning robust visual representations on the basis of visual experience. Successful applications of deep neural network self-organization include human action recognition (Parisi, Weber & Wermter 2014, Elfaramawy et al 2017, gesture recognition (Parisi, Barros & Wermter 2014, Parisi, Jirak & Wermter 2014, body motion assessment (Parisi, von Stosch, Magg & Wermter 2015, Parisi, Magg & Wermter 2016, humanobject interaction (Mici et al 2017(Mici et al , 2018, continual learning (Parisi et al 2017, Parisi, Tani, Weber & Wermter 2018, and audio-visual integration (Parisi, Tani, Weber & Wermter 2016). Models of hierarchical action learning are typically feedforward.…”