“…The usefulness of deep networks has been tested and proven in many NLP tasks, such as machine translation (Young et al, 2018 ), sentiment analysis (Zhang et al, 2018a ), text classification (Conneau et al, 2017 ; Zhang et al, 2018b ), relations extraction (Huang and Wang, 2017 ), as well as in AM (Cocarascu and Toni, 2017 , 2018 ; Daxenberger et al, 2017 ; Galassi et al, 2018 ; Lauscher et al, 2018 ; Lugini and Litman, 2018 ; Schulz et al, 2018 ). While a straightforward approach to exploit domain knowledge in AM is to apply a set of hand-crafted rules on the output of some first stage classifier (such as a neural network), NeSy or SRL approaches can directly enforce (hard or soft) constraints during training , so that a solution that does not satisfy them is penalized, or even ruled out.…”