“…[32,33,34,35]. Attention mechanisms seek clues to possible candidates in intermediate representations [21,26,27,28,29,32,36,37,38,39]. Uncertainty estimation, on the other hand, implicitly captures the noise and enables models to respond to the measured stochasticity inherent in the observations [11,22,40,41,42,43,44,45].…”