“…On the other hand, to better capture the global coherence, pointer network (Vinyals et al, 2015) has been gradually used for the decoder of the ordering model. It is able to capture the paragraphlevel contextual information for generating an ordered sequence with the highest coherence probability (Gong et al, 2016;Logeswaran et al, 2018;Cui et al, 2018;Yin et al, 2019). Further, HAN (Wang and Wan, 2019) and TGCM (Oh et al, 2019) introduce the attention mechanism (Vaswani et al, 2017), and FUDecoder (Yin et al, 2020) proposes pairwise ordering prediction modules to enhance the traditional pointer network.…”