“…With the release of PDTB 2.0 (Prasad et al., 2008), much work has been done on discourse relation identification in natural (i.e., genuine) discourse data (Pitler et al., 2009; Lin et al., 2009; Wang et al., 2010; Zhou et al., 2010; Braud and Denis, 2015; Fisher and Simmons, 2015), using traditional, linguistically informed NLP features and machine learning algorithms. More recently, a growing number of researchers have turned to neural networks for implicit discourse relation recognition (Zhang et al., 2015; Qin et al., 2016a; Liu and Li, 2016; Braud and Denis, 2016; Wu et al., 2016).…”