“…Even though the approach of learning the splicing code from reference sequence was successful, the model may suffer from evolutionary confounding and fail to learn causal features. To address this issue, large-scale perturbation assays, such as massively parallel reporter assay (MPRA) and saturation mutagenesis screens, have been developed (Barash et al, 2010;Xiong et al, 2015;Rosenberg, Patwardhan, Shendure, & Seelig, 2015;Adamson, Zhan, & Graveley, 2018;Ke et al, 2018). In particular, Rosenberg, Patwardhan, Shendure, and Seelig (2015) probed millions of exonic and intronic random sequences to test their impact on splicing.…”