“…In bottom-up proteomics, the mass spectra of (often tryptic) peptides are matched against their in silico digested counterparts generated from a database. Under a broader proteogenomic framework, various computational strategies have been developed to integrate proteomic data with (canonical and non-canonical) genomic annotation pipelines or to generate standalone in silico translation databases for discovery of novel proteins ( Risk et al, 2013 ; Jagtap et al, 2014 ; Mackowiak et al, 2015 ; Nagaraj et al, 2015 ; Zickmann and Renard, 2015 ; Kolmogorov et al, 2016 ; Olexiouk et al, 2016 ; Brunet et al, 2018 ; Guillot et al, 2019 ). At the MS-based experimental front, various fractionation and small protein enrichment methods have been employed to successfully identify novel non-canonical proteins in eukaryotic cell lines and tissues ( Ma et al, 2016a ; Li et al, 2017 ; He et al, 2018 ; Cao et al, 2020 ; Cardon et al, 2020 ; Kaulich et al, 2020 ; Cassidy et al, 2021 ; Wang et al, 2021 ).…”