A major limitation of intact protein
fragmentation is the lack
of sequence coverage within proteins’ interiors. We show that
collisionally activated dissociation (CAD) produces extensive internal
fragmentation within proteins’ interiors that fill the existing
gaps in sequence coverage, including disulfide loop regions that cannot
be characterized using terminal fragments. A barrier to the adoption
of internal fragments is the lack of methods for their generation
and assignment. To provide these, we explore the effects of protein
size, mass accuracy, internal fragment size, CAD activation energy,
and data preprocessing upon the production and identification of internal
fragments. We also identify and mitigate the major source of ambiguity
in internal fragment identification, which we term “frameshift
ambiguity.” Such ambiguity results from sequences containing
any “middle” portion surrounded by the same composition
on both termini, which upon fragmentation can produce two internal
fragments of identical mass, yet out of frame by one or more amino
acids (e.g., TRAIT producing TRAI or RAIT). We show that such instances
permit the a priori assignment of the middle sequence portion. This
insight and our optimized methods permit the unambiguous assignment
of greater than 97% of internal fragments using only the accurate
mass. We show that any remaining ambiguity in internal fragment assignment
can be removed by consideration of fragmentation propensities or by
(pseudo)-MS3. Applying these methods resulted in a 10-fold
and 43-fold expanded number of identified ions, and a concomitant
7- and 16-fold increase in fragmentation sites, respectively, for
native and reduced forms of a disease-associated SOD1 variant.