We report the use of neutron-encoded (NeuCode) stable isotope labeling of amino acids in cell culture for the purpose of C-terminal product ion annotation. Two NeuCode labeling isotopologues of lysine, 13 C 6 15 N 2 and 2 H 8 , which differ by 36 mDa, were metabolically embedded in a sample proteome, and the resultant labeled proteins were combined, digested, and analyzed via liquid chromatography and mass spectrometry. With MS/MS scan resolving powers of ϳ50,000 or higher, product ions containing the C terminus (i.e. lysine) appear as a doublet spaced by exactly 36 mDa, whereas N-terminal fragments exist as a single m/z peak. Through theory and experiment, we demonstrate that over 90% of all y-type product ions have detectable doublets. We report on an algorithm that can extract these neutron signatures with high sensitivity and specificity. In other words, of 15,503 y-type product ion peaks, the y-type ion identification algorithm correctly identified 14,552 (93.2%) based on detection of the NeuCode doublet; 6.8% were misclassified (i.e. other ion types that were assigned as y-type products). Searching NeuCode labeled yeast with PepNovo ؉ resulted in a 34% increase in correct de novo identifications relative to searching through MS/MS only. We use this tool to simplify spectra prior to database searching, to sort unmatched tandem mass spectra for spectral richness, for correlation of co-fragmented ions to their parent precursor, and for de novo sequence identification. The ability to make de novo sequence identifications directly from tandem mass spectra has long been a holy grail of the proteomic community. Such a capability would wean the field from its reliance upon sequenced genome databases. Even for organisms with fully annotated genomes, events such as single nucleotide polymorphisms, alternative splicing, gene fusion, and a host of other genomic transformations can result in altered proteomes. These alterations can vary from cell to cell and individual to individual. Thus, one could argue that the most valuable proteomic information, the individual and cellular proteome variation from the genome, remains elusive (1). This problem has received considerable attention; that said, it is not easy to de novo correlate spectrum to sequence in a large-scale, automated fashion (2-6). Improvements in mass accuracy have helped, but routine, reliable de novo sequencing without database assistance is not standard (7-10).A primary means to facilitate de novo spectral interpretation is the simple annotation of m/z peaks in tandem mass spectra as either N-or C-terminal. We and others have investigated this seemingly simple first step. Real-world spectra, however, are complex. Difficulties often arise in determining the charge state of the fragment or in differentiating between fragment ions and peaks arising from neutral loss, internal fragmentation, or spectral noise, both electronic and chemical. Several strategies have focused on product ion annotation. These approaches have included manipulation of the N-terminus ...