“…Mutational signatures are discovered by analyzing ensemble point-mutation counts from a sample of individuals. Each individual record lists the count of mutations detected in the cancer sample, classified with respect to the trinucleotide contexts: the observed nucleotide substitution (C>A, C>G, C>T, T>A, T>C, or T>G) and the neighboring 5′ and 3′ nucleotides, for a total of 96 trinucleotide mutation types ( Lal et al., 2021 ). The counts are naturally summarized in a rectangular data-frame (an R dataframe, in our case) whose row records refer to the individual cancer samples and whose columns correspond to the 96 types.…”