The analysis and profiling of short tandem repeat (STR) loci is routinely used in forensic genetics. Current methods to investigate STR loci, including PCR-based standard fragment analyses and capillary electrophoresis, only provide amplicon lengths that are used to estimate the number of STR repeat units. These methods do not allow for the full resolution of STR base composition that sequencing approaches could provide. Here we present an STR profiling method based on the use of the Roche Genome Sequencer (GS) FLX to simultaneously sequence multiple core STR loci. Using this method in combination with a bioinformatic tool designed specifically to analyze sequence lengths and frequencies, we found that GS FLX STR sequence data are comparable to conventional capillary electrophoresis-based STR typing. Furthermore, we found DNA base substitutions and repeat sequence variations that would not have been identified using conventional STR typing.
Second generation sequencing (SGS) may revolutionize the field of forensic STR typing. Two of the essential requirements for implementation of an SGS based approach for forensic investigations are (1) establishment of adequate frequency databases and (2) adoption of a new STR nomenclature. We report the STR sequences and allele frequencies of three STR loci: D3S1358, D12S391 and D21S11 in 197 unrelated Danes. We used a new STR nomenclature that depicts the locus name used in forensic genetics, the length of the repeat region divided by the repeat length (typically 4 nucleotides) and detailed sequence information of possible sub-repeats and SNPs within the amplified fragment.
We sequenced the D21S11 locus in 77 individuals from Danish paternity cases using 454 FLX next generation sequencing (NGS) technology. All samples were also typed with the AmpFlSTR Profiler Plus or the AmpFlSTR Identifiler PCR Amplification kits as part of paternity investigations. In 18 of the confirmed trios, a genetic inconsistency was observed between one of the parents and the child at the D21S11 locus. NGS of the D21S11 locus revealed which allele had mutated from which parent to the child in 13 of these trios. All characterized mutations could be explained by single-step mutations in the longest sub-repeat of D21S11. A total of 53 of the 77 sequenced samples originated from unrelated individuals. Twenty different D21S11 alleles were detected by NGS in these individuals whereas only 13 different alleles were observed with fragment analysis. Several alleles had the same lengths but different sequences, e.g. four and three different alleles were detected by NGS with lengths determined by CE corresponding to allele 30 and allele 31, respectively.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.