We previously showed that an ambigraphic nucleic acid notation, based on symmetrical lowercase Roman characters, permits users to complement DNA by physically rotating the sequence text 180 degrees . This article describes an enhanced ambigraphic notation, which uses concept-related symbol design, rather than the arbitrary set of symbols that constitute the Roman alphabet, to logically encode the four DNA bases and 11 ambiguity characters. As ambigrams, the symbols continue to permit the rapid derivation of complementary sequences and visualization of palindromic DNA. In addition, the new AmbiScript notation uses legibility principles to support the identification of sequence polymorphism and improves writing efficiency by requiring fewer strokes per character than the International Union of Pure and Applied Chemistry (IUPAC) notation.
BackgroundAmbiscript is a graphically-designed nucleic acid notation that uses symbol symmetries to support sequence complementation, highlight biologically-relevant palindromes, and facilitate the analysis of consensus sequences. Although the original Ambiscript notation was designed to easily represent consensus sequences for multiple sequence alignments, the notation’s black-on-white ambiguity characters are unable to reflect the statistical distribution of nucleotides found at each position. We now propose a color-augmented ambigraphic notation to encode the frequency of positional polymorphisms in these consensus sequences.ResultsWe have implemented this color-coding approach by creating an Adobe Flash® application (
http://www.ambiscript.org) that shades and colors modified Ambiscript characters according to the prevalence of the encoded nucleotide at each position in the alignment. The resulting graphic helps viewers perceive biologically-relevant patterns in multiple sequence alignments by uniquely combining color, shading, and character symmetries to highlight palindromes and inverted repeats in conserved DNA motifs.ConclusionJuxtaposing an intuitive color scheme over the deliberate character symmetries of an ambigraphic nucleic acid notation yields a highly-functional nucleic acid notation that maximizes information content and successfully embodies key principles of graphic excellence put forth by the statistician and graphic design theorist, Edward Tufte.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.