2021
DOI: 10.1266/ggs.21-00025
|View full text |Cite
|
Sign up to set email alerts
|

AI for the collective analysis of a massive number of genome sequences: various examples from the small genome of pandemic SARS-CoV-2 to the human genome

Abstract: In genetics and related fields, huge amounts of data, such as genome sequences, are accumulating, and the use of artificial intelligence (AI) suitable for big data analysis has become increasingly important. Unsupervised AI that can reveal novel knowledge from big data without prior knowledge or particular models is highly desirable for analyses of genome sequences, particularly for obtaining unexpected insights. We have developed a batch-learning self-organizing map (BLSOM) for oligonucleotide compositions th… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

1
14
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
1

Relationship

3
2

Authors

Journals

citations
Cited by 5 publications
(15 citation statements)
references
References 50 publications
1
14
0
Order By: Relevance
“…It should be noted that when nodes that have sequences with distinctly different frequencies are located nearby, empty nodes, to which no sequences are attributed after the learning, often appear between them, and these nodes are left blank (no color) on the BLSOM [8,9]. As in our previous BLSOM analysis of SARS-CoV-2 strains [9,23,24], which were mainly isolated within the first year of the pandemic, good clustering by lineage was obtained, and Omicron strains formed their own territory (red) on the right side of the map (Fig 2A).…”
Section: Resultsmentioning
confidence: 99%
See 4 more Smart Citations
“…It should be noted that when nodes that have sequences with distinctly different frequencies are located nearby, empty nodes, to which no sequences are attributed after the learning, often appear between them, and these nodes are left blank (no color) on the BLSOM [8,9]. As in our previous BLSOM analysis of SARS-CoV-2 strains [9,23,24], which were mainly isolated within the first year of the pandemic, good clustering by lineage was obtained, and Omicron strains formed their own territory (red) on the right side of the map (Fig 2A).…”
Section: Resultsmentioning
confidence: 99%
“…After the machine learning, to determine whether the clustering (self-organization) achieved by the BLSOM corresponded to the known lineages, nodes containing genomes of a single lineage were colored to indicate each lineage, and those containing genomes of multiple lineages were displayed in black. It should be noted that when nodes that have sequences with distinctly different frequencies are located nearby, empty nodes, to which no sequences are attributed after the learning, often appear between them, and these nodes are left blank (no color) on the BLSOM [8,9]. As in our previous BLSOM analysis of SARS-CoV-2 strains [9,23,24], which were mainly isolated within the first year of the pandemic, good clustering by lineage was obtained, and Omicron strains formed their own territory (red) on the right side of the map (Fig 2A).…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations