“…The Swiss-Prot dataset comprised 550,116 proteins divided into four kingdoms: 19,370 protein sequences from Archaea, 332,327 from Bacteria, 181,814 from Eukaryota, and 16,605 from Viruses [6]. However, to maintain a fair comparison with previous results [2, 8, 9], the number of proteins used in the presented 2D plots and parallel coordinates representations was reduced to 18,999 from Archaea, 326,945 from Bacteria, 176,646 from Eukaryota, and 16,316 from Viruses.…”
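
The counts quoted above can be cross-checked with a few lines of arithmetic. The sketch below is only a sanity check on the figures in the passage: the dictionary and variable names are illustrative assumptions, and the reduced total and per-kingdom differences are derived here rather than stated in the source.

```python
# Per-kingdom protein counts as quoted in the passage (full Swiss-Prot set).
full = {
    "Archaea": 19_370,
    "Bacteria": 332_327,
    "Eukaryota": 181_814,
    "Viruses": 16_605,
}

# Per-kingdom counts used in the 2D plots and parallel coordinates views.
reduced = {
    "Archaea": 18_999,
    "Bacteria": 326_945,
    "Eukaryota": 176_646,
    "Viruses": 16_316,
}

# Totals: the first should match the 550,116 stated in the passage;
# the second is derived here and does not appear in the source.
full_total = sum(full.values())
reduced_total = sum(reduced.values())

# Number of sequences left out per kingdom (derived, not stated in the source).
removed = {kingdom: full[kingdom] - reduced[kingdom] for kingdom in full}

print(f"full total:    {full_total:,}")
print(f"reduced total: {reduced_total:,}")
for kingdom, count in removed.items():
    print(f"{kingdom}: {count:,} removed ({count / full[kingdom]:.2%})")
```

The four full-set counts sum to the stated 550,116, so the quoted breakdown is internally consistent. The reduced set sums to 538,906, which means 11,210 sequences were left out across the four kingdoms.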