“…To compare metabolic profiles of MAGs recovered, glycosidase profiles were expressed as binary data, including presence (value = 1) and absence (value = 0) of glycosidase domains from the same CAZy family in each MAG. According to different criteria, these data were used as input for different machine learning models, powerful pattern-recognition algorithms that allow accurate classification of samples from their biological origin [ 26 , 27 ]. Our machine-learning data analysis strategy involved three main steps: (i) unsupervised distribution of glycosidase profiles of MAGs according to both Bifidobacterium species ( B. adolescentis , B. bifidum , B. breve , B. catenulatum , B. dentium , B. longum , B. pseudocatenulatum , B. scardovii ) and diet type (breastfed, breastfed + GOS, breastfed + GOS + FOS, infant formula-fed, infant formula-fed + FOS, infant formula-fed + GOS, infant formula-fed + GOS + FOS and whole-milk-fed infants as well as fiber-rich diets in adults), (ii) supervised classification of MAGs to accurately elucidate characteristic glycosidic patterns of the main bifidobacteria identified ( B. adolescentis , B. bifidum , B. breve , B. longum , B. pseudocatenulatum ), (iii) association study between glycosidase families found in MAGs.…”