“…MIL frameworks rely on specific strategies for tile selection, feature extraction, and multiple inference aggregation. Common MIL paradigms rely on one training "bag" per image after careful tile selection (e.g., tissue content above a threshold) (12)(13)(14) ; data augmentations (12)(13)(14)(15)(16)(17) ; use of transfer learning from ImageNet via GoogLeNet, InceptionNet, ResNet, and MobileNet (18,19) for cancer, and AlexNet and ResNet for NAFLD (20,21) ; aggregation of tile-level inferences via max-pooling, or of tile-level features via average-pooling, attention-based or RNN based frameworks ahead of WSI-level prediction (12,13,16,21,22) .…”