“…31,39,40 Recently, a variational theorem for evaluating MSMs has been introduced, which enables the modeler to select the best MSM for a system based on its distance from a theoretical upper limit. 41,42 In this work, we reanalyze twelve ultra-long protein folding MD datasets. 5,18,34,43 For each system, we create MSMs with many protocol choices and utilize the variational theorem introduced by Noé and Nüske 41 as a metric for crossvalidation 44 to determine how different modeling choices affect the quality of the MSM as defined by its ability to detect and represent the systems' long-time scale dynamical processes.…”