“…However, some of them have methodological flaws, which would threaten the trustworthiness of their results (Gorard, 2013). Specifically, no standardised test is used in the judgement of CT (e.g., Fakunle et al, 2016;Guo & O'Sullivan, 2012;Li, 2013). While some researchers may show concern about the format of multiple-choice questions that involves a chance of guessing (Snyder, Edwards, & Sanders, 2019), the pre-specified evaluation criteria and the validation of testing items allow for a high level of objectivism.…”