“…Existing findings showed that the use of different weighting of case test items or options, depending on their importance and appropriateness to the assessed task, has little impact on the resulting scores (Stillman et al, 1986b). In addition, empirical or expert-derived scoring, such as aggregate scoring, where the score of an item is proportional to the degree of agreement between experts (Norman, 1985), is found to correlate with other type of scoring and neither affects nor improves the validity of test scores (Webster, Shea, Norcini, Grosso, & Swanson, 1988). Finally, preliminary results showed that a scoring that takes into account both the examinees' performance outcomes and their underlying reasoning seems to be more discriminating than one that relies solely on the outcomes (Vu, Lee, & Steward, 1990).…”