“…Notably, evidence supports its ability to discriminate amongst staff surgeons of differing case volume [16], as well as across a single surgeon's learning curve [13]. The vast majority of literature using the GEARS score has found it to be a reliable assessment method [4,11,16,24,25,27,28]. However, a study of robotic renal hilar dissection using oriented expert raters showing poor internal consistency [17], and Hung et al [21] found that while trainee self-assessments and faculty evaluations correlated weakly, inter-faculty reliability was better when assessing residents [intraclass correlation (ICC) = 0.77] and fellows (ICC = 0.45).…”