“…Some of these scales, such as grammar and confidence, were common across all three tasks and some were task-specific, such as temporal shift in the narration and melodizing the script in the read-aloud. I based these specific scales on those usually employed by researchers when assessing L2 oral proficiency (e.g., ACTFL, 1982;Albrechtsen, Henriksen, & Faerch, 1980;Bachman, 1990;Brown, Anderson, Shillcock, &Yule, 1984;Canale, 1983;Canale & Swain, 1980;Fayer & Krasinski, 1987;Harlow & Caminero, 1990;Underhill, 1987). I presented the rating instrument scales to several Arabic language and content experts, and t o naive individuals, who were asked to comment on the scales' comprehensibility t o Arab raters and their adequacy t o assess students' performance on the three tasks.…”