SEX DIFFERENCES IN SAT ® PREDICTIONS OF COLLEGE GRADES

Recent studies have found substantial reductions in gender differences in the prediction of academic achievement in college when variations in grading standards among courses were taken into account. The purpose of this project was to examine gender differences in the prediction of freshman grades after controlling for differential course grading based on college majors. This method involved deriving a variable that measured grading leniency using residual scores from the within‐gender regressions of freshman grades on high school grades and scores on the SAT for the non‐Latino white group. The procedure worked quite well and generalized to other groups not involved in the derivation of the grading‐leniency scale. Nevertheless, there were modest, sometimes statistically significant, gender differences in prediction that remained after this control variable was introduced into the regressions. The largest and smallest differences for females between actual grades and grades predicted from the males' regressions tended to be found in the African American and Asian American groups, respectively. The results imply that the use of information on college majors is a reasonable, practical procedure for controlling for grading leniency.

Section: Comparisons Across Regression Modelssupporting

confidence: 89%

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

College Major and Gender Differences in the Prediction of College Grades

Pennock-Román¹

1994

Adjusting College Grade‐point Average for Variations in Grading Standards

Stricker¹,

Rock²,

Burton³

et al. 1992

Self Cite

This study compared the effectiveness of several existing and proposed methods for statistically adjusting college GPAs for course and departmental differences in grading standards, using first-semester grades from an entire entering class at a large state university. Most of the adjusted GPAs produced by these methods functioned similarly and, despite high correlations with actual GPA, had greater internal-consistency reliability than actual GPA and were more predictable from SAT scores and high school rank (HSR). Most of the adjusted GPAs also functioned similarly with regard to sex differences in over-underprediction. The adjusted GPAs and actual GPA exhibited the same small but significant sex differences in over-underprediction by SAT scores, but the adjusted GPAs displayed smaller differences than actual GPA in overunderprediction by SAT scores and HSR. Adjusting College Grade-Point Average for Variations in Grading StandardsCollege grade-point average (GPA) , though originally intended for administrative purposes (Smallwood, 1935), is widely employed in educational and psychological research, particularly as a criterion for validating admissions measures (e.g., see the reviews by Breland, 1981;Fishman & Pasanella, 1960;Lavin, 1965).Despite the popularity of GPA, it is generally recognized that this is a fallible index of academic performance (e.g., see the reviews by Milton, Pollio, & Eison, 1986;Warren, 1971;Willingham, 1990). A major problem is that GPA is based on a different set of courses for each student, and the grading standards are not uniform from course to course, a phenomenon that has been observed for many years (e.g., Meyer, 1908). Hence, GPA is not comparable for students who take courses with severe grading standards and students who take courses with lenient standards, and its reliability and validity are attenuated.Differences in grading standards have been rigorously documented among departments (Anderhalter, 1962;de Nevers, 1984;Elliott & Strenta, 1988;Frisbee, 1984: Gamson, 1967Goldman & Hewitt, 1975;Goldman, Schmidt, Hewitt, & Fisher, 1974;Goldman & Widawski, 1976;Juola, 1968;Prather & Smith, 1976; Prather, Smith, & Kodras, 1979; Ramist, Lewis, & McCamley, 1990;Sabot & Wakeman-Linn, 1991;Strenta & Elliott, 1987; Willingham, 1985), as well as within departments (Garrison, 1979;Juola, 1968).The consequences of variations in grading standards on the reliability and validity of GPA are suggested by studies that attempted to adjust GPA for differences in these standards. The adjustments increased the median correlation between yearly GPAs from ,67 to .72 (Elliott & Strenta, 1988).The adjustments also generally boosted the correlations of admissions measures with GPA: the multiple correlation of the Scholastic Aptitude Test (SAT; -2-Donlon, 1984) scores and high school GPA with four-year GPA increased from .58 to .64 (Young, 1990b), and the correlations of the total SAT score (combining the Verbal [V] and Mathematical [M] scores) with four-year GPA went from .43 to .50 (Strenta...

Student Group Differences in Predicting College Grades: Sex, Language, and Ethnic Groups

Ramist¹,

Lewis²,

McCamley-Jenkins³

1994

101

139

Part 1 of this study investigated possible causes of the observed decline in correlations between SAT scores and freshman grade‐point average (FGPA). The results were described in Chapter 12, “Implications of Using Freshman GPA as the Criterion for the Predictive Validity of the SAT,” and were the basis for much of Chapters 2 and 3 of the monograph Predicting College Grades: An Analysis of Institutional Trends Over Two Decades (Willingham, Lewis, Morgan, and Ramist 1990). Working with a data base of 38 colleges, the study found that the comparability of course grades received by entering freshmen declined in the 1980s. Three new measures of grade comparability—variety of courses taken, variation in average student aptitude among courses, and appropriateness of average course grade in relation to student aptitude level—proved to be excellent indicators of both the level of and the change in SAT validity for predicting FGPA among the 38 colleges. Using course grade as the criterion instead of FGPA reduced the decline in both SAT and high school GPA (HSGPA) validity for predicting course grades by 40 percent. Contrary to the assumption that high school record (HSR) is a better predictor than the SAT, compared with HSR the SAT had higher or equal average validities for predicting course grade in almost all categories of courses. (Each course was placed into one of 37 categories based on subject, skills required, and level.) Part 2 of this project examines course selection, grading patterns, grade comparability, SAT predictive effectiveness, and average over‐ and underpredictions in each type of course for groups defined by an academic composite index, sex, English as best or not best language, and ethnic group. SAT predictive effectiveness is determined with and without HSR on the basis of correlations that are corrected for restriction of range. Over‐ and underpredictions are determined by residuals from predictions. All results are analyzed by college selectivity level and size. On average, males took more rigorously graded courses and females obtained a higher FGPA: two‐thirds of the .09 difference by sex in FGPA related to course selection. Predictions of course grades based on the SAT were better for females, on average, than for males, and the SAT added more incremental information over HSR for females. Underprediction of FGPA for females, using the SAT and HSR, averaged .06. Underprediction of course grade for females, using the SAT and HSR, averaged .03, but was reduced to .02 using the Test of Standard Written English (TSWE) as an additional predictor, and was eliminated entirely at more selective colleges. Although on average the SAT predicted FGPA and course grades better for students whose best language was English, it added more incremental information over HSR for students whose best language was not English. Asian American students took, on average, very strictly graded courses, but obtained a high average FGPA. The SAT predicted FGPA and course grades better for them than for any other ethnic...