Does interrater (dis)agreement on Psychopathy Checklist scores in sexually violent predator trials suggest partisan allegiance in forensic evaluations?
Abstract:Many studies reveal strong interrater agreement for Hare's Psychopathy Checklist-Revised (PCL-R) when used by trained raters in research contexts. However, no systematic research has examined agreement between PCL-R scores from independent clinicians who are retained by opposing sides in adversarial legal proceedings. We reviewed all 43 sexual-offender civil-commitment trials in one state and identified 23 cases in which opposing evaluators reported PCL-R total scores for the same individual. Differences betwe… Show more
“…(e.g., DeMatteo, et al, 2014;Murrie et al, 2008;Murrie, Boccaccini, Turner, Meeks, Woods, & Tussey, 2009) and in Canada (Lloyd, Clark, & Forth, 2010). Beyond these studies, surveys of judges and attorneys consistently reveal their concern about bias among experts as well (e.g., Krafka, Dunn, Johnson, Cecil, & Miletich, 2002;Shuman, Whitaker, & Champagne, 1994).…”
A qualitative study with 20 board-certified forensic psychologists was followed up by a mail survey of 351 forensic psychologists in this mixed-methods investigation of examiner bias awareness and strategies used to debias forensic judgments. Rich qualitative data emerged about awareness of bias, specific biasing situations that recur in forensic evaluations, and potential debiasing strategies. The continuum of bias awareness in forensic evaluators mapped cogently onto the "stages of change" model. Evaluators perceived themselves as less vulnerable to bias than their colleagues, consistent with the phenomenon called the "bias blind spot." Recurring situations that posed challenges for forensic clinicians included disliking or feeling sympathy for the defendant, disgust or anger toward the offense, limited cultural competency, preexisting values, colleagues' influences, and protecting referral streams. Twenty-five debiasing strategies emerged in the qualitative study, all but one of which rated as highly useful in the quantitative survey. Some of those strategies are consistent with empirical evidence about their effectiveness, but others have been shown to be ineffective. We identified which strategies do not help, focused on promising strategies with empirical support, discussed additional promising strategies not mentioned by participants, and described new strategies generated by these participants that have not yet been subjected to empirical examination.Finally, debiasing strategies were considered with respect to future directions for research and forensic practice.
“…(e.g., DeMatteo, et al, 2014;Murrie et al, 2008;Murrie, Boccaccini, Turner, Meeks, Woods, & Tussey, 2009) and in Canada (Lloyd, Clark, & Forth, 2010). Beyond these studies, surveys of judges and attorneys consistently reveal their concern about bias among experts as well (e.g., Krafka, Dunn, Johnson, Cecil, & Miletich, 2002;Shuman, Whitaker, & Champagne, 1994).…”
A qualitative study with 20 board-certified forensic psychologists was followed up by a mail survey of 351 forensic psychologists in this mixed-methods investigation of examiner bias awareness and strategies used to debias forensic judgments. Rich qualitative data emerged about awareness of bias, specific biasing situations that recur in forensic evaluations, and potential debiasing strategies. The continuum of bias awareness in forensic evaluators mapped cogently onto the "stages of change" model. Evaluators perceived themselves as less vulnerable to bias than their colleagues, consistent with the phenomenon called the "bias blind spot." Recurring situations that posed challenges for forensic clinicians included disliking or feeling sympathy for the defendant, disgust or anger toward the offense, limited cultural competency, preexisting values, colleagues' influences, and protecting referral streams. Twenty-five debiasing strategies emerged in the qualitative study, all but one of which rated as highly useful in the quantitative survey. Some of those strategies are consistent with empirical evidence about their effectiveness, but others have been shown to be ineffective. We identified which strategies do not help, focused on promising strategies with empirical support, discussed additional promising strategies not mentioned by participants, and described new strategies generated by these participants that have not yet been subjected to empirical examination.Finally, debiasing strategies were considered with respect to future directions for research and forensic practice.
“…In a research context, misuse of these instruments will have few negative consequences for the person tested. However, when the scores are used in clinical and criminal justice contexts 42 It is important when conducting an assessment to use all information available to provide a complete picture of the person. In each case, the PCL-R must be used properly and in accordance with the highest ethical and professional standards.…”
Section: Pcl-r Assessment Of Psychopathymentioning
Psychopathy was the first personality disorder to be recognized in psychiatry. The concept has a long historical and clinical tradition, and in the last decade a growing body of research has supported its validity. 1, p 28 I n the decade following this 1998 statement, the theoretical and empirical literature on psychopathy has expanded virtually at an exponential rate, with the addition of well over 500 scientific publications and many books and edited volumes. Much of this literature examines and evaluates the application of psychopathy to the mental health and criminal justice systems, 2-4 where it has been described as "the most important and useful psychological construct yet discovered for criminal justice policies" 5, p 231 and as "what may be the most important forensic concept of the early 21st century." 6, book jacket However, the past few years have also seen a dramatic increase in basic research based on the theories and methodologies from basic science including, but certainly not limited to, behavioural genetics, developmental psychopathology, cognitive-affective neuroscience, biochemistry, general personality theory, 7 and organizational psychology. 8 In 2004, the SSSP was established as a vehicle for the exchange of ideas and research findings among international investigators. Because psychopathy is associated with so much social and personal damage and distress, the basic and applied research endeavours are now
“…A recent review of rater agreement findings from more than 120 PCL-R studies found that about half provided little or no information concerning whether independent interviews were used for the cases used to check rater agreement (Rufino et al, 2009). In only a handful of studies was it clear that rater agreement statistics were based on cases in which both evaluators conducted an independent interview (see, e.g., Levenson, 2004;Murrie, Boccaccini, Johnson, & Janke, 2008;Rutherford, Cacciola, Alterman, McKay, & Cook, 1999;Tyrer et al, 2005). In several of these studies, ICC values were notably lower than those reported in the PCL-R manual.…”
mentioning
confidence: 99%
“…More recently, Murrie et al (2008) compared rates of agreement between PCL-R scores provided by experts called by opposing sides (termed ''petitioners'' and ''respondents'') in Texas Sexually Violent Predator (SVP) civil commitment cases. In 23 cases, an expert retained by the respondent also administered the PCL-R. For the total score, the single-evaluator ICC 1 for absolute agreement across these cases was .39, well below the high levels of agreement observed for the PCL-R in most research contexts.…”
Despite considerable support for the inter-rater reliability of the Hare Psychopathy Checklist--Revised (PCL-R; Hare, 1991, 2003) in research contexts, there is increasing concern that scores from this instrument may be considerably less stable across examiners in applied contexts, particularly when scoring is based on separate interviews. The present study examines archival data from a sample of imprisoned sex offenders (n = 20) who obtained relatively high PCL-R total scores (> or =25) and were administered this instrument on a second occasion by a different examiner. Intraclass correlations for the total and Factor 2 score were lower than those generally reported in research studies. Of greater concern, Factor 1 scores were only negligibly related to each other (ICC(A,1) = .16). Correcting for potential range restriction among these high scoring individuals resulted in total and Factor 2 score measures of agreement that were somewhat more consistent with published research, but Factor 1 continued to display exceedingly poor agreement across examiners.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.