to more severe or chronic forms of the disease 5,6 and avoid The objective of this study is to present and validate a the occurrence of new episodes, including some cases of clinical scale for the diagnosis of drug-induced liver injury fulminant hepatic failure. 7 (DILI). Five components were selected to be included in the Because there are no specific markers or tests for DILI, the scale: temporal relationship between drug intake and the ondiagnosis is usually based on circumstantial evidence. 8 It set of clinical picture, exclusion of alternative causes, extraherelies on a great deal of speculation by the clinician, the patic manifestations, rechallenge or accidental re-exposure, collection of a detailed pharmacological history, the estaband previous report in medical literature. The relative imporlishment of a consistent relationship between drug intake tance of each component was weighed, and arbitrary scores and the onset of the clinical picture, and the exclusion of were attributed. The probability of the diagnosis of DILI was alternative causes. Clinical improvement following drug expressed as a final score, which could vary from 06 to 20.withdrawal is another element that may indicate a drugContent validity, criterion validity, construct validity, and inrelated cause, although there is an increased recognition of ter-rater reliability were studied. To analyze validity and reliaprolonged cholestatic hepatitis, sometimes lasting for more bility, a random sample of 50 cases of suspected DILI was than 6 months, after drug withdrawal. 9 Rechallenge is fredrawn from a series of 120 cases reported to our unit. The quently considered to be the most reliable test in the diagnoclassification of the 50 cases by three experts in DILI was sis of suspected cases of DILI, 10 but it is clearly dangerous used as the external standard in the study of criterion validity.and is best avoidable. 7 Agreement between the scale and the standard, and agreement This complex process of diagnosis usually requires an exbetween two independent raters (inter-rater reliability) was perienced clinician who is deeply aware of the critical compoanalyzed by weighted k coefficient. There was agreement benents to be weighed for an accurate diagnosis, including a tween the scale and the standard in 42 cases (84%) with a good knowledge of the published literature. The translation weighted k coefficient of 0.90. A good discriminatory capacity of such experience and subjective clinical judgment into a of the scale was found when construct validity was studied.quantitative measurement to define, by more objective criteAgreement between raters was observed in 86% of the cases, ria, the probability of an adverse event being related to a corresponding to the weighted k of 0.93. In conclusion, the drug, is of major importance. The complexity of the clinical clinical scale was shown to have a high-level of validity and diagnosis has led to attempts to improve in vitro diagnostic inter-rater reliability as well as a good discriminatory capacity tests to...