This article discusses the problem of quality in the production of language tests in the context of Italian language examinations. The concept of quality is closely related to the application of stated standards and related procedures. These standards, developed over the last thirty years, are mainly related to the concepts of the accountability and responsibility of organisations and testing bodies. In this context and for these purposes, a quality management system (QMS) could be a useful tool for the setting out of procedures and the consistent application of standards to the different phases in the production of language examinations, thereby supporting and providing evidence of the valid interpretation and use of test results (Messick 1989). Taking into account the development of standards in educational assessment (Saville 2014), the article starts with a brief exploration of the terms standard, quality, and quality management system, then goes on to discuss the potential relationship between standards, QMS and the argument-based approach to validation (Kane 2006(Kane , 2010(Kane , 2012Bachman and Palmer 2010). The second part of the article describes, in concrete terms, the experience of the Centro per la Valutazione e le Certificazioni Linguistiche (CVCL) at the Università per Stranieri di Perugia, in applying the QMS produced by the Association of Language Testers in Europe (ALTE), considering how this application aimed to implement a set of procedures in order to achieve the stated standards through a continuous process of improvement of the overall testing system.
The contextOne of the major problems that test producers, both individuals and testing organisations, are faced with is the difficulty of combining complex theoretical concepts, such as validity and reliability linked to linguistic theories and psychometric properties, with day-to-day practice, implementing them using a set of consistently applied procedures, and providing evidence of the steps followed during the process of designing, producing and delivering tests. The overall process is intended to guarantee quality of language tests assuring valid results. Over the last 40 years theoretical models and operational frameworks have been proposed to be used as guidelines in language test production (e.g. between theoretical models and operational frameworks has been discussed and criticised. Moreover the need for a more empirically based approach to the development of assessment frameworks has been highlighted (Chalhoub-Deville 1997). These arguments and the subsequent debate are beyond the scope of this article. For present purposes, it is sufficient to bear in mind that, despite the fundamental work that has been carried out by scholars and researchers over the years, test providers still lack adequate guidance in their current practice, and are still left unsure whether what they are doing corresponds to norms agreed within both the scientific community and a community of practice.This need for further guidance is becoming increasingly e...