to contribute to health research literature dealing with psychological assessment by primarily arguing that radical clarity in the procedures undertaken for creating data are necessary if research is to detect a more probable truth as it distances itself from absolute uncertainty. In contribution towards these goals, the project presents and makes available a novel statistical program (originated by author without the use of any external sources) to help data managers create five different variables that can help secondary data user understand the withinperson level of in-completeness for the full CES-D scale and its four sub-scales separately.
BackgroundAt the most fundamental level, investigating human health through statistical techniques (e.g., using regression equations) requires numerically coding their behaviors, perceptions, physical attributes, and environments. In order for the legitimacy of statistics to be employed in analyses, human behavior must be converted into a numerical existence-i.e., the abstraction of the physical world must occur with the use of numbers. Numbers attributed with meaning are then explored in a world of equations where results from samples are used to infer population characteristics or where statistical significance is inferred typically using frequentist views on probabilities. Unfortunately, a study participant could be born into the world of numbers through ambiguous mechanisms and careless documentation. Paying attention to how information is handled at the most early stages of data creation is crucial to the value of any subsequent work because the quality of data influences the potential value of results.
AbstractThe Center for Epidemiologic Studies Depression Scale (CES-D) is a widely used screening test for depression symptomatology in population studies. Publications have dealt with the psychometric properties of the instrument. Until now, no published work has examined the how questionnaire items are handled in data processing protocols. This technical report introduces the "CESD-Flag" macro algorithm using SAS ® 9.3 syntax. The main contribution of the statistical program in this report is to provide data managers with a program that creates five CES-D flag variables to help identify the level and location of missingness in CES-D related data. The paper highlights implications from how data editing algorithms handle missing or inconclusive responses to CES-D questions. If absolute transparency should be encouraged as a desirable goal, then the methodology used to "clean" CES-D data should be provided along descriptive statistics for the five flag variables (measure of item incompleteness) created by the CESD-Flag program. The assignment of responses through non-evidence based algorithms merits further attention as the use of CES-D continues to proliferate internationally.