Gordon P. Brooks scite author profile

Pilot studies are often recommended by scholars and consultants to address a variety of issues, including preliminary scale or instrument development. Specific concerns such as item difficulty, item discrimination, internal consistency, response rates, and parameter estimation in general are all relevant. Unfortunately, there is little discussion in the extant literature of how to determine appropriate sample sizes for these types of pilot studies. This article investigates the choice of sample size for pilot studies from a perspective particularly related to instrument development. Specific recommendations are made for researchers regarding how many participants they should use in a pilot study for initial scale development.

show abstract

Functions of humor in conversation: Conceptualization and measurement

Graham

Papa

Brooks

1992

Western Journal of Communication

127

View full text Add to dashboard Cite

Sample Size Considerations for Multiple Comparison Procedures in ANOVA

Brooks

Johanson

2011

J. Mod. App. Stat. Meth.

View full text Add to dashboard Cite

show abstract

Outlier Impact and Accommodation Methods: Multiple Comparisons of Type I Error Rates

Liao

Brooks

2016

J. Mod. App. Stat. Meth.

View full text Add to dashboard Cite

show abstract

Item Discrimination and Type I Error in the Detection of Differential Item Functioning

Brooks

Johanson

2012

Educational and Psychological Measurement

View full text Add to dashboard Cite

In 2009, DeMars stated that when impact exists there will be Type I error inflation, especially with larger sample sizes and larger discrimination parameters for items. One purpose of this study is to present the patterns of Type I error rates using Mantel–Haenszel (MH) and logistic regression (LR) procedures when the mean ability between the focal and reference groups varies from zero to one standard deviation. The findings can be used as guides for alpha adjustment when using MH or LR methods when impact exists. A second purpose is to better understand the conditions that cause Type I error rates to inflate. The results indicate that inflation can be controlled even in the presence of large ability differences and with large samples.

show abstract

Factor analysis and psychometric evaluation of the mathematical modeling attitude scale for teachers of mathematics

Asempapa

Brooks

2020

J Math Teacher Educ

View full text Add to dashboard Cite

TAP: Test Analysis Program

Brooks¹,

Johanson

2003

Applied Psychological Measurement

View full text Add to dashboard Cite

Courses in introductory educational measurement are often hampered by the lack of computer programs by which to analyze test data. To be sure, computer software exists (e.g., SPSS, SAS, Iteman) that performs these analyses; however, these programs come at a high cost and are not designed for instructional use. As a result, many practicing teachers who take the introductory educational measurement course learn about reliability and item analysis but are not able to continue to use these skills after the course ends. The Test Analysis Program (TAP), written in Borland Delphi Professional Version 6.0, performs classical test and item analyses under Windows 9x/NT/XP. In addition to performing test analyses, the TAP software includes certain features that will assist instructors of educational measurement in the classroom.

show abstract

Mixing Interviews and Rasch Modeling: Demonstrating a Procedure Used to Develop an Instrument That Measures Trust

David

Hitchcock

Ragan

et al. 2016

Journal of Mixed Methods Research

View full text Add to dashboard Cite

Developing psychometrically sound instruments can be difficult, especially if little is known about the constructs of interest. When constructs of interest are unclear, a mixed methods approach can be useful. Qualitative inquiry can be used to explore a construct's meaning in a way that informs item writing and allows the strengths of one analysis method to compensate for the weaknesses of the other. Mixing method applications can be complex, however, there are few examples within the literature pertaining to the mix of interviews, Rasch modeling, and classical test theory. This article demonstrates how to mix qualitative inquiry with Rasch modeling (and classical test theory) in order to develop an instrument that measures a complex construct: patient trust.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Gordon P. Brooks

Initial Scale Development: Sample Size for Pilot Studies

Functions of humor in conversation: Conceptualization and measurement

Sample Size Considerations for Multiple Comparison Procedures in ANOVA

Outlier Impact and Accommodation Methods: Multiple Comparisons of Type I Error Rates

Item Discrimination and Type I Error in the Detection of Differential Item Functioning

Factor analysis and psychometric evaluation of the mathematical modeling attitude scale for teachers of mathematics

TAP: Test Analysis Program

Mixing Interviews and Rasch Modeling: Demonstrating a Procedure Used to Develop an Instrument That Measures Trust

Contact Info

Product

Resources

About