Many testing programs face the practical challenge of having limited resources to conduct comprehensive standard setting studies. Some researchers have suggested that a group's recommended cut score on a full-length test may be replicated by using a subset of the items. However, these studies were based on simulated data. This study describes a standard setting application in which two independent panels provided judgments on a 300-item licensure test. Specifically, one panel provided judgments on all 300 items, whereas the second panel made judgments on a randomly selected subset of 150 items. Both panels also participated in an alternate standard setting method to evaluate panel comparability. Results suggest caution for practitioners considering using subsets of items for standard setting studies. Accessed 7,224 times on https://pareonline.net from May 11, 2010 to December 31, 2019.
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.
Buckendahl, Chad W.; Ferdous, Abdullah A.; and Gerrow, Jack
"Recommending cut scores with a subset of items: An empirical illustration,"
Practical Assessment, Research, and Evaluation: Vol. 15, Article 6.
Available at: https://scholarworks.umass.edu/pare/vol15/iss1/6