Recommending cut scores with a subset of items: An empirical illustration

Chad W. Buckendahl
Abdullah A. Ferdous
Jack Gerrow

DOI

https://doi.org/10.7275/tv3s-cz67

Abstract

Many testing programs face the practical challenge of having limited resources to conduct comprehensive standard setting studies. Some researchers have suggested that replicating a groupâ€™s recommended cut score on a full-length test may be possible by using a subset of the items. However, these studies were based on simulated data. This study describes a standard setting application using two independent panels providing judgments on a 300-item licensure test. Specifically, one panel provided judgments on all 300 items; whereas the second panel made judgments on a randomly-selected subset of 150 items. Both panels also participated in an alternate standard setting method to evaluate panel comparability. Results suggest caution for practitioners considering using subsets of items for standard setting studies. Accessed 7,224 times on https://pareonline.net from May 11, 2010 to December 31, 2019. For downloads from January 1, 2020 forward, please click on the PlumX Metrics link to the right.

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Recommended Citation

Buckendahl, Chad W.; Ferdous, Abdullah A.; and Gerrow, Jack (2019) "Recommending cut scores with a subset of items: An empirical illustration," Practical Assessment, Research, and Evaluation: Vol. 15, Article 6.
DOI: https://doi.org/10.7275/tv3s-cz67
Available at: https://scholarworks.umass.edu/pare/vol15/iss1/6

Link to Full Text

COinS

Recommending cut scores with a subset of items: An empirical illustration

Authors

DOI

Abstract

Creative Commons License

Recommended Citation

Share