Korean Journal of Medical Education

Original Article
DOI : https://doi.org/10.3946/kjme.2007.19.3.185

Korean J Med Educ. 2007; 19(3): 185-195.

Published online 2007 September 30.

Item Analysis of Clinical Performance Examination Using Item Response Theory and Classical Test Theory

Hyun-Sun Lim¹, Young-Mee Lee¹, Duck-Sun Ahn¹, Joon-Young Lee², and Hyung Im³

¹Department of Medical Education, Korea University, College of Medicine, Seoul, Korea.
²Department of Medical Statistics, Korea University, College of Medicine, Seoul, Korea.
³Department of Software Engineering, Sungkonghoe University, Seoul, Korea.

Corresponding Author: Email: ymleehj@korea.ac.kr

ABSTRACT

PURPOSE: The objectives of this study were 1) to analyze Clinical Performance Examination (CPX) items using item response theory (IRT) and classical test theory (CTT) and 2) to discuss how to apply and interpret the results in order to improve the quality of CPX items. In addition, we explored statistical procedures for merging examination data from several different medical schools. METHODS: The study used the 2005 CPX examination data from 10 medical schools located in Seoul and Kyunggi province. Before the data from the ten schools were merged, Levene's test was used to check the homogeneity of variances. A homogeneous group of schools was then selected using ANOVA or the Kruskal-Wallis test and Tukey's multiple comparisons, as appropriate. The generalized partial credit model was applied to polytomous items and the 2-parameter logistic model to dichotomous items. RESULTS: Data from 8 medical schools were included in the analysis. The discrimination indices obtained from IRT differed from those obtained from CTT for both polytomous and dichotomous items, and the IRT values tended to be lower than the CTT values. For dichotomous items, the difficulty indices from the two approaches correlated well. For polytomous items, however, the IRT model provided more information than CTT. CONCLUSION: Most CPX items were easy in terms of the difficulty index, and the IRT and CTT results did not correlate well for the discrimination index. IRT may provide more detailed information for polytomous items, but the checklist and the criteria of the scoring system should be reviewed carefully.

Keywords: Clinical performance examination; Item response theory; Classical test theory
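
As a rough illustration of the models named in the abstract, the sketch below evaluates the 2-parameter logistic model for a dichotomous item, the generalized partial credit model for a polytomous item, and two CTT indices. It only evaluates the textbook formulas for given parameters and does not estimate parameters from data. All function names and inputs are hypothetical, and the abstract does not state which CTT discrimination index was used, so the item-total correlation shown here is an assumption.

```python
import math


def p_2pl(theta, a, b):
    """2-parameter logistic model: probability of a correct (score 1)
    response given ability theta, discrimination a, and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def gpcm_probs(theta, a, steps):
    """Generalized partial credit model: probabilities of scores 0..m,
    where `steps` holds the m step parameters b_1..b_m."""
    # Exponent for score k is the cumulative sum of a * (theta - b_v);
    # score 0 has exponent 0 by convention.
    exponents = [0.0]
    for b in steps:
        exponents.append(exponents[-1] + a * (theta - b))
    peak = max(exponents)  # subtract the max for numerical stability
    weights = [math.exp(z - peak) for z in exponents]
    total = sum(weights)
    return [w / total for w in weights]


def ctt_difficulty(item_scores):
    """CTT difficulty index for a 0/1 item: proportion scoring 1."""
    return sum(item_scores) / len(item_scores)


def ctt_discrimination(item_scores, total_scores):
    """Assumed CTT discrimination index: Pearson correlation between
    the item score and the total score (point-biserial for 0/1 items)."""
    n = len(item_scores)
    mean_x = sum(item_scores) / n
    mean_y = sum(total_scores) / n
    cov = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(item_scores, total_scores))
    var_x = sum((x - mean_x) ** 2 for x in item_scores)
    var_y = sum((y - mean_y) ** 2 for y in total_scores)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical values, chosen only to exercise the functions.
    print(p_2pl(theta=0.0, a=1.2, b=-1.0))
    print(gpcm_probs(theta=0.0, a=1.0, steps=[-1.0, 0.5]))
    print(ctt_difficulty([1, 1, 0, 1]))
    print(ctt_discrimination([1, 1, 0, 0], [4, 3, 2, 1]))
```

With a single step parameter, the generalized partial credit model reduces to the 2-parameter logistic model, which is why the two can be applied side by side to polytomous and dichotomous checklist items.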