Item Analysis of Clinical Performance Examination Using Item Response Theory and Classical Test Theory

Lim, Hyun-Sun; Lee, Young-Mee; Ahn, Duck-Sun; Lee, Joon-Young; Im, Hyung

Original Article

Korean Journal of Medical Education 2007;19(3): 185-195. doi: https://doi.org/10.3946/kjme.2007.19.3.185

문항반응이론과 고전검사이론을 이용한 진료수행시험의 문항 분석

임현선¹, 이영미¹, 안덕선¹, 이준영², 임형³

¹고려대학교 의과대학 의학교육학교실
²고려대학교 의과대학 의학통계학교실
³성공회대학교 소프트웨어공학과

Item Analysis of Clinical Performance Examination Using Item Response Theory and Classical Test Theory

Hyun-Sun Lim¹, Young-Mee Lee¹, Duck-Sun Ahn¹, Joon-Young Lee², Hyung Im³

¹Department of Medical Education, Korea University, College of Medicine, Seoul, Korea.
²Department of Medical Statistics, Korea University, College of Medicine, Seoul, Korea.
³Department of Software Engineering, Sungkonghoe University, Seoul, Korea.

Corresponding Author: Young-Mee Lee, Tel: 02)920-6098, Fax: 02)928-1647, Email: ymleehj@korea.ac.kr

ABSTRACT

PURPOSE: The objectives of this study were 1) to analyze Clinical Performance Examination (CPX) items using item response theory (IRT) and classical test theory (CTT) and 2) to discuss how to apply and interpret the results in order to improve the quality of CPX items. In addition, we explored statistical procedures for merging examination data from several different medical schools.

METHODS: The study used the 2005 CPX data from 10 medical schools located in Seoul and the Kyunggi province. To merge the data from the 10 schools, Levene's test was used to check the homogeneity of variances. Homogeneous groups were selected using ANOVA or the Kruskal-Wallis test and Tukey's multiple comparisons, as appropriate. The generalized partial credit model was applied to polytomous items and the 2-parameter logistic model to dichotomous items.

RESULTS: Data from 8 medical schools were incorporated into the analysis. The discrimination indices obtained from IRT differed from those obtained from CTT for both polytomous and dichotomous items, and the IRT discrimination indices tended to be lower than the CTT ones. For dichotomous items, the difficulty indices from the two models correlated well with each other. For polytomous items, however, the IRT model provided more information than CTT.

CONCLUSION: The CPX items were mostly easy in terms of the difficulty index, and the results from the IRT and CTT models did not correlate well for the discrimination index. IRT may provide more detailed information for polytomous items, but the checklists and scoring criteria should be reviewed carefully.

Keywords: Clinical performance examination; Item response theory; Classical test theory
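
The two IRT models named in the Methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which software or parameterization was used, so the function names `two_pl` and `gpcm`, the parameter names, and the omission of the optional scaling constant D = 1.7 are all assumptions made for illustration.

```python
import math


def two_pl(theta, a, b):
    """2-parameter logistic model for a dichotomous item.

    theta: examinee ability; a: discrimination; b: difficulty.
    Returns P(correct | theta). Scaling constant D is assumed to be 1.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def gpcm(theta, a, steps):
    """Generalized partial credit model for a polytomous item.

    steps: step parameters b_1..b_m for score categories 1..m
           (hypothetical name; category 0 has no step parameter).
    Returns a list of m + 1 category probabilities that sum to 1.
    """
    # Exponent for category k is the running sum of a * (theta - b_v), v = 1..k.
    exponents = [0.0]
    for b in steps:
        exponents.append(exponents[-1] + a * (theta - b))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(exponents)
    weights = [math.exp(e - top) for e in exponents]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    # An examinee whose ability equals the item difficulty has a 50% chance.
    print(two_pl(0.0, a=1.0, b=0.0))  # 0.5
    # Category probabilities for an illustrative 4-category checklist item.
    print(gpcm(0.3, a=1.2, steps=[-1.0, 0.0, 1.0]))
```

With a single step parameter, `gpcm` reduces to `two_pl`, which is why the two models can be reported side by side for dichotomous and polytomous items. In this parameterization, `a` plays the role of the IRT discrimination index and `b` (or the step parameters) the role of difficulty.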