Item Analysis of Clinical Performance Examination Using Item Response Theory and Classical Test Theory
Hyun-Sun Lim1, Young-Mee Lee1, Duck-Sun Ahn1, Joon-Young Lee2, Hyung Im3
1Department of Medical Education, Korea University, College of Medicine, Seoul, Korea. 2Department of Medical Statistics, Korea University, College of Medicine, Seoul, Korea. 3Department of Software Engineering, Sungkonghoe University, Seoul, Korea.
Corresponding Author:
Young-Mee Lee, Tel: 02)920-6098, Fax: 02)928-1647, Email: ymleehj@korea.ac.kr
Hyung Im, Tel: 02)920-6098, Fax: 02)928-1647, Email: ymleehj@korea.ac.kr
ABSTRACT
PURPOSE: This study aimed to (1) analyze Clinical Performance Examination (CPX) items using item response theory (IRT) and classical test theory (CTT) and (2) discuss how to apply and interpret the results in order to improve the quality of CPX items. We also explored statistical procedures for merging examination data from several different medical schools.
METHODS: We analyzed the 2005 CPX data from 10 medical schools located in Seoul and Kyunggi Province. Before merging data from the ten schools, we tested homogeneity of variances with Levene's test. Homogeneous groups of schools were then selected using ANOVA or the Kruskal-Wallis test, as appropriate, together with Tukey's multiple comparisons. The generalized partial credit model was applied to polytomous items and the 2-parameter logistic model to dichotomous items.
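As an illustration of the two IRT models named in METHODS, the following is a minimal Python sketch of their standard response functions. It is not the authors' analysis code. The function names, the parameter names (`theta` for examinee ability, `a` for item discrimination, `b` and `steps` for difficulty and step parameters), and the omission of any scaling constant are assumptions made for this example.

```python
import math

def p_2pl(theta, a, b):
    """Probability of a correct response under the 2-parameter logistic model.

    theta: examinee ability, a: item discrimination, b: item difficulty.
    (Names are illustrative; no scaling constant D is applied.)
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def p_gpcm(theta, a, steps):
    """Category probabilities under the generalized partial credit model.

    steps holds the step parameters b_1..b_m, so the item has m + 1
    score categories 0..m. Returns a list of m + 1 probabilities.
    """
    # Cumulative sums of a * (theta - b_v); category 0 has an empty sum (0.0).
    logits = [0.0]
    for b in steps:
        logits.append(logits[-1] + a * (theta - b))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(logits)
    weights = [math.exp(z - top) for z in logits]
    total = sum(weights)
    return [w / total for w in weights]
```

With a single step parameter the generalized partial credit model has two categories and reduces to the 2-parameter logistic model, which is one way to see why the two models can be reported side by side for dichotomous and polytomous items.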
RESULTS: Data from 8 medical schools were incorporated into the analysis. For both polytomous and dichotomous items, the discrimination indices from IRT differed from those from CTT and tended to be lower. For dichotomous items, the difficulty indices from the two approaches correlated well with each other. For polytomous items, however, the IRT model provided more information than CTT.
CONCLUSION: Most CPX items were easy according to the difficulty index, and the discrimination indices from the IRT and CTT models did not correlate well. IRT may provide more detailed information for polytomous items, but the checklists and scoring criteria should be reviewed carefully.
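For readers less familiar with the classical indices compared with IRT above, here is a small Python sketch of one common way to compute them. The abstract does not state which CTT formulas were used, so the definitions here are assumptions: difficulty as the mean item score divided by the maximum score, and discrimination as the correlation between the item score and the rest score (total minus item). The function names are hypothetical.

```python
from statistics import mean, pstdev

def difficulty(item_scores, max_score=1):
    """CTT difficulty index: mean item score as a fraction of the maximum.

    Higher values indicate an easier item (assumed definition).
    """
    return mean(item_scores) / max_score

def discrimination(item_scores, total_scores):
    """CTT discrimination index: Pearson correlation between the item
    score and the rest score, i.e. total score minus the item itself
    (assumed definition)."""
    rest = [t - x for x, t in zip(item_scores, total_scores)]
    mx, mr = mean(item_scores), mean(rest)
    sx, sr = pstdev(item_scores), pstdev(rest)
    if sx == 0 or sr == 0:
        # Correlation is undefined when either score has no variance.
        return float("nan")
    cov = mean([(x - mx) * (r - mr) for x, r in zip(item_scores, rest)])
    return cov / (sx * sr)
```

Using the rest score instead of the raw total avoids inflating the correlation by counting the item on both sides; other definitions, such as upper-lower group differences, are also in common use.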
Keywords:
Clinical performance examination; Item response theory; Classical test theory