Ou Lydia Liu, Brent Bridgeman, Lixiong Gu et al.
Research on examinees' response changes on multiple-choice tests over the past 80 years has yielded some consistent findings, including that most examinees make score gains by changing answers. This study expands the research on response ch...
Denis Cousineau, Louis Laurencelle
Existing tests of interrater agreements have high statistical power; however, they lack specificity. If the ratings of the two raters do not show agreement but are not random, the current tests, some of which are based on Cohen's kappa, wil...
Best Design for Multidimensional Computerized Adaptive Testing With the Bifactor Model
Dong Gi Seo, David J Weiss
Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CAT...
James A Wollack, Allan S Cohen, Carol A Eckerly
Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malf...
Applying the Nominal Response Model Within a Longitudinal Framework to Construct the Positive Family Relationships Scale
Kathleen Suzanne Johnson Preston, Skye N Parral, Allen W Gottfried et al.
A psychometric analysis was conducted using the nominal response model under the item response theory framework to construct the Positive Family Relationships scale. Using data from the Fullerton Longitudinal Study, this scale was construct...
Tenko Raykov, George A Marcoulides
A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations r...
Taking the Missing Propensity Into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions
Carmen Köhler, Steffi Pohl, Claus H Carstensen
When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into a...
Using a Model of Analysts' Judgments to Augment an Item Calibration Process
Carl Hauser, Yeow Meng Thum, Wei He et al.
When conducting item reviews, analysts evaluate an array of statistical and graphical information to assess the fit of a field test (FT) item to an item response theory model. The process can be tedious, particularly when the number of huma...
The Evidence for a Subscore Structure in a Test of English Language Competency for English Language Learners
Mark D Reckase, Jing-Ru Xu
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimen...
Reducing Bias and Error in the Correlation Coefficient Due to Nonnormality
Anthony J Bishara, James B Hittner
It is more common for educational and psychological data to be nonnormal than to be approximately normal. This tendency may lead to bias and error in point estimates of the Pearson correlation coefficient. In a series of Monte Carlo simulat...