Confidence Screening Detector: A New Method for Detecting Test Collusion [0.03%]
一种新的检测考试作弊的方法:置信筛查检测器方法
Yongze Xu,Ying Cui,Xinyi Wang et al.
Yongze Xu et al.
Test collusion (TC) is a form of cheating in which, examinees operate in groups to alter normal item responses. TC is becoming increasingly common, especially within high-stakes, large-scale examinations. However, research on TC detection m...
A Likelihood Approach to Item Response Theory Equating of Multiple Forms [0.03%]
基于项目响应理论的多等值参照元真题卷 equate 的似然方法研究
Michela Battauz,Waldir Leôncio
Michela Battauz
Test equating is a statistical procedure to make scores from different test forms comparable and interchangeable. Focusing on an IRT approach, this paper proposes a novel method that simultaneously links the item parameter estimates of a la...
On the Folly of Introducing A (Time-Based UMV), While Designing for B (Time-Based CMV) [0.03%]
论在为B(基于时间的CMV)设计时引入A(基于时间的UMV)的愚蠢性
Alice Brawley Newlin
Alice Brawley Newlin
Enhancing Computerized Adaptive Testing with Batteries of Unidimensional Tests [0.03%]
利用一系列一维测试增强计算机自适应测试的效果
Pasquale Anselmi,Egidio Robusto,Francesca Cristante
Pasquale Anselmi
The article presents a new computerized adaptive testing (CAT) procedure for use with batteries of unidimensional tests. At each step of testing, the estimate of a certain ability is updated on the basis of the response to the latest admini...
A New Approach to Desirable Responding: Multidimensional Item Response Model of Overclaiming Data [0.03%]
一种新的有效反应方法:过度宣称数据多维项目反应模型
Kuan-Yu Jin,Delroy L Paulhus,Ching-Lin Shih
Kuan-Yu Jin
A variety of approaches have been presented for assessing desirable responding in self-report measures. Among them, the overclaiming technique asks respondents to rate their familiarity with a large set of real and nonexistent items (foils)...
Heywood Cases in Unidimensional Factor Models and Item Response Models for Binary Data [0.03%]
一维因素分析与二分法响应模型中的Heywood案例研究
Selena Wang,Paul De Boeck,Marcel Yotebieng
Selena Wang
Heywood cases are known from linear factor analysis literature as variables with communalities larger than 1.00, and in present day factor models, the problem also shows in negative residual variances. For binary data, factor models for ord...
The Effects of Rating Designs on Rater Classification Accuracy and Rater Measurement Precision in Large-Scale Mixed-Format Assessments [0.03%]
评分方法对大规模混合题型考试中评价者分类准确性及评价者测量精确性的影响研究
Wenjing Guo,Stefanie A Wind
Wenjing Guo
In standalone performance assessments, researchers have explored the influence of different rating designs on the sensitivity of latent trait model indicators to different rater effects as well as the impacts of different rating designs on ...
Targeted Double Scoring of Performance Tasks Using a Decision-Theoretic Approach [0.03%]
基于决策理论的绩效任务双重评分方法研究
Sandip Sinharay,Matthew S Johnson,Wei Wang et al.
Sandip Sinharay et al.
Targeted double scoring, or, double scoring of only some (but not all) responses, is used to reduce the burden of scoring performance tasks for several mastery tests (Finkelman, Darby, & Nering, 2008). An approach based on statistical decis...
Evaluating Equating Transformations in IRT Observed-Score and Kernel Equating Methods [0.03%]
基于IRT的观察分数和核等值法中的等值变换评价研究
Waldir Leôncio,Marie Wiberg,Michela Battauz
Waldir Leôncio
Test equating is a statistical procedure to ensure that scores from different test forms can be used interchangeably. There are several methodologies available to perform equating, some of which are based on the Classical Test Theory (CTT) ...
A Comparison of Confirmatory Factor Analysis and Network Models for Measurement Invariance Assessment When Indicator Residuals are Correlated [0.03%]
当指标残差相关时,验证因素分析与网络模型在测量不变性评估中的比较研究
W Holmes Finch,Brian F French,Alicia Hazelwood
W Holmes Finch
Social science research is heavily dependent on the use of standardized assessments of a variety of phenomena, such as mood, executive functioning, and cognitive ability. An important assumption when using these instruments is that they per...