Reliability as Projection in Operator-Theoretic Test Theory: Conditional Expectation, Hilbert Space Geometry, and Implications for Psychometric Practice [0.03%]
基于算子理论的测验可靠性的投影性质:条件期望、希尔伯特空间几何以及心理测量学启示
Bruno D Zumbo
Bruno D Zumbo
This article reconceptualizes reliability as a theorem derived from the projection geometry of Hilbert space rather than an assumption of classical test theory. Within this framework, the true score is defined as the conditional expectation...
Agreement Lambda for Weighted Disagreement With Ordinal Scales: Correction for Category Prevalence [0.03%]
带有序量表的加权不一致协议 lambda 的修正:分类普遍性的影响
Rashid Saif Almehrizi
Rashid Saif Almehrizi
Weighted inter-rater agreement allows for differentiation between levels of disagreement among rating categories and is especially useful when there is an ordinal relationship between categories. Many existing weighted inter-rater agreement...
On the Complex Sources of Differential Item Functioning: A Comparison of Three Methods [0.03%]
论项目功能差异的复杂来源:三种方法的比较
Haeju Lee,Sijia Huang,Dubravka Svetina Valdivia et al.
Haeju Lee et al.
Differential item functioning (DIF) has been a long-standing problem in educational and psychological measurement. In practice, the source from which DIF originates can be complex in the sense that an item can show DIF on multiple backgroun...
An Evaluation of the Replicable Factor Analytic Solutions Algorithm for Variable Selection: A Simulation Study [0.03%]
一种可重复因素分析解算法的评估:一项模拟研究
Daniel A Sass,Michael A Sanchez
Daniel A Sass
Observed variable and factor selection are critical components of factor analysis, particularly when the optimal subset of observed variables and the number of factors are unknown and results cannot be replicated across studies. The Replica...
Coefficient Lambda for Interrater Agreement Among Multiple Raters: Correction for Category Prevalence [0.03%]
考虑类别流行率的多评价人者一致性系数Lambda的校正公式
Rashid Saif Almehrizi
Rashid Saif Almehrizi
Fleiss's Kappa is an extension of Cohen's Kappa, developed to assess the degree of interrater agreement among multiple raters or methods classifying subjects using categorical scales. Like Cohen's Kappa, it adjusts the observed proportion o...
Common Persons Design in Score Equating: A Monte Carlo Investigation [0.03%]
共同反应人特征设计在分数等值中的应用:一个蒙特卡罗研究
Jiayi Liu,Zhehan Jiang,Tianpeng Zheng et al.
Jiayi Liu et al.
The Common Persons (CP) equating design offers critical advantages for high-security testing contexts-eliminating anchor item exposure risks while accommodating non-equivalent groups-yet few studies have systematically examined how CP chara...
Path Analysis With Mixed-Scale Variables: Categorical ML, Least Squares, and Bayesian Estimations [0.03%]
混合尺度变量的路径分析:类别ML、最小二乘和贝叶斯估计
Xinya Liang,Paula Castro,Chunhua Cao et al.
Xinya Liang et al.
In applied research across education, the social and behavioral sciences, and medicine, path models frequently incorporate both continuous and ordinal manifest variables to predict binary outcomes. This study employs Monte Carlo simulations...
Correcting the Variance of Effect Sizes Based on Binary Outcomes for Clustering [0.03%]
基于二元结果的效应大小的方差的聚类修正
Larry V Hedges
Larry V Hedges
Researchers conducting systematic reviews and meta-analyses often encounter studies in which the research design is a well conducted cluster randomized trial, but the statistical analysis does not take clustering into account. For example, ...
Network Approaches to Binary Assessment Data: Network Psychometrics Versus Latent Space Item Response Models [0.03%]
二元评估数据的网络方法:网络心理测量学VS潜在空间项目响应模型
Ludovica De Carolis,Minjeong Jeon
Ludovica De Carolis
This study compares two network-based approaches for analyzing binary psychological assessment data: network psychometrics and latent space item response modeling (LSIRM). Network psychometrics, a well-established method, infers relationshi...
Guessing During Testing is a Person Attribute Not an Instrument Parameter [0.03%]
测试中的猜测是人的属性而不是仪器的参数
Georgios D Sideridis,Mohammed Alghamdi
Georgios D Sideridis
The three-parameter logistic (3PL) model in item-response theory (IRT) has long been used to account for guessing in multiple-choice assessments through a fixed item-level parameter. However, this approach treats guessing as a property of t...