Refining the asymptotically correct standardization of person-fit statistics for mixed-format tests [0.03%]
改进混合题型测试的人格拟合统计的渐近正确标准
Sandip Sinharay
Sandip Sinharay
Sinharay (Psychometrika, 2016, 81, 992) suggested the asymptotically correct standardized version of a class of person-fit statistics for mixed-format tests. This paper provides an alternative and arguably simpler derivation of the standard...
Blending substantive and methodological expertise into statistical models: Longitudinal model development [0.03%]
内容专家与方法专家合作进行纵向模型开发
Kevin J Grimm,Russell Houpt,Maggie Cleaver et al.
Kevin J Grimm et al.
Study design and subsequent data analysis is ideally a collaborative endeavour between applied researchers and statistical experts (e.g. methodologists, data scientists). Applied researchers know what research questions need to be answered ...
Leah M Feuerstahler,Jay Verkuilen,Fabio Setti et al.
Leah M Feuerstahler et al.
Asymmetric item response theory (asymIRT) has emerged as an important extension of classical IRT, motivated by empirical evidence and theoretical arguments that symmetric item response functions (IRFs) often inadequately describe real respo...
Nikola Sekulovski,Meike Waaijers,Giuseppe Arena
Nikola Sekulovski
In the Bayesian graphical modeling framework, priors on network structure encode theoretical assumptions and uncertainty about the topology of psychological constructs under study. For instance, the Bernoulli prior specifies the probability...
To vary or not to vary: A flexible empirical Bayes factor for testing variance components [0.03%]
变或不变:一种灵活的贝叶斯因子用于测试方差分量
Fabio Vieira,Hongwei Zhao,Joris Mulder
Fabio Vieira
Random effects are the gold standard for capturing structural heterogeneity, such as individual differences or temporal dependence. Yet testing their presence is difficult because variance components are constrained to be non-negative, crea...
Asymptotic standard errors for reliability coefficients in item response theory [0.03%]
项目反应理论中可靠性系数的渐近标准误差
Youjin Sung,Yang Liu
Youjin Sung
In a recent review, Liu et al. (Psychological Methods, 2025b) classified reliability coefficients into two types: classical test theory (CTT) reliability and proportional reduction in mean squared error (PRMSE). This article focuses on quan...
Estimating the reliability of round-robin judgments with social relations confirmatory factor analyses [0.03%]
社交关系确认性因素分析下循环判断可靠性的估计
Steffen Nestler,Oliver Lüdtke,Alexander Robitzsch
Steffen Nestler
The social relations model (SRM) is commonly used in psychological research to analyse interdependent data from round-robin designs, where all members of a group rate each other. Based on the recently suggested social relations confirmatory...
A cognitive diagnosis model for latent classification of bounded continuous variables [0.03%]
一种用于界定连续变量潜在类别的认知诊断模型
Eduardo S B de Oliveira,Xiaojing Wang,Jorge L Bazán et al.
Eduardo S B de Oliveira et al.
Cognitive Diagnosis Models (CDMs) are widely used in latent-variable modeling for classification tasks that diagnose abilities or skills. Originally developed for dichotomous indicators, CDMs have been extended to polytomous and continuous ...
Jeffrey N Rouder,Mahbod Mehrvarz,Martin Schnuerch
Jeffrey N Rouder
We are concerned about an emphasis on reliability for analysis of psychology experiments. Experiments have two elements of sample size: the number of individuals and the number of replicate trials within a task, and that complicates reliabi...
Using multilabel classification neural network to detect intersectional DIF with small sample sizes [0.03%]
基于多标签分类神经网络的小样本交叉多元差别影响检测方法
Yale Quan,Chun Wang
Yale Quan
This study introduces InterDIFNet, a multilabel classification neural network for detecting intersectional differential item functioning (DIF) in educational and psychological assessments, with a focus on small sample sizes. Unlike traditio...