Computes effect size measures of differential item functioning and differential test/bundle functioning based on expected scores from Meade (2010). Item parameters from both reference and focal group are used in conjunction with focal group empirical theta estimates (and an assumed normally distributed theta) to compute expected scores.
Arguments
- mod
a multipleGroup object which estimated only 2 groups. The first group in this object is assumed to be the reference group by default (i.e.,
ref.group = 1
), which conforms to theinvariance
arguments inmultipleGroup
- Theta.focal
an optional matrix of Theta values from the focal group to be evaluated. If not supplied the default values to
fscores
will be used in conjunction with the...
arguments passed- focal_items
a numeric vector indicating which items to include the tests. The default uses all of the items. Selecting fewer items will result in tests of 'differential bundle functioning' when
DIF = FALSE
- DIF
logical; return a data.frame of item-level imputation properties? If
FALSE
, only DBF and DTF statistics will be reported- npts
number of points to use in the integration. Default is 61
- theta_lim
lower and upper limits of the latent trait (theta) to be evaluated, and is used in conjunction with
npts
- plot
logical; plot expected scores of items/test where expected scores are computed using focal group thetas and both focal and reference group item parameters
- type
type of objects to draw in
lattice
; default plots both points and lines- par.strip.text
plotting argument passed to
lattice
- par.settings
plotting argument passed to
lattice
- ...
DIF
The default DIF = TRUE
produces several effect sizes indices at the item level.
Signed indices allow DIF favoring the focal group at one point on the theta
distribution to cancel DIF favoring the reference group at another point on the theta
distribution. Unsigned indices take the absolute value before summing or averaging,
thus not allowing cancellation of DIF across theta.
- SIDS
Signed Item Difference in the Sample. The average difference in expected scores across the focal sample using both focal and reference group item parameters.
- UIDS
Unsigned Item Difference in the Sample. Same as SIDS except absolute value of expected scores is taken prior to averaging across the sample.
- D-Max
The maximum difference in expected scores in the sample.
- ESSD
Expected Score Standardized Difference. Cohen's D for difference in expected scores.
- SIDN
Signed Item Difference in a Normal distribution. Identical to SIDS but averaged across a normal distribution rather than the sample.
- UIDN
Unsigned Item Difference in a Normal distribution. Identical to UIDS but averaged across a normal distribution rather than the sample.
DBF/DTF
DIF = FALSE
produces a series of test/bundle-level indices that are based on item-level
indices.
- STDS
Signed Test Differences in the Sample. The sum of the SIDS across items.
- UTDS
Unsigned Test Differences in the Sample. The sum of the UIDS across items.
- Stark's DTFR
Stark's version of STDS using a normal distribution rather than sample estimated thetas.
- UDTFR
Unsigned Expected Test Scores Differences in the Sample. The difference in observed summed scale scores expected, on average, across a hypothetical focal group with a normally distributed theta, had DF been uniform in nature for all items
- UETSDS
Unsigned Expected Test Score Differences in the Sample. The hypothetical difference expected scale scores that would have been present if scale-level DF had been uniform across respondents (i.e., always favoring the focal group).
- UETSDN
Identical to UETSDS but computed using a normal distribution.
- Test D-Max
Maximum expected test score differences in the sample.
- ETSSD
Expected Test Score Standardized Difference. Cohen's D for expected test scores.
References
Chalmers, R., P. (2012). mirt: A Multidimensional Item Response Theory Package for the R Environment. Journal of Statistical Software, 48(6), 1-29. doi:10.18637/jss.v048.i06
Meade, A. W. (2010). A taxonomy of effect size measures for the differential functioning of items and scales. Journal of Applied Psychology, 95, 728-743.
Author
Adam Meade, with contributions by Phil Chalmers rphilip.chalmers@gmail.com
Examples
if (FALSE) { # \dontrun{
# no DIF
set.seed(12345)
a <- matrix(abs(rnorm(15,1,.3)), ncol=1)
d <- matrix(rnorm(15,0,.7),ncol=1)
itemtype <- rep('2PL', nrow(a))
N <- 1000
dataset1 <- simdata(a, d, N, itemtype)
dataset2 <- simdata(a, d, N, itemtype, mu = .1, sigma = matrix(1.5))
dat <- rbind(dataset1, dataset2)
# ensure 'Ref' is the first group (and therefore reference group during estimation)
group <- factor(c(rep('Ref', N), rep('Focal', N)), levels = c('Ref', 'Focal'))
mod <- multipleGroup(dat, 1, group = group,
invariance = c(colnames(dat)[1:5], 'free_means', 'free_var'))
coef(mod, simplify=TRUE)
empirical_ES(mod)
empirical_ES(mod, DIF=FALSE)
empirical_ES(mod, DIF=FALSE, focal_items = 10:15)
empirical_ES(mod, plot=TRUE)
empirical_ES(mod, plot=TRUE, DIF=FALSE)
###---------------------------------------------
# DIF
set.seed(12345)
a1 <- a2 <- matrix(abs(rnorm(15,1,.3)), ncol=1)
d1 <- d2 <- matrix(rnorm(15,0,.7),ncol=1)
a2[10:15,] <- a2[10:15,] + rnorm(6, 0, .3)
d2[10:15,] <- d2[10:15,] + rnorm(6, 0, .3)
itemtype <- rep('dich', nrow(a1))
N <- 1000
dataset1 <- simdata(a1, d1, N, itemtype)
dataset2 <- simdata(a2, d2, N, itemtype, mu = .1, sigma = matrix(1.5))
dat <- rbind(dataset1, dataset2)
group <- factor(c(rep('Ref', N), rep('Focal', N)), levels = c('Ref', 'Focal'))
mod <- multipleGroup(dat, 1, group = group,
invariance = c(colnames(dat)[1:5], 'free_means', 'free_var'))
coef(mod, simplify=TRUE)
empirical_ES(mod)
empirical_ES(mod, DIF = FALSE)
empirical_ES(mod, plot=TRUE)
empirical_ES(mod, plot=TRUE, DIF=FALSE)
} # }