4.4 GDINA model estimation
To estimate G-DINA model, call GDINA function and specify the data and Q-matrix as the first two arguments.
Code
# Fit the data using G-DINA model
<- GDINA(dat = data1, Q = Q1, verbose = 0) fit1
Returned fit1 contains (almost) all estimation results. It is called an object of class GDINA, and we can apply various methods to it. To print some general model estimation information, type fit1 in Rstudio console:
Code
fit1
## Call:
## GDINA(dat = data1, Q = Q1, verbose = 0)
##
## GDINA version 2.9.3 (2022-08-17)
## ===============================================
## Data
## -----------------------------------------------
## # of individuals groups items
## 837 1 15
## ===============================================
## Model
## -----------------------------------------------
## Fitted model(s) = GDINA
## Attribute structure = saturated
## Attribute level = Dichotomous
## ===============================================
## Estimation
## -----------------------------------------------
## Number of iterations = 288
##
## For the final iteration:
## Max abs change in item success prob. = 0.0001
## Max abs change in mixing proportions = 0.0000
## Change in -2 log-likelihood = 0.0001
## Converged? = TRUE
##
## Time used = 0.3023 secs
As shown above, some information about the data, fitted model and calibration was printed. Such information can be used to check the model specification and evaluate convergence (By default, the maximum iterations allowed is 2000).
summary is another function that can be applied to objects of class GDINA. Specify fit1 as the input:
Code
summary(fit1)
##
## Test Fit Statistics
##
## Loglik = -7431.45
##
## AIC = 14996.91 | penalty [2 * p] = 134.00
## BIC = 15313.81 | penalty [log(n) * p] = 450.90
## CAIC = 15380.81 | penalty [(log(n) + 1) * p] = 517.90
## SABIC = 15101.04 | penalty [log((n + 2)/24) * p] = 238.13
##
## No. of parameters (p) = 67
## No. of estimated item parameters = 60
## No. of fixed item parameters = 0
## No. of distribution parameters = 7
##
## Attribute Prevalence
##
## Level0 Level1
## A1 0.37 0.63
## A2 0.46 0.54
## A3 0.29 0.71
As shown above, the summary function prints some test fit statistics,
like AIC and BIC, the number of parameters and attribute prevalence. The
attribute prevalence gives the proportion of individuals who master (or
do not master) each attribute, labelled as Level1'' (or
Level0’’).
To extract item parameters, we can use coef function, as in
Code
coef(fit1)
## $`Item 1`
## P(0) P(1)
## 0.93 0.98
##
## $`Item 2`
## P(0) P(1)
## 0.26 0.82
##
## $`Item 3`
## P(0) P(1)
## 0.54 0.82
##
## $`Item 4`
## P(0) P(1)
## 0.48 0.87
##
## $`Item 5`
## P(00) P(10) P(01) P(11)
## 0.29 0.73 0.54 0.89
##
## $`Item 6`
## P(00) P(10) P(01) P(11)
## 0.157 0.441 0.046 0.757
##
## $`Item 7`
## P(00) P(10) P(01) P(11)
## 0.0001 0.5172 0.5316 0.7762
##
## $`Item 8`
## P(00) P(10) P(01) P(11)
## 0.04 0.67 0.35 0.80
##
## $`Item 9`
## P(00) P(10) P(01) P(11)
## 0.094 0.407 0.345 0.682
##
## $`Item 10`
## P(00) P(10) P(01) P(11)
## 0.43 0.71 0.36 0.77
##
## $`Item 11`
## P(00) P(10) P(01) P(11)
## 0.49 0.62 0.73 0.91
##
## $`Item 12`
## P(00) P(10) P(01) P(11)
## 0.19 0.34 0.36 0.68
##
## $`Item 13`
## P(00) P(10) P(01) P(11)
## 0.20 0.19 0.31 0.59
##
## $`Item 14`
## P(000) P(100) P(010) P(001) P(110) P(101) P(011) P(111)
## 0.076 1.000 0.024 0.236 0.612 0.411 0.087 0.650
##
## $`Item 15`
## P(000) P(100) P(010) P(001) P(110) P(101) P(011) P(111)
## 0.0989 0.7047 0.0072 0.3002 0.3321 0.3307 0.2473 0.6451
The returned output by coef (when we only specify the first argument) is item success probabilities for each latent group (reduced latent classes).
coef has a few arguments including what,with SE and SE.type. what specifies what to show; It can be “itemprob” for item success probabilities, “gs” for guessing and slip parameters, “delta” for delta parameters, “rrum” for RRUM parameters when items are estimated using RRUM, “LCprob” for item success probabilities for all latent classes, and “lambda” for structural parameters of joint attribute distribution (such as higher-order structural parameters).
Set withSE=TRUE if you want to estimate standard errors. Note that standard errors can only be calculated for item success probabilities, guessing and slip parameters, and delta parameters. For other parameters, you can use function bootSE to calculate standard errors using bootstrap procedures. SE.type can be 1, 2 or 3, indicating outer product of gradient (OPG) estimates based on itemwise, incomplete or complete information matrix. Default is 2. Currently, the OPG method based on the complete information matrix assumes that all latent classes are identifiable and is only available for saturated joint attribute distribution.
Code
# item success probabilities for item 15
coef(fit1, withSE = TRUE)[[15]]
## P(000) P(100) P(010) P(001) P(110) P(101) P(011) P(111)
## Est. 0.099 0.70 0.0072 0.300 0.33 0.331 0.247 0.645
## S.E. 0.086 0.15 0.1438 0.055 0.06 0.064 0.094 0.045
Code
# delta parameters for item 15
coef(fit1, what = "delta", withSE = TRUE)[[15]]
## d0 d1 d2 d3 d12 d13 d23 d123
## Est. 0.099 0.61 -0.092 0.20 -0.28 -0.58 0.039 0.65
## S.E. 0.086 0.18 0.174 0.12 0.26 0.22 0.241 0.33
Code
# population proportions when saturated model is used for joint
# attribute distribution
coef(fit1, what = "lambda")
## p(000) p(100) p(010) p(001) p(110) p(101) p(011) p(111)
## 0.061 0.031 0.037 0.186 0.157 0.181 0.084 0.262
To plot the probabilities of success for each item \(P(X_{ij}=1|\mathbf{\alpha}_{lj}^*)\), we can use plot:
Code
plot(fit1, what = "IRF", item = c(9, 15))
If we want to add an error bar to the IRF plot of item 15, using the following code:
Code
plot(fit1, what = "IRF", item = c(9, 15), withSE = TRUE)
To obtain attribute estimates, use the function personparm:
Code
head(personparm(fit1))
## A1 A2 A3
## [1,] 1 1 1
## [2,] 0 0 1
## [3,] 1 1 0
## [4,] 1 1 0
## [5,] 1 0 0
## [6,] 1 1 0
You can also plot probabilities of mastering each attribute for individuals 1, 15, 20 and 25:
Code
plot(fit1, what = "mp", person = c(1, 15, 20, 25))
To extract other elements, we should use extract function. Its first input should be an object returned from GDINA function (or other functions), and its second input (i.e., what) specifies what to extract. See its help page ?extract for a complete list of elements that can be extracted.
For example, to calculate the discrimination indices, the following code can be used:
Code
::extract(fit1, "discrim") GDINA
## P(1)-P(0) GDI
## Item 1 0.052 0.00062
## Item 2 0.559 0.06395
## Item 3 0.287 0.02043
## Item 4 0.390 0.03111
## Item 5 0.604 0.05942
## Item 6 0.600 0.08040
## Item 7 0.776 0.09399
## Item 8 0.764 0.06416
## Item 9 0.589 0.03483
## Item 10 0.341 0.03629
## Item 11 0.418 0.01827
## Item 12 0.496 0.03461
## Item 13 0.393 0.03068
## Item 14 0.574 0.05892
## Item 15 0.546 0.03519
To calculate the proportion of individuals in each latent class, we can use:
Code
::extract(fit1, "posterior.prob") GDINA
## 000 100 010 001 110 101 011 111
## [1,] 0.062 0.031 0.037 0.19 0.16 0.18 0.084 0.26