Issues with likelihood computation from grouped binomial glm using cbind

Asked Sep 16 '20 at 20:36

Active Sep 16 '20 at 20:36

Viewed 39 times

Is there an issue with the likelihood computation of the binomial model fit with glm when using cbind or am I missing something. See below for reproducible example:

set.seed(123)
n <- 100
sze <- 5
x <- rnorm(n, 0, 1)
p <- binomial()$linkinv(x)
y <- rbinom(n, sze, p)
tot <- rep(sze, n)
# Fit binomial model on grouped data
modA <- glm(cbind(y, tot - y) ~ x, family = binomial())
modA
# Ungroup the bernoulli trials
xy <- lapply(1:n, function(i) cbind(c(rep(1,y[i]), rep(0,sze-y[i])), c(rep(x[i], sze))))
xy <- do.call(rbind, xy)
# Fit binomial model on ungrouped 0/1 data
modB <- glm(xy[,1] ~ xy[,2], family = binomial())
modB
logLik(modA)
logLik(modB)

If I'm not mistaken, both models should be the same. Coefficient estimates are exactly the same. However, the log likelihood is different.

asked Sep 16 '20 at 20:36

user2506086

Do you know how to calculate the log likelihood by hand? – Dason Sep 16 '20 at 21:03
Thanks - point taken. The likelihood is different if you considered it grouped vs. ungrouped. – user2506086 Sep 16 '20 at 21:48

Issues with likelihood computation from grouped binomial glm using cbind

0 Answers0