I am having trouble calculating standard errors (SE) for my survey. Here is an example of what I want to do; I have tried using the survey package in R. (In the example below, fpc equals the number of observations in each stratum.)
Code to generate data:
id = c(1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
strata = c(6, 6, 6, 7, 7, 7, 8, 8, 8, 8, 8, 8)
weight = c(60, 75, 85, 140, 170, 175, 270, 310, 325, 785, 1450, 3920)
fpc = c(8, 8, 8, 7, 7, 7, 6, 6, 6, 6, 6, 6)
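answer = c("2", "2", "3", "1", "2", NA, NA, "2", "3", NA, "1", NA)
# stringsAsFactors = TRUE makes answer a factor (no longer the default in R >= 4.0)
df = data.frame(id, strata, weight, fpc, answer, stringsAsFactors = TRUE)
# drop the observations with a missing answer
df <- df[complete.cases(df), ]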
I then try to calculate the mean and SE using the survey package:
library(survey)
dstrat <- svydesign(id = ~1, strata = ~strata, weights = ~weight, data = df, fpc = ~fpc)
svymean(~answer, dstrat)
           mean     SE
answer1 0.60803 0.2573
answer2 0.23518 0.1755
answer3 0.15679 0.1479
My first question is: How can I take into account the weights of the observations that did not answer in my study? In my example above I remove the NA observations before running the function, but I would like to include this information. I assume that the SE will be bigger or smaller depending on whether I have answers for the observations with the largest weights?
My second question is: How can I calculate an SE for a "net value"? Assume:
answer1 = good
answer2 = neutral
answer3 = bad
I can calculate the "net value" as answer1 - answer3 = 0.60803 - 0.15679 = 0.45124. How can I get the SE for this "net value"?
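To make the second question concrete, here is a minimal sketch of the kind of calculation I have in mind. It assumes that svycontrast() from the survey package is a suitable tool for a linear combination of the svymean() estimates, and that the three estimated proportions are correlated, so their SEs cannot simply be combined as if they were independent. The object names m, net and V are only for illustration.

```r
library(survey)

# Same data as in the example above, with answer stored as a factor
df <- data.frame(
  id     = 1:12,
  strata = c(6, 6, 6, 7, 7, 7, 8, 8, 8, 8, 8, 8),
  weight = c(60, 75, 85, 140, 170, 175, 270, 310, 325, 785, 1450, 3920),
  fpc    = c(8, 8, 8, 7, 7, 7, 6, 6, 6, 6, 6, 6),
  answer = factor(c("2", "2", "3", "1", "2", NA, NA, "2", "3", NA, "1", NA))
)
df <- df[complete.cases(df), ]

dstrat <- svydesign(id = ~1, strata = ~strata, weights = ~weight,
                    data = df, fpc = ~fpc)
m <- svymean(~answer, dstrat)

# Linear contrast: +1 * answer1, 0 * answer2, -1 * answer3
net <- svycontrast(m, c(1, 0, -1))
net

# The same SE by hand from the estimated covariance matrix:
# Var(p1 - p3) = Var(p1) + Var(p3) - 2 * Cov(p1, p3)
V <- vcov(m)
sqrt(V[1, 1] + V[3, 3] - 2 * V[1, 3])
```

If this is the right approach, the two SEs at the end should agree, since the contrast is just the difference of two estimates with a known covariance.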