I am looking to model the binary probability of someone to enroll/not enroll in an energy saving program. My data and formula look like this:
ID Buildingtype Energyconsumption Enrollment Zip Code
1 Detached 2000 1 1111
2 Detached 2200 0 2222
3 Semi Detached 1700 0 2299
4 Detached 1500 1 3902
glm.fit <- glm(Enrollment ~ Buildingtype + Energyconsumption, data = df, family = "binomial")
Since the dataset is big and I have over 300 zip codes, how can I add this variable into the formula so I can account for observed and unobserved locational characteristics from the areas?