I am using multi-variate guassian distribution to analyze abnormality. This is how the training set looks
19-04-16 05:30:31 1 0 0 377816 305172 5567044 0 0 0 14 62 75 0 0 100 0 0
<Date> <time> <--------------------------- ------- Features --------------------------->
Lets say one of the above features do not change, they remain zero.
Calculation mean = mu
mu = mean(X)'
Calculating sigma2 as
sigma2 = ((1/m) * (sum((X - mu') .^ 2)))'
Probability of individual feature in each data set is calculated using standard gaussian formula as
For a particular feature, if all values come out to be zero, then mean (mu) is also zero. Subsequently sigma2 will also be zero. Thereby when I calculate the probability through gaussian distribution, I would get a "device by zero" problem.
However, in test sets, this feature value can fluctuate and I would like term that as a an abnormality. How, should this be handled? I dont want to ignore such a feature.