To the statistic experts out there: I am getting a headache by thinking about the interpretation of a regression.
If you test anomalies you can to this by using a dummy variable D in the regression. Let's say you want to find out if a specific day reacts not normal. Because we have the feeling we make more money on Fridays. The regression looks like:
Return/Earnings = a + b1 DMonday + b2 DTuesday + b3 DWednesday + b4 DThursday + b5 DFriday + e
For sure your earnings depend on other things, such as number of customers, price level, weather, ...who knows...
Let's say b5 has got a p-value close to zero. But R2 is zero as well. How can I interpret this result?
Saying the whole model can't forecast earnings because R2 is zero!? Makes sense to me. On the other hand I can say Friday is signifcantly better than other days. Makes sense as well to me.
But I don't understand why, if Friday is significant, the whole model has a R2 close to zero. I know that some people use ANOVA and Kruskal Wallis. But I know that Regressions are used often. I just don't get the idea behind it. Any interpretation would be very much appreciated.
PS: add on - why do some people drop the monday in the regression? Okay, the monday works as a reference in this case. I can understand this. But what's the advantage of doing this? Isn't the result the same?