I have a highly unbalanced data with very scarce positive labels. The data is very high dimensional. On top of that my features are also very sparse.
So what would be the best way to do feature selection in this case. Any correlation measure rank based like spearmann or pearson correlation will not be a good one. Because most of my labels as well as features are zeros and it might seem that this feature is highly correlated or something even though it is not that much significant.
Any suggestion guys?