I am new to data science & machine learning, so I'll write my question in detail.
I have an imbalanced dataset (binary classification dataset), and I want to apply these methods by using Weka paltform:
- 10-Fold cross validation.
- Oversampling to balance the data.
- A Wrapper feature selection method.
- 6 classifiers and compare between their performance.
I want to apply them under these conditions:
- Balancing the data before applying a feature selection method (reference).
- Balancing the data during cross validation (reference).
What is the correct procedure?
I've written a post below with a suggested procedure.