I'm trying to build a predictive model using the Banking Dataset - Marketing Targets from Kaggle. Here is the link: https://www.kaggle.com/datasets/prakharrathi25/banking-dataset-marketing-targets
The dataset from Kaggle has already been split into a train CSV and a test CSV, but both files have the target column `y`. Should I concatenate them into one data frame before starting EDA and preprocessing, and then use `train_test_split` from the sklearn library when creating the model?
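To make the first question concrete, here is a minimal sketch of the workflow I have in mind. The two small data frames are made-up stand-ins for the Kaggle CSVs (the columns `age` and `balance` and all values are placeholders, not real data), and the `test_size` and `random_state` values are arbitrary choices of mine:

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Made-up stand-ins for the two Kaggle CSVs; both contain the target column "y".
train_df = pd.DataFrame({
    "age": [30, 45, 52, 23, 38, 61],
    "balance": [1200, 300, 0, 540, 2100, 75],
    "y": ["no", "yes", "no", "no", "yes", "no"],
})
test_df = pd.DataFrame({
    "age": [41, 29],
    "balance": [860, 15],
    "y": ["no", "yes"],
})

# Step 1: stack the two files into a single data frame.
full_df = pd.concat([train_df, test_df], ignore_index=True)

# Step 2: (EDA and preprocessing would happen here.)

# Step 3: separate features from the target and create my own split.
X = full_df.drop(columns="y")
y = full_df["y"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42
)
```

I'm not sure whether this is the right approach or whether I should keep the provided split as it is.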
My second question: I've also seen datasets on Kaggle, like this one: https://www.kaggle.com/datasets/anmolkumar/health-insurance-cross-sell-prediction, that are also already split into a train CSV and a test CSV. The difference is that the test CSV doesn't have the target column (`Response`). That makes me think I can't concatenate the two files in that case.
Can someone please explain how I should handle each of these two cases?