I have data on a user's birthplace, specifically a city. Since I have a few thousand cities in my dataset, I looked for alternatives of OneHot encoding, since I didn't want to add thousands of columns to my dataset for a single column. I found BaseN encoding is a good alternative to OneHot, so I went with that. I encoded my data with base 4, so instead of a string column City now I have cumeric columns City_0, City_1 etc.
However, after modeling the dataset with a Random Forest Classifier, I have found that certain City_#
variables are amongst the most important features. But how do I interpret this result? Since the cities have been encoded into 4 separate columns, how can I draw an actual conclusion (e.g. which cities impact my target variable the most)? Is there a method, or did I completely lose interpretability by encoding the cities this way?