I have created a data-set of various movies produced in the past few years, technicians worked for the film, genre, country it represented, runtime, language, the respective film festival that film has won, etc.
the data-set is similar to this, it is an excel file.
I'm interested in multi-label classification of the movies to film festivals based on the inherent features of the movie(irrespective of the plot)
I thought we need to work in numbers/vectors to multi-label classify the data. But, I'm unaware of how vectorization of names(proper nouns) and few individual words can be carried out.
Is there any other way I can carry out the process to achieve my goal of multi-label classification with the above data? Please help me identify it. Thank you.