In the image attached below, the genre column has multiple attributes for a single entry. I am trying to build a neural network model and for that I need to encode it. I am having problems regarding that.
Asked
Active
Viewed 73 times
-1
-
1Where is the image. – codester_09 Jan 20 '22 at 07:17
-
Read this post: https://www.geeksforgeeks.org/how-to-create-dummy-variables-in-python-with-pandas/ – Gedas Miksenas Jan 20 '22 at 07:22
-
Please clarify your specific problem or provide additional details to highlight exactly what you need. As it's currently written, it's hard to tell exactly what you're asking. – Community Jan 29 '22 at 11:32
1 Answers
0
There are tree options:
- One Hot Encoding: each label is mapped to a binary vector;
- Integer Encoding: each unique label is mapped to an integer;
- Use Embedding: will map each of the unique category into a N-dimensional vector;

Danylo Baibak
- 2,106
- 1
- 11
- 18
-
Yes, I knew that but can you please elaborate on how to apply that on a column. For instance, I have a skills column where there are multiple skills in the form of an array in a single row and the unique skills are more than 100. Should I apply one-hot encoding if I do that the number of columns will increase and I do not know how that will affect the neural network model. – KAUSHAL KUMAR Jan 25 '22 at 06:52