I am trying to learn some classification in Scikit-learn. However, I couldn't figure out what this error means.
import pandas as pd
from sklearn.feature_extraction.text import CountVectorizer
data_frame = pd.read_csv('data.csv', header=0)
data_in_numpy = data_frame.values
c = CountVectorizer()
c.fit_transform(data_in_numpy.data)
This throws an error:
NotImplementedError: multi-dimensional sub-views are not implemented
How can I go around this issue? One record from my csv file looks like:
Time Directors Actors Rating Label
123 Abc, Def A, B,c 7.2 1
I suppose this error is due to the fact that there are more than one values under Directors or Actors column. Any help would be appreciated. Thanks,