I have a csv file with 3483 lines and 460K characters and 65K words, and I'm trying to use this corpus to train a NaiveBayes classifier in Scikit-learn.
The problem is when I use this statement below, takes too long (1 hour and did not finish).
from textblob import TextBlob
from textblob.classifiers import NaiveBayesClassifier
import csv
with open('train.csv', 'r') as fp:
cl = NaiveBayesClassifier(fp, format="csv")
Any guesses of what I doing wrong?
Thanks in advance.