I am working on Titanic dataset. I am trying to fill in blanks in CABIN column. I have extracted first letter from CABIN column values, then put those in CABIN_NEW column. After that I use rpart
for prediction, but somehow every time I run the code below, R takes a lot of time (haven't finished one time yet, every time, I have to terminate it).
DATAset has 1309 rows and the columns I am using are below in code. The system I am using is running on 4 GB Ram, i5 processor and Window 7.
combifit <- rpart(Cabin_New ~ Pclass + Sex + Age + SibSp + Parch + Fare + Embarked + Title
+ FamilySize + Surname + FamilyID,
data = combi[!is.na(combi$Cabin_New),], method = 'class')