I have 3 datasets in csv problem.csv with attributes:
- id
- level
- accuracy
- solved_count
- error_count
- tag1
- tag2
- tag3
- tag4
- tag5
Submission.csv with attibutes:
- user_id
- problem_id
- solved_status
user.csv with
- user_id
- solved_count
- attempts
I want to now predict on a test dataset that whether user will be able to solve a problem or not.
I was thinking of applying Naive Bayes Classification. But i don't know how to approach this problem. I suppose i have to make a common dataset in arff for use with Weka or sckit learn. Give me some idea of how i can approach this problem.