I have a dataframe that contains answers to many questions.
Each row represents an answer-er and the columns are the answers to the questions given Because people often spam those questionnaires sometimes there are answer-ers that give the same answer many times like ''yes good'', ''yes good''....
I would like to remove those rows where same answers are repeated more than once or twice (because a single repetition could be coincidence)
My dataframe looks like this: Questions differ from file to file but always column 0 is ID and all rest columns are questions and their number vary.
ID , Question 1 , Question 2 , Question 3 , Question 4 , ...
Id1 , Ans. str1 ,Ans. string2 ,Ans. string3 , Ans. string4 , ...
Id2 , Ans. str1 ,Ans. string2 ,Ans. string3 , Ans. string4 , ...
Id3 , Ans. str1 ,Ans. string2 ,Ans. string3 , Ans. string4 , ...
Id4 , Ans. str1 ,Ans. string2 ,Ans. string3 , Ans. string4 , ...
What I need is to drop rows that contain same answers to more than one questions Idealy i would like to be able to adjust the number of identical answers found that for a row to be dropped. Because when you have big questionnaires 2 answers can be same without being a spammer. If such case is not easy lets try to drop when any 2 are same.