
I am working on a project that uses this dataset: https://www.kaggle.com/aljarah/xAPI-Edu-Data. I want to cluster the students (each student is one row, identified by its index) based on attributes of the dataset such as raised hands, visited resources, announcements viewed, etc. Please suggest how I can implement this using DBSCAN; if DBSCAN is not suitable, please propose another technique I could use. I am a newbie in the field of data science.

Thanks

I tried studying GMM and DBSCAN.

I want to do clustering on a dataset.

1 Answer

Any standard implementation of DBSCAN will support multiple attributes.
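As a minimal sketch of what "multiple attributes" looks like in practice, here is scikit-learn's `DBSCAN` run on a small synthetic matrix where each row stands in for one student and each column for one numeric attribute. The data is made up, and the `eps` and `min_samples` values, the standardization step, and the default Euclidean metric are all placeholder assumptions, not recommendations for the real data set.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-in: 100 "students" x 4 numeric attributes
# (think raised hands, visited resources, announcements viewed, discussion).
low_activity = rng.normal(loc=20, scale=3, size=(50, 4))
high_activity = rng.normal(loc=80, scale=3, size=(50, 4))
X = np.vstack([low_activity, high_activity])

# Put every attribute on a comparable scale (an assumption, see lead-in).
X_scaled = StandardScaler().fit_transform(X)

# eps and min_samples are illustrative guesses and need tuning on real data.
labels = DBSCAN(eps=0.5, min_samples=5).fit_predict(X_scaled)

# labels[i] is the cluster of row i; -1 marks points DBSCAN treats as noise.
n_clusters = len(set(labels) - {-1})
print("clusters found:", n_clusters)
```

With the real CSV you would probably load it with pandas and select the numeric columns instead of generating `X`; the row index of the resulting array then identifies the student.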

Mostly it will depend on how you decide to measure similarity when the attributes have very different types. Euclidean distance will likely not make sense. But there is no "correct" way of doing this; it is your decision how to model the data. On this data set the choice will be rather arbitrary, unfortunately, because these attributes have no natural scale.
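To make the "your decision" part concrete, here is one possible (and equally arbitrary) choice: a Gower-style dissimilarity that range-normalizes numeric attributes and uses a simple match/mismatch for categorical ones. This is a swapped-in illustration, not something the data set prescribes. The three records are hypothetical, and the keys `raisedhands`, `VisITedResources` and `gender` are assumed to match the CSV's column names.

```python
def gower_distance(a, b, numeric_ranges):
    """Average per-attribute dissimilarity between two records (dicts)."""
    total = 0.0
    for key in a:
        if key in numeric_ranges:
            # Numeric attribute: absolute difference divided by its range.
            span = numeric_ranges[key]
            total += abs(a[key] - b[key]) / span if span else 0.0
        else:
            # Categorical attribute: 0 if equal, 1 otherwise.
            total += 0.0 if a[key] == b[key] else 1.0
    return total / len(a)


# Hypothetical records, for illustration only.
students = [
    {"raisedhands": 10, "VisITedResources": 5, "gender": "M"},
    {"raisedhands": 12, "VisITedResources": 7, "gender": "M"},
    {"raisedhands": 90, "VisITedResources": 95, "gender": "F"},
]

numeric_keys = ["raisedhands", "VisITedResources"]
numeric_ranges = {
    k: max(s[k] for s in students) - min(s[k] for s in students)
    for k in numeric_keys
}

# Full pairwise distance matrix, one row and one column per student.
distances = [
    [gower_distance(a, b, numeric_ranges) for b in students]
    for a in students
]
```

A matrix like `distances` can be handed to a DBSCAN implementation that accepts precomputed distances (in scikit-learn, `DBSCAN(metric="precomputed")`). Changing the weighting or normalization will change the clusters, which is exactly the arbitrariness described above.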

Has QUIT--Anony-Mousse