i have a dataset of a social network contaning information about how follows how i need to find most active user(for example the user that dose most followings) my data set lines are like bellow
1000066:262792,273106,590979,1152305,1691577,1888250
and some of them are like these
1000073:private
1000069:notfound
questions 1: how to make rdd of any line in the way the key of all rdd pairs would be the first number that separated with ':' and values one by one separated with ','? question 2: how could i solve this problem using graphx? All i need is to find most active user in this dataset thanks in advance, answering any off these too will help