1

I have a csv file with 600000 unique user_ids and 70000 unique_products. I plan to implement a ALS based recommender system and for that I am planning to apply crosstab function on user_ids and unique_products. It creates memory error in pandas and 1e4 limit error in pyspark. Can anyone suggest me a way to solve this problem. Or I am also happy to get some suggestions on how to use the data for implementing a recommender system using ALS method. Thanks in Advance

Karthick
  • 58
  • 4

0 Answers0