-1

Now. I build a formula for news to get list hot news. have any factor as pageview, time,content. what does a solution for this problem? Thanks

Vadim Kotov
  • 8,084
  • 8
  • 48
  • 62
Vi Ngo Van
  • 172
  • 2
  • 12

1 Answers1

3

You can try a machine learning approach for this problem.

  1. Extract your features, and give each a numeric value (you can use the Bag of Words model for content). Note that some feature selection algorithm might be needed.
  2. Manually label large enough set of examples - and give each of them a score according to its importance.
  3. Use linear regression and build a function that evaluates each article and gives it a score.
  4. Now that you have your regression function, you can use it to give score to each article. Use it to achieve the raw score.
  5. For post processing - combine this score with the time in a second function to get the article's final score. @MattBall's suggested link seems like reasonable approach.
Community
  • 1
  • 1
amit
  • 175,853
  • 27
  • 231
  • 333