Now. I build a formula for news to get list hot news. have any factor as pageview, time,content. what does a solution for this problem? Thanks
Asked
Active
Viewed 1,497 times
-1
-
1How 'bout http://stackoverflow.com/questions/3783892/implementing-the-hacker-news-ranking-algorithm-in-sql?rq=1 – Matt Ball Sep 11 '12 at 04:19
-
Where are you getting the feed? What does the data look like? What have you tried? – Todd Moses Sep 11 '12 at 04:20
-
@MattBall: now, i have pageview. as hacker news have upvote and downvote – Vi Ngo Van Sep 11 '12 at 04:28
-
1Now. [FAQ](http://stackoverflow.com/faq). – PengOne Sep 11 '12 at 05:32
-
Hello, Have you found a solution or more info about how to deal with this? – RodParedes Dec 01 '21 at 15:24
1 Answers
3
You can try a machine learning approach for this problem.
- Extract your features, and give each a numeric value (you can use the Bag of Words model for content). Note that some feature selection algorithm might be needed.
- Manually label large enough set of examples - and give each of them a score according to its importance.
- Use linear regression and build a function that evaluates each article and gives it a score.
- Now that you have your regression function, you can use it to give score to each article. Use it to achieve the raw score.
- For post processing - combine this score with the time in a second function to get the article's final score. @MattBall's suggested link seems like reasonable approach.