2

From the documentation here: https://msdn.microsoft.com/en-us/library/dn760793.aspx

It says: totalEstimatedMatches: The estimated number of news articles that are relevant to the query. Use this number along with the count and offset query parameters to page the results.

However, there are some serious issues.

1.The returned number of results is ALWAYS less than the requested number in the "count" variable. For example, setting a count=100 results in only 75 results.

2.What's more, even skipping the difference and sending another query to the API with an offset (in this example, offset=100), the API returns a new totalEstimatedMatches!! (first query was 70k results, second time was 138)

What is going on here? How do we fully get the totalEstimatedMatches returned from the first query? Or is that a bogus inflated number?

  • Can you send a sample query and any other search parameters you're using where you observe this issue? I tried using &count=100 with the query q="top news" and got 100 news results as expected. Let me know which country you're also located as it'll help me debug the issue faster. – John Psaroudakis Oct 07 '16 at 02:23
  • I am located in the US. Did you try offset? Here is an example query parameters: q="cama", cc=BR, safeSearch=Off, count=100, offset=100. In HTTP headers, Accept-Language=pt. Try these, starting with offset 0, you will see that not only totalEstimatedMatches keeps changing, the subsequent calls with offset returns random number of results. – user3786924 Oct 10 '16 at 13:45
  • Thanks for reporting. I observe the same behavior. I notified our News API team to look into this. Somebody from the team will reply soon. – John Psaroudakis Oct 19 '16 at 18:01

1 Answers1

3

enter image description hereWe did some investigation on this issue. Basically, search engine index does not support an accurate estimation of total match, the same behavior could be observed on Bing.com. the 217M results in the screen shot provided in the image tab above which is not very accurate either.

And, news has backend mechanism that any query output should be less than 100. So the total estimated matches number is not used properly in this example. Normally we do not allow user to download too many results of each query in news. The number of documents you could get from certain query actually capped at a certain number, in most of the case it is around 100.

Patrick
  • 121
  • 2