1

We have a client who receives 2-4 million visits a day, so off the bat we can only get unsampled reports because it exceeds google's limit :

500,000 maximum sessions for special queries where the data is not already stored.

We are attempting to collect Unique Visitors and Visits for a 1 day period. Using the Google API has proved frivolous as the data is sampled.

We have set up Unsampled reports on a daily basis that get dumped into Google Drive and our application picks up the new files and downloads them just fine. The problem we are running into is that we need 2 years worth of daily data for 20 reports. The maximum range we can run an unsampled report using google analytics web interface is 1 week before we exceed a query limit. So 52 weeks of reports x 2 years x 20 different reports to set up is 2080 scheduled unsampled reports and this is for 1 client only.

EDIT: Can we automate unsampled reports using GA API or any programming method to pull historical data with the constraints previously mentioned? Also we do have Google Analytics Premium

c3cris
  • 1,276
  • 2
  • 15
  • 37
  • 1
    If this is a premium account you can export raw data to Google Big Query (and presumably download from there, haven't tried it. Plus, extra cost). If this is a regular account Google will most likely ask you to upgrade to premium since you exceed the data limits by at least an order of magnitude (it's 10 million interaction hits per month) and there is no good way to do what you want (presumably - you are not actually asking a question). – Eike Pierstorff Jan 03 '14 at 11:00
  • @EikePierstorff We are actually looking into Google Big Query but we cannot seem to locate any GEO location because according to the schema https://support.google.com/analytics/answer/3437719?hl=en&ref_topic=3416089 they do not provide an IP nor ANY geo location data. – c3cris Jan 09 '14 at 01:41
  • I guess by revealing the IP Google would violate their own TOS (since IPs can be used to identify a person). Which might also be why there is no geo information, since that would (at least in some cases) allow you to close in on indidivual users via a cleverly constructed query. Sorry, I really thought I was on to something. – Eike Pierstorff Jan 09 '14 at 08:15

1 Answers1

2

Cris G, the only way to avoid data-sampling in Google Analytics without having an access to Premium is day-parting technique = you split a data-request for selected time period into shorter period queries (typically days) and then add all the numbers up. If your profiles/views are not sampled if you look at daily numbers, this could solve you issue.

However, this doesn't work on Unique Visitors, since they will be unique every single time (you are running data requests on daily basis), so there will be most likely duplicates and inflated totals if your site is attracting lots of returning visitors.

To automate some of the work, I suggest using tools like Analytics Canvas. It can make your life much easier and I think it could be the perfect tool for what you need to. Bear in mind the limitations about unique visitors (and some other metrics).

Having said that, I still think the best choice would be to use the benefits of Premium and the ability to get unsampled data for your reports.

Ameet Wadhwani
  • 550
  • 4
  • 8
Petr Havlik
  • 3,307
  • 1
  • 19
  • 17
  • As I stated , We do have access to premium and we still get sampled reports on API and as for pulling unsampled reports we can only get 1-2 weeks at a time. As I have mentioned there are 20+ reports and 2 years of data. And this is only for one client. – c3cris Jan 07 '14 at 20:08
  • Well, then the day-parting is probably the best way to go. See the linked article for more details. – Petr Havlik Jan 07 '14 at 23:55