Question: I want to pull a stream of Firebase event-data grouped by user id and ordered by time of occurrence for my android users. I created the two scripts below, but unfortunately in both I can't seem to get the last part correct, which is to, successfully group all the app_instance_ids first irregardless of timestamp. Should I perhaps look at using distinct user_ids instead?
Unsuccessful attempt 1:
SELECT
d.userid,
c.ev_timestamp,
c.ev_name
FROM (SELECT
user_dim.app_info.app_instance_id as userid
FROM `firebase-analytics-sample-data.ios_dataset.app_events_*`, UNNEST(event_dim) AS event
WHERE _TABLE_SUFFIX BETWEEN '20160601' AND '20160603'
AND user_dim.first_open_timestamp_micros BETWEEN 1464789600000000 AND 1464962400000000
GROUP BY 1) AS d
LEFT JOIN (SELECT user_dim.app_info.app_instance_id as userid,
event.timestamp_micros as ev_timestamp,
event.name as ev_name
FROM `firebase-analytics-sample-data.ios_dataset.app_events_*`, UNNEST(event_dim) AS event
WHERE _TABLE_SUFFIX BETWEEN '20160601' AND '20160603'
AND user_dim.first_open_timestamp_micros BETWEEN 1464789600000000 AND 1464962400000000) AS c
ON d.userid = c.userid
ORDER BY 2 ASC
LIMIT 1000;
Unsuccessful attempt 2:
SELECT
d.userid,
d.ev_timestamp,
c.ev_name
FROM (SELECT
user_dim.app_info.app_instance_id as userid,
event.timestamp_micros as ev_timestamp
FROM `firebase-analytics-sample-data.ios_dataset.app_events_*`, UNNEST(event_dim) AS event
WHERE _TABLE_SUFFIX BETWEEN '20160601' AND '20160603'
AND user_dim.first_open_timestamp_micros BETWEEN 1464789600000000 AND 1464962400000000
GROUP BY 1,2
ORDER BY 2 ASC) AS d
LEFT JOIN (SELECT user_dim.app_info.app_instance_id as userid,
event.timestamp_micros as ev_timestamp,
event.name as ev_name
FROM `firebase-analytics-sample-data.ios_dataset.app_events_*`, UNNEST(event_dim) AS event
WHERE _TABLE_SUFFIX BETWEEN '20160601' AND '20160603'
AND user_dim.first_open_timestamp_micros BETWEEN 1464789600000000 AND 1464962400000000) AS c
ON d.userid = c.userid AND d.ev_timestamp = c.ev_timestamp
#ORDER BY 2 ASC
LIMIT 1000;
Correct answer (Amod's answer converted to New Export Schema):
SELECT user_pseudo_id, event_timestamp, event_name
FROM `xxxx.analytics_xxxx.events_*`
WHERE _TABLE_SUFFIX BETWEEN '20180630' AND '20180702'
AND user_first_touch_timestamp BETWEEN 1530453600000000 AND 1530468000000000
AND platform = "ANDROID"
ORDER BY 1,2 ASC
LIMIT 1000