I'm working with another team, who has already achieved near real time (NRT) load in their GCP bigquery project dataset. The objective on our side is to utilize their NRT datasets to create another/several NRT tables on our side. This could involve (as a initial test) left joining
two NRT tables, aggregating using group by
etc.
Is there a way to achieve this using something like an event trigger (or equivalent terminology in GCP)?
What I've searched so far is using pub/sub and Dataflow in GCP. However my understanding is if I use this way, my whole process is becoming independent of what our upper stream team has done for us.
Can someone give me some suggestions?