I am working on a use case where I have two unbounded streams and want to perform a left join on them. I am using fixed-size windows of 5 minutes with no allowed lateness. For the join I am using the Java extension join library. However, after the join no results are emitted. The code is:
// Third argument is the value substituted when the right side has no match.
PCollection<KV<String, KV<GenericRecord, GenericRecord>>> joinedDatasets =
    Join.leftOuterJoin(aById, bById, GenericRecord);
PCollection<GenericRecord> result = joinedDatasets.apply(
    ParDo.of(new DoFn<KV<String, KV<GenericRecord, GenericRecord>>, GenericRecord>() {
      @ProcessElement
      public void processElement(
          @Element KV<String, KV<GenericRecord, GenericRecord>> element,
          OutputReceiver<GenericRecord> out) {
        LogHelper.info(element.getKey());
        // some processing logic
      }
    }));
I have also tried CoGroupByKey, but the behaviour is the same. I am running the job on the DirectRunner.
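
To make the expected behaviour concrete, here is a minimal plain-Java sketch (not Beam code) of what I understand a left outer join over 5-minute fixed windows to compute. Everything in it is an assumption made for illustration: the class name WindowedLeftJoinSketch, the Event type, and the use of String in place of GenericRecord are hypothetical, and timestamps are assumed to be non-negative epoch milliseconds with windows aligned to the epoch.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: plain Java, not the Beam API.
class WindowedLeftJoinSketch {
    static final long WINDOW_MILLIS = 5 * 60 * 1000L; // 5-minute fixed windows

    /** A keyed, timestamped value; String stands in for GenericRecord. */
    static final class Event {
        final String key;
        final String value;
        final long timestampMillis;

        Event(String key, String value, long timestampMillis) {
            this.key = key;
            this.value = value;
            this.timestampMillis = timestampMillis;
        }
    }

    /** Start of the fixed window containing the (non-negative) timestamp. */
    static long windowStart(long timestampMillis) {
        return timestampMillis - (timestampMillis % WINDOW_MILLIS);
    }

    /** Groups elements by (window, key), mirroring a per-window join. */
    private static String groupId(Event e) {
        return windowStart(e.timestampMillis) + "|" + e.key;
    }

    /**
     * Left outer join: every left element is emitted once per matching right
     * element in the same window, or once with nullValue if there is none.
     */
    static List<String> leftOuterJoin(List<Event> left, List<Event> right, String nullValue) {
        Map<String, List<String>> rightByGroup = new HashMap<>();
        for (Event r : right) {
            rightByGroup.computeIfAbsent(groupId(r), k -> new ArrayList<>()).add(r.value);
        }
        List<String> out = new ArrayList<>();
        for (Event l : left) {
            List<String> matches = rightByGroup.get(groupId(l));
            if (matches == null) {
                out.add(l.key + ":" + l.value + "," + nullValue);
            } else {
                for (String m : matches) {
                    out.add(l.key + ":" + l.value + "," + m);
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<Event> left = List.of(
            new Event("k1", "a1", 60_000L),    // window starting at 0
            new Event("k2", "a2", 120_000L));  // window starting at 0
        List<Event> right = List.of(
            new Event("k1", "b1", 200_000L),   // same window as a1
            new Event("k2", "b2", 400_000L));  // next window, so no match for a2
        System.out.println(leftOuterJoin(left, right, "NULL"));
    }
}
```

With this sample input I would expect one joined pair for k1 and one null-padded row for k2, because b2 falls into the next 5-minute window. This is the kind of output I expect to see logged in my DoFn, but nothing is logged at all.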