I have a scenario where we have two different streams and we get data on them at two different times and i need to join them on the basis of the timestamp that is there in the value. I will try to explain through below example.
inputStream1 ->
- key 111, value 21:00 AAA
- key 111, value 21:02 AAA
- key 111, value 21:04 AAA
- key 111, value 21:15 AAA
- key 111, value 21:18 BBB
- key 111, value 21:20 BBB
inputStream2 ->
- key 111, value 21:01 10.0.0.1
- key 111, value 21:04 10.0.0.2
- key 111, value 21:14 10.0.0.3
- key 111, value 21:20 10.0.0.4
- key 111, value 21:21 10.0.0.5
output Join that i need ->
- AAA 10.0.0.1
- AAA 10.0.0.2
- AAA 10.0.0.3
- BBB 10.0.0.4
- BBB 10.0.0.5
Note: Both the stream get their input at different times. It is possible that when first record at inputStream1 arrives, inputStream2 has all 5 records present. I want to match them on the time window that is in the value.
How can i achieve this in kafka. Is it even possible?