5

I know, join between two streams is not supported yet. But there's a way to do it anyway? Or I need to wait for Spark 2.2.0? Thank you

zero323
  • 322,348
  • 103
  • 959
  • 935
Aniello Guarino
  • 197
  • 2
  • 10

3 Answers3

4

Spark 2.2.0 still won't support stream-stream join. You can watch this ticket: https://issues.apache.org/jira/browse/SPARK-18791

zsxwing
  • 20,270
  • 4
  • 37
  • 59
  • will it be available in 2.3 ? – user1870400 Feb 12 '18 at 04:10
  • 1
    Yes. 2.3 will support it. – zsxwing Feb 12 '18 at 08:34
  • Hi! I was just playing with spark 2.3 RC4 stream-stream join. Somehow I couldn't do a self join on streaming dataframe easily. and I am not sure why..It might be my syntax but I am not sure..more details here https://stackoverflow.com/questions/48900127/how-to-do-a-self-join-in-spark-2-3-0-what-is-the-correct-syntax would be great if you can help on this one!! – user1870400 Feb 21 '18 at 08:07
1

As of production release of Spark 2.3, stream-stream joins are now supported in structured streaming.

rhoeting
  • 396
  • 4
  • 10
-2

I recommend you using Kafka Streams rather than Structured Spark Streaming if you want to join streams.. check this link to see how this can be done

Gilles Essoki
  • 517
  • 6
  • 15