0

As of my requirement on real time analytics I have to insert 10 million records into Kafka and on top of that I have to write custom consumer logic.But to fulfill this requirement how can I seed 10 million records into Kafka within 3-5 minutes time?

Preferably we want to use Spring boot as per the project requirements. Please suggest

Test Mail
  • 169
  • 3
  • 15
  • 3
    Possible duplicate of [Is there any simulator/tool to generate messages for streaming?](https://stackoverflow.com/questions/40131580/is-there-any-simulator-tool-to-generate-messages-for-streaming) – Bushuev May 03 '19 at 21:39
  • 3
    https://www.confluent.io/blog/easy-ways-generate-test-data-kafka – OneCricketeer May 03 '19 at 21:45

3 Answers3

3

Several options:

Robin Moffatt
  • 30,382
  • 3
  • 65
  • 92
0

You can try streamsets tool, it has a data generator functionality. It generates data rapidly. Also this will help you with connector and consumer/producer, in-fact you can do some processing & filtering also here so ideally almost no coding and you are done.

Mayank J
  • 71
  • 3
0

You can create a simple producer and produce in a for loop. I did it, it works like a charm. Don't forget to NOT use the .flush() if you dont want to produce 1 msg/sec...

oguz ismail
  • 1
  • 16
  • 47
  • 69
Gremi64
  • 1,534
  • 12
  • 19