I am using Kinesis data stream as a source and elasticsearch as a sink.
I am using Flink job to process this data a little bit then sink this data to elasticsearch.
In the production environment, the Kinesis data stream can generate 50,000 events per second. it's taking a lot of time to process data to process 500,000 events it takes nearly around 50 minutes of time.
Elasticsearch version 7.7 running on SSD-based storage.
Elasticsearch nodes: 2
Shards: 5
Replicas: 1 per shard
Refresh interval: 1 sec (default)
We are using AWS opensearch elasticsearch.
Can someone please suggest what causes this delay?