I am currently working on a project titled by Automatic Tuning for Flink streaming framework.
Basically, we aim to create a model(Reinforcement learning agent) to select the best values for Flink parameters. Such a problem occurs in the Spark framework, as an example, choosing the right configuration can be challenging and no doing it correctly may have a significant impact on the performance.
What I would like to know is:
- Aside from code optimization, are there parameters that require tuning in a streaming job for Flink?
- Is there a shortlist of parameters that we need to focus on, created by experts?
- Is choosing the right parameters requires a trainable model(a sophisticated process) or maybe it's simply not that challenging?
Thank you.