The Flink official documentation provides a jobmanager high availability solution for Standalone And Yarn Flink clusters. But what should be done for high availability using a Mesos Flink Cluster?
I have learned that Flink relies on Marathon to bring up a failed JobManager, but it comes with a problem: every time jobmanager restart, jobs gets re-deployed. This is not acceptable for mission critical jobs. Is there workaround for this issue ? Or does any plan to support full HA with Flink on Mesos mode ?