Cadence server version: 0.19.2
I have made following observation: I have a Job workflow that triggers encoding workflow (child workflow) which has an activity to handle encoding status. I have supplied retry configuration and heartbeat configuration in both child workflow and activity in case the workflow or activity fails due to server getting killed. However out of lets say 100 jobs, i get 20 jobs where the activity doesn't retries. It fails in attempt 0 with timeout type heartbeat timeout. I am sharing below parent workflow json and child workflow json and along with the some screen shots
Child workflow JSON http://jsonblob.com/1007526462865293312
Parent Workflow JSON http://jsonblob.com/1007526121943875584