My apache beam pipeline looks like this:
vids = (p|'Read input' >> beam.io.ReadFromText(known_args.input)
| 'Parse input' >> beam.Map(lambda line: csv.reader([line]).next())
| 'Run DeepMeerkat' >> beam.ParDo(PredictDoFn(pipeline_args)))
Where I am inputting a csv with a list of videos to analyze. In this test run there were 4 videos.
The pipeline runs fine, but i'm not understanding the autoscaling feature.
Currently identifies 4 elements (right side)
but the console shows rising to 15 workers
How can there be more workers than elements?