How do I ensure that there's only one reducer for my EMR Streaming job? Is there any way to do this from the web frontend when I'm creating a new Jobflow?

jetru

1 Answer

You can configure Hadoop with the configure-hadoop bootstrap action, passing settings through the --arg flag. For your question specifically, you can set mapred.tasktracker.reduce.tasks.maximum to 1.

elastic-mapreduce --create --alive \
      --name "Configure Jobflow" \
      --bootstrap-actions s3://elasticmapreduce/bootstrap-actions/configure-hadoop \
      --arg mapred.tasktracker.reduce.tasks.maximum=1
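
Note that mapred.tasktracker.reduce.tasks.maximum caps the reduce slots on each task tracker rather than the number of reduce tasks in the job. If the goal is a single reduce task for the whole streaming job, the usual Hadoop approach is to set mapred.reduce.tasks=1 on the job itself. Below is a rough sketch of how that might look with plain Hadoop streaming. The bucket names, script names and jar path are placeholders I made up, not values from the question, and I have not run this against EMR.

```shell
#!/usr/bin/env bash
# Sketch only: the jar path, bucket names and script names are placeholders.
STREAMING_JAR="${STREAMING_JAR:-/home/hadoop/contrib/streaming/hadoop-streaming.jar}"

# Generic options such as -D must come before the streaming-specific options.
cmd=(hadoop jar "$STREAMING_JAR"
     -D mapred.reduce.tasks=1
     -input  s3://my-bucket/input
     -output s3://my-bucket/output
     -mapper  mapper.py
     -reducer reducer.py)

# Print the assembled command for review; run it with "${cmd[@]}" on the master node.
echo "${cmd[*]}"
```

If the job flow form in the web frontend exposes an extra-arguments field for the streaming step, the same option should be usable there, but I have not verified that.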
Ronen Botzer