I am new to MapReduce jobs. These may be basic questions, but the existing documentation didn't help me. How do I run MapReduce jobs using luigi? For example, for wordcount_hadoop.py, what parameters do I need to pass to start a job?

python examples/wordcount_hadoop.py --date-interval 2012-06-01

output:

usage: wordcount_hadoop.py [-h] [--scheduler-port SCHEDULER_PORT] [--lock]
                           [--workers WORKERS] [--lock-pid-dir LOCK_PID_DIR]
                           [--scheduler-host SCHEDULER_HOST]
                           [--local-scheduler] [--pool POOL]
                           {BaseHadoopJobTask,EnvironmentParamsContainer,JobTask,Task,WordCount,WrapperTask} ...
wordcount_hadoop.py: error: argument {BaseHadoopJobTask,EnvironmentParamsContainer,JobTask,Task,WordCount,WrapperTask}: invalid choice: '2012-07' (choose from 'JobTask', 'Task', 'WrapperTask', 'WordCount', 'EnvironmentParamsContainer', 'BaseHadoopJobTask')
user2695817

1 Answer

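You need to pass the task name on the command line, before the task's own parameters. The usage message lists the valid task names in braees, and `--date-interval` belongs to the `WordCount` task rather than to the script itself.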

For example:

python examples/wordcount_hadoop.py WordCount --date-interval 2012-06-01
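To see why the task name is required, here is a minimal sketch that mimics the command-line shape shown in the usage message using only Python's standard `argparse` module. It is not luigi's actual code: `build_parser` and `parse` are hypothetical helper names, and the assumption is simply that each task is registered as a sub-command carrying its own parameters.

```python
import argparse


def build_parser():
    """Build a parser shaped like the usage message in the question (illustrative only)."""
    parser = argparse.ArgumentParser(prog="wordcount_hadoop.py")
    # A global option that applies to every task.
    parser.add_argument("--local-scheduler", action="store_true")

    # Assumption: every task is a sub-command, so its name is a required
    # positional argument that must precede the task's own parameters.
    tasks = parser.add_subparsers(dest="task")
    tasks.required = True

    # Task-specific parameters live on the sub-command, not on the top-level parser.
    word_count = tasks.add_parser("WordCount")
    word_count.add_argument("--date-interval")
    return parser


def parse(argv):
    """Return the parsed namespace, or None if argparse rejects the command line."""
    try:
        return build_parser().parse_args(argv)
    except SystemExit:
        # argparse prints a usage message and exits on invalid input.
        return None


if __name__ == "__main__":
    # With the task name: parsing succeeds.
    print(parse(["WordCount", "--date-interval", "2012-06-01"]))
    # Without the task name: argparse rejects the arguments.
    print(parse(["--date-interval", "2012-06-01"]))
```

In this sketch the second call fails because the top-level parser does not know `--date-interval`; only the `WordCount` sub-command does, which matches the behaviour described in the question.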

interskh