I'm trying to create a step by Linux console:
aws emr add-steps --cluster-id j-XXXXXXXXXX --steps Type=CUSTOM_JAR,Name="S3DistCp step",Jar=/home/hadoop/lib/emr-s3distcp-1.0.jar,\
Args=["--s3Endpoint,s3-eu-west-1.amazonaws.com","--src,s3://folder-name/logs/j-XXXXXXXXXX/node/","--dest,hdfs:///output","--srcPattern,.*[a-zA-Z,]+"]
I jump the following error
Error parsing parameter '--steps': Expected: ',', received: '+' for input
How I can fix it?
I'm looking for a solution to upload multiple files to S3 and S3DistCp the Hive gather for Amazon EMR. Is there any other way?
I have another question: Now I am creating an SSH tunnel to connect to Hive, how I could connect with PHP?
At the moment I have solved the error by removing "src Pattern", however gives me another error, I include image below
This is the error that appears
INFO Synchronously wait child process to complete : hadoop jar /var/lib/aws/emr/step-runner/hadoop-
INFO waitProcessCompletion ended with exit code 1 : hadoop jar
/var/lib/aws/emr/step-runner/hadoop-
INFO total process run time: 2 seconds
2016-07-12T14:26:48.744Z INFO Step created jobs:
2016-07-12T14:26:48.744Z WARN Step failed with exitCode 1 and took 2 seconds
Thx!!!