I am working for a client where I need to put some files into HDFS with Snappy compression. My problem is that the Snappy codec is not defined in mapred-site.xml or hdfs-site.xml.
Somehow I have to put the files, preferably using the hdfs put command, and they should end up compressed. Changing the configuration files is not an option, since it is a production machine and other people are actively using it.
Another suggested solution was to import the files into HDFS without compression, then create Hive external tables with compression and use their source files while deleting the uncompressed files. But this is a long way around, and it is not guaranteed to work.
Any suggestions about using hdfs put with some kind of parameter to compress the files would be appreciated.