Technical background: I am getting tables data from kafka and putting it into hudi and hive tables using spark. I am using AWS EMR. I want to encrypt data in transit within the cluster as well as synced external tables data present in s3 (Data at rest)
Note: I dont want to use AWS EMR encryption, want to use spark or hudi encryption, doesn't want to stick to AWS only, want platform independent solution
I read about hudi/spark encryption (link) but thats a columnar encryption, I dont want to encrypt specific column, I want all data to be encrypted, so Is there any spark configuration to encrypt whole data in rest as well as transit within cluster?
TIA