I have parquet files which I have read using the following spark command
lazy val out = spark.read.parquet("/tmp/oip/logprint_poc/feb28eb24ffe44cab60f2832a98795b1.parquet")
The column names of a lot of column has special chracter "(". like WA_0_DWHRPD_Purge_Date_(TOD)
, WA_0_DWHRRT_Record_Type_(80=Index)
How can I remove this special character.
My end goal is to remove these special character and write the parquet file back using the following command
df_hive.write.format("parquet").save("hdfs:///tmp/oip/logprint_poc_cleaned/")
Also, I am using Scala spark shell. I am new to spark, I saw similar questions but nothing is working in my case. Any help is appreciated.