How can I read a JSON schema whose values are data type names (as strings) and apply those types to a PySpark DataFrame?

Asked Nov 18 '22 at 10:59

Active Nov 18 '22 at 11:05

Viewed 20 times

Below is a piece of a sample JSON schema. I want my PySpark DataFrame to have netWorthOfTheCompany as a column with float as its data type. Currently, when I read the JSON schema into a DataFrame and run print(df.dtypes), the type is reported as string, because the value in the schema file is itself a string. I don't want to write a custom schema by hand with all the StructType and StructField definitions, because the JSON schema is too long.

{
    "turnover": {
        "netWorthOfTheCompany": "float", 
        "totalTurnover": "float"
    }
}

This is the line of code where I read the JSON schema and save it in a DataFrame: df = spark.read.option("multiline", "true").json(filepath)

I want the value of each key in the JSON schema to be read as a data type rather than as a string, and mapped to the corresponding data type available in PySpark, e.g. netWorthOfTheCompany : type(float)
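One possible way to get that mapping without writing StructType and StructField definitions by hand is to walk the schema file with plain Python and build a Spark DDL schema string from it. The sketch below is only an illustration under assumptions: the TYPE_MAP vocabulary and the helper names to_ddl_type and schema_to_ddl are hypothetical, and the type names in the real schema file may differ.

```python
import json

# Assumed mapping from the type names used in the schema file to Spark SQL
# DDL type names. This vocabulary is a guess; extend it for the real file.
TYPE_MAP = {
    "float": "FLOAT",
    "double": "DOUBLE",
    "int": "INT",
    "integer": "INT",
    "long": "BIGINT",
    "string": "STRING",
    "boolean": "BOOLEAN",
}


def to_ddl_type(node):
    """Convert one schema node (a type name or a nested dict) to a DDL type."""
    if isinstance(node, dict):
        # A nested object becomes a STRUCT with one field per key.
        fields = ", ".join(
            f"`{name}`: {to_ddl_type(child)}" for name, child in node.items()
        )
        return f"STRUCT<{fields}>"
    # A leaf is a type name such as "float"; unknown names raise KeyError.
    return TYPE_MAP[node.lower()]


def schema_to_ddl(schema):
    """Convert the top-level schema dict to a DDL column list."""
    return ", ".join(
        f"`{name}` {to_ddl_type(child)}" for name, child in schema.items()
    )


# The sample schema from the question, inlined so the sketch is self-contained.
sample = json.loads("""
{
    "turnover": {
        "netWorthOfTheCompany": "float",
        "totalTurnover": "float"
    }
}
""")

ddl = schema_to_ddl(sample)
print(ddl)

# With a SparkSession available, the string can be passed to the reader when
# loading the actual data file (data_path is a hypothetical placeholder):
# df = spark.read.schema(ddl).option("multiline", "true").json(data_path)
```

The DDL string is applied when reading the data file, not the schema file itself. DataFrameReader.schema accepts either a StructType or a DDL-formatted string, so no struct definitions need to be written manually.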

edited Nov 18 '22 at 11:05


Aziz Shaikh


0 Answers