I have a table I am reading from Postres db and one of those fields is a jsonb type field that contains a nested array. The array can change at the row level within the table in Postgres, meaning the number of keys/value pairs can change. Inferring the schema sets the field to a string type. Does Pyspark support jsonb type fields and infer the schema for the field type?
Asked
Active
Viewed 201 times
1 Answers
0
Found my answer here: Pyspark: Parse a column of json strings
It took the jsonb field, parsed the field names, and restructured the df to accommodate for the nested array.

user7298979
- 499
- 4
- 21