I am using a Databricks environment and reading input from Kafka.

The messages consumed from Kafka are in Thrift binary format, and I am having trouble deserializing them to JSON.

I am writing a UDF to do this, but I cannot figure out how to convert Thrift to JSON.

I have tried

import thriftpy2.protocol.json as proto

def decoder(thrift_data):
    return proto.struct_to_json(thrift_data)

which throws the error: PythonException: 'AttributeError: 'bytearray' object has no attribute 'thrift_spec''

I also tried

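from thrift.TSerialization import serialize
from thrift.protocol.TJSONProtocol import TSimpleJSONProtocolFactory

def decoder(thrift_data):
    return serialize(thrift_data, protocol_factory=TSimpleJSONProtocolFactory())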

which throws the error: PythonException: 'AttributeError: 'bytearray' object has no attribute 'write''
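Both errors point the same way: struct_to_json and serialize expect an already-deserialized Thrift struct object, while the UDF receives raw bytes. So the conversion likely needs two steps: decode the binary payload into a structure first, then dump that structure as JSON. With classes generated from the .thrift file, the first step would normally be something like deserialize from thrift.TSerialization. Since the schema is not shown here, the following is only a minimal, schema-less sketch using the standard library. It assumes the payload is a bare struct in the Thrift binary protocol (no message header), handles only a few field types, and uses numeric field IDs as JSON keys because field names are not present in the bytes. The helper names read_value and read_struct and the sample payload are hypothetical.

```python
import json
import struct

# Thrift binary-protocol type codes
STOP, BOOL, BYTE, DOUBLE, I16, I32, I64, STRING, STRUCT, LIST = (
    0, 2, 3, 4, 6, 8, 10, 11, 12, 15)

# Fixed-width types mapped to big-endian struct formats
FIXED = {BOOL: ">?", BYTE: ">b", DOUBLE: ">d", I16: ">h", I32: ">i", I64: ">q"}


def read_value(buf, pos, ttype):
    """Decode one value of type ttype at pos; return (value, new_pos)."""
    if ttype in FIXED:
        fmt = FIXED[ttype]
        (value,) = struct.unpack_from(fmt, buf, pos)
        return value, pos + struct.calcsize(fmt)
    if ttype == STRING:
        # 4-byte length prefix, then the raw bytes
        (length,) = struct.unpack_from(">i", buf, pos)
        start = pos + 4
        raw = bytes(buf[start:start + length])
        try:
            # Assumption: string fields hold UTF-8 text
            return raw.decode("utf-8"), start + length
        except UnicodeDecodeError:
            # Fall back to hex for binary fields
            return raw.hex(), start + length
    if ttype == STRUCT:
        return read_struct(buf, pos)
    if ttype == LIST:
        # 1-byte element type, 4-byte element count, then the elements
        elem_type, count = struct.unpack_from(">bi", buf, pos)
        pos += 5
        items = []
        for _ in range(count):
            item, pos = read_value(buf, pos, elem_type)
            items.append(item)
        return items, pos
    raise ValueError(f"unsupported Thrift type code {ttype}")


def read_struct(buf, pos=0):
    """Decode fields until the STOP byte; return (dict, new_pos)."""
    fields = {}
    while True:
        ttype = buf[pos]
        pos += 1
        if ttype == STOP:
            return fields, pos
        # Each field header is a 1-byte type followed by a 2-byte field ID
        (field_id,) = struct.unpack_from(">h", buf, pos)
        value, pos = read_value(buf, pos + 2, ttype)
        fields[str(field_id)] = value


def decoder(thrift_data):
    """Convert a binary-encoded Thrift struct to a JSON string."""
    fields, _ = read_struct(thrift_data)
    return json.dumps(fields)


# Hypothetical payload: field 1 = string "abc", field 2 = i32 42
payload = bytearray(
    b"\x0b\x00\x01\x00\x00\x00\x03abc"  # string field, id 1
    b"\x08\x00\x02\x00\x00\x00\x2a"      # i32 field, id 2
    b"\x00"                              # STOP
)
print(decoder(payload))  # {"1": "abc", "2": 42}
```

If the generated classes are available, decoding into them instead would give named fields and support for maps, sets, and the remaining types that this sketch leaves out.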
John Constantine