I'm trying to read some parquet files stored in a s3 bucket. I am using the following code:
s3 = boto3.resource('s3')
# get a handle on the bucket that holds your file
bucket = s3.Bucket('bucket_name')
# get a handle on the object you want (i.e. your file)
obj = bucket.Object(key = 'file/key/083b661babc54dd89139449d15fa22dd.snappy.parquet')
# get the object
response = obj.get()
# read the contents of the file and split it into a list of lines
lines = response[u'Body'].read().split('\n')
When trying to execute the last line of code lines = response[u'Body'].read().split('\n')
I'm getting the following error:
TypeError: a bytes-like object is required, not 'str'
I'm not really sure how to solve this issue.