0

I have a dataframe like below values, I'm able to achieve the expected output in pandas but not in pyspark.

SAMPLE INPUT DF

number    time
12344    5 days, 04 hours, 52 minutes, 10
14566    8 days, 16 hours, 10 minutes, 09
13477    0 days, 21 hours, 29 minutes, 59
14579    4 days, 10 hours, 13 minutes, 23

SAMPLE OUTPUT DF

number    time
12344    5d 04h 52m 10s
14566    8d 16h 10m 09s
13477    0d 21h 29m 59s
14579    4d 10h 13m 23s

Any help would be appreciated!!

Anos
  • 57
  • 8

0 Answers0