Hi there, I would like to join all strings within a group using Python datatable, in order to avoid pandas. Below is the pandas code I am currently using, which I would like to replicate in datatable.

Does anyone know how to do it? Thank you very much!

from datatable import dt, f, by

df = dt.Frame(group1=[1, 1, 1, 2, 2, 2], group2=[1, 1, 2, 2, 2, 3], text=['a', 'b', 'c', 'd', 'e', 'f'])

df = df.to_pandas()
df2 = df.groupby(['group1', 'group2'])['text'].apply(' '.join).reset_index() # replicate this with datatable

df:
   group1  group2 text
0       1       1    a
1       1       1    b
2       1       2    c
3       2       2    d
4       2       2    e
5       2       3    f

df2:
   group1  group2 text
0       1       1  a b
1       1       2    c
2       2       2  d e
3       2       3    f
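
For reference, here is a minimal sketch of the grouping logic in plain Python, without pandas. It does not use a datatable aggregation at all: whether datatable offers a built-in reducer that joins strings per group is not something the question establishes, so this only illustrates the intended result. The helper name join_text_by_group is hypothetical, and the columns are passed as ordinary lists.

```python
from collections import defaultdict


def join_text_by_group(group1, group2, text, sep=" "):
    """Join the `text` values that share the same (group1, group2) key.

    Hypothetical helper for illustration; not part of datatable.
    """
    joined = defaultdict(list)
    # Collect the strings of each group in their original row order
    for g1, g2, t in zip(group1, group2, text):
        joined[(g1, g2)].append(t)

    # Sort the keys so the output is ordered like pandas' groupby result
    keys = sorted(joined)
    return {
        "group1": [k[0] for k in keys],
        "group2": [k[1] for k in keys],
        "text": [sep.join(joined[k]) for k in keys],
    }


# Same data as the frame in the question, as plain column lists
group1 = [1, 1, 1, 2, 2, 2]
group2 = [1, 1, 2, 2, 2, 3]
text = ["a", "b", "c", "d", "e", "f"]

result = join_text_by_group(group1, group2, text)
print(result)
# {'group1': [1, 1, 2, 2], 'group2': [1, 2, 2, 3], 'text': ['a b', 'c', 'd e', 'f']}
```

Assuming a datatable Frame df as in the question, the three column lists could probably be obtained with df.to_list(), and the resulting dict passed to dt.Frame(result) to get a Frame back. This loops in Python, so it will likely be slower than a native aggregation on large data.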
Konrad Rudolph
peter
