I imported a big dataset (~6 million rows) to R using ffbase package that lists people enrolled in high school in Brazil. In principle, I have 2 columns: Id (Student Id Number) and University (Institution’s name).
I would like to create a column - named Group in my example - relating each university to its educational group:
Id University Group
000001 Anhanguera Kroton
000002 Unopar Kroton
000003 Anhembi Laureate
000004 FMU Laureate
PS: I have none information about educational groups in my dataset, but, I’ve got the information I need concerning which group corresponds to each university. In this way, I need to attach this detail to my data.
PS2: The class of University column is ff_vector.
I appreciate any contribution you might make.