I am trying to separate my column VEHICLE_TYPE
by Model and Engine. The code can be a normal SQL or R code.
My data looks like this:
MODEL VEHICLE_TYPE
77 Bora Bora 1.6
79 Ducato Ducato 15 120 Multijet
80 Ducato Ducato 15 120 Multijet
87 Astra Astra 1.7 CDTI
88 406 406 2.0 HDi
89 406 406 2.0 HDi
90 Focus C-MAX Focus C-MAX 1.6 TDCi
91 Focus C-MAX Focus C-MAX 1.6 TDCi
92 Focus C-MAX Focus C-MAX 1.6 TDCi
93 Focus C-MAX Focus C-MAX 1.6 TDCi
94 Focus C-MAX Focus C-MAX 1.6 TDCi
97 S-Klasse S 320 CDI
98 S-Klasse S 320 CDI
99 S-Klasse S 320 CDI
I would like to receive something like this:
MODEL VEHICLE TYPE
Bora 1.6
Ducato 15 120 Multijet
... ...
Focus C-Max 1.6 TDCi
The problem is, that VEHICLE_TYPE can have different lengths and different number of spaces, which I could separate by.
I tried it with gsub and regex, didn't work, but strsplit worked. Far away from what I actually wanted, I ran out of ideas and need some help now.
> strsplit(as.character(test$VEHICLE_TYPE)," ")
[[1]]
[1] "Bora" "1.6"
[[2]]
[1] "Ducato" "15" "120" "Multijet"
[[3]]
[1] "Ducato" "15" "120" "Multijet"
[[4]]
[1] "Astra" "1.7" "CDTI"
[[5]]
[1] "406" "2.0" "HDi"
[[6]]
[1] "406" "2.0" "HDi"
[[7]]
[1] "Focus" "C-MAX" "1.6" "TDCi"
[[8]]
[1] "Focus" "C-MAX" "1.6" "TDCi"
[[9]]
[1] "Focus" "C-MAX" "1.6" "TDCi"
[[10]]
[1] "Focus" "C-MAX" "1.6" "TDCi"
[[11]]
[1] "Focus" "C-MAX" "1.6" "TDCi"
[[12]]
[1] "S" "320" "CDI"