I have a following file:
File
----
12-3 John 121
5-1 Sam 122
The file is tab(\t
) delimited. I am loading the row as line:chararray
as I want the data not to be split in individual fields.
And now, I want to pull and store the details (12-3, and 5-1) as separate data.
I am trying with STRSPLIT
and REGEX_EXTRACT_ALL
, but the data doesn't seem to match.
splitdata = FOREACH filedata {
regex = REGEX_EXTRACT_ALL(line, '^([0-9]*)\\-([0-9]*)');
split = STRSPLIT(line, '\\t', 1);
GENERATE regex, split;
};
This is how I want my final data to be:
(12, 3, 12-3 John 121)
( 5, 1, 5-1 Sam 122)