I'm stuck with a problem. I'm not an Excel expert.
I found some threads how to use regex in Excel, like:
But I don't know how to filter several things out of a data source to a newly created column.
Example:
The data source is mostly HTML-sourcecode in one cell.
I built RegEx for matching links (ahref), youtube, facebook videos, infogr.am links and other stuff.
I want to:
Extract/filter the links from the html to a new column, like:
New column "Links"
<a href="url">link text</a> \n
<a href="url">link text</a> \n
(every filtered link a new line in the cell)
Optional: paste another column next to it, which counts the paragraphs/links (counting the lines).
Then the video links (one line per filtered data again) + optional counting column.
And so on.
I found Openrefine. Which seems very nice and professional. But I didn't find out how to do it there.
I don't have to work with Excel, later the results can be converted to *.csv - it will be data for the database.
Though I think Excel is nice for the first steps, as the optional counting paragraph thing.
+ Text, Image Src, Links and Stuff. Could that be the error that Tags are missing? Also have to check how to convert my RegEx into GREL. But thanks anyway, i will look more into OpenRefine. Should be good for that. – timmyfromspace Oct 26 '18 at 10:59