i have JSON data on user profiles that i want to eventually analyze with SPSS.
Currently i imported the data in Google Refine, to run some data cleansing. My problem is however that the original JSON consists of nested objects, namely e.g. the…
Broadly speaking, here's what I'm trying to do: Parse a string in one cell of a spreadsheet, then add keywords to another cell in that row if certain keywords are found in the parsed cell.
I'm using OpenRefine (technically Google Refine 2.5) to try…
In openrefine when I upload the data, and click on text facet and then clustering. It creates the clusters. Like : Aniket Ghodke and Ghodke Aniket it will suggest to merge them. But is there any way where I can store these values. Like if I merge…
I imported an .xlsx file where one of the columns is filled with hyperlinks, but the links don't show on OpenRefine, just the value.
Does it happens only on Linux or it's the same thing with Windows?
If not, is there any other way to import those…
I have a dataset that has two different date formats in the same column. Some are formatted like:
2008-05-15T00:00:00Z
and others are formatted like:
20090804
Google Open Refine will recognize the first type as a date and will sort and allow me…
I've been following this tutorial:
https://www.youtube.com/watch?v=5tsyz3ibYzk
I've been following all the steps, but I noticed that, for my dataset, freebase doesn't suggest any kind of type, like it does for 'movies' in the example.
I have a…
I seem to have a workflow problem with Open Refine (Google Refine 2.5 [r2407]) to do sophisticated duplicate row cleaning. All I have found so far is how to delete duplicate rows based on a single column.
My aim is to delete duplicate rows based on…
I am trying to extract followers count from the data below:
{TruOptik': {follow_request_sent': False, profile_use_background_image': True, default_profile_image': False, id': 1308292578, profile_background_image_url_https': , verified': False,…
Can I use the cross function ( https://github.com/OpenRefine/OpenRefine/wiki/GREL-Other-Functions#crosscell-c-string-projectname-string-columnname ) with jython language in openrefine (googlerefine 2.5)
g.e. I would like to prepend all occurrences of the string "foo" with three spaces:
value.replace(/(foo)/, " " + "$1")
value.replace(/(foo)/, " $1")
value.replace(/(foo)/, " " + " " + " $1")
all return
foo
instead of
foo
I have a column in Excel of domain names (like stackoverflow.com) and would like to create a corresponding column with the title of the domains (like "Stack Overflow").
I uploaded the Excel file into OpenRefine. I believe the best way to do this…
As it seems, cross-column clustering isn't supported yet with OpenRefine.
Does anyone have any suggestions of how to cluster 'models' based on 'manufacturers', much like a 'city' would be based on a 'state' (many 'Springfield' could exist in the…
The task is simple to understand, I have a table like this:
And I would like to edit the column "L1_latitud" to collapse (or remove) all the blank cells:
It looks like a simple task but I can't find out a way to deal with it.