I have a column of names like:
Quaglia, Pietro Paolo
Bernard, of Clairvaux, Saint, or
.E., Calvin F.
Swingle, M Abate, Agostino, Assereto
Abati, Antonio
10-NA)\u, Ferraro, Giuseppe, ed, Biblioteca comunale ariostea. Mss. (Esteri
I want to…
Has anyone been experiencing problems with reconciliation in OpenRefine? I've imported a list of American universities and colleges, selected 50 rows, and tried Freebase, DBpedia, OpenCorporates reconciliation services. I've previously had multiple…
I have OpenRefine (a webapp hosted by jetty) running on:
http://127.0.0.1:3333
Which looks like this:
Everything works perfectly.
Now I would like to tunnel this through Apache2 (for security and renaming reasons), so I changed my http.conf file…
I want to investigate the rows for which a certain column is empty. I'll fill these cells based on values in other columns, but I want to identify which ones have not yet been done.
If I make a filter on that column, it doesn't do anything until I…
I'm cleaning data from multiple sources with OpenRefine (formerly Google Refine). I have files from different sources that contain companies; the column definitions are identical, i.e.
UNID | Name | Street | City | Country | Phone | ...
sg52d…
I'm trying to parse this JSON but really can't find the way to extract the data I want.
{ "results" :
[ { "address_components" :
[
{ "long_name" : "44", "short_name" : "44", "types" : [ "street_number" ] },
{…
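A minimal sketch of one way to pull a single value out of a structure like this, written as standalone Python rather than as an OpenRefine expression. The sample string contains only the one component visible in the excerpt; the rest of the response is elided in the question and is not reconstructed here. The helper name `component` and its parameters are hypothetical, and looking a component up by its `types` entry is an assumption about what the asker wants.

```python
import json

# Sample rebuilt from the visible part of the excerpt only; the real
# response contains more components, which the question elides.
raw = '''
{ "results" :
  [ { "address_components" :
      [
        { "long_name" : "44", "short_name" : "44", "types" : [ "street_number" ] }
      ]
  } ]
}
'''


def component(data, wanted_type, field="long_name"):
    """Return `field` of the first address component tagged with
    `wanted_type`, or None if no component carries that type.
    (Hypothetical helper, not part of any library.)"""
    for result in data.get("results", []):
        for comp in result.get("address_components", []):
            if wanted_type in comp.get("types", []):
                return comp.get(field)
    return None


data = json.loads(raw)
street_number = component(data, "street_number")
print(street_number)
```

Matching on the `types` list rather than on a fixed array position is a deliberate choice in this sketch: it does not depend on the order or number of components in the response.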
I'm not sure if this can be achieved in Google Refine at all. But basically, I have data like this.
The first table is the table of all the users. The second table shows all the friends. However, in the second table in the "friends" column not all the…
I have a column of reconciled entities in OpenRefine which include entities like Q56085233 and I would like to retrieve all links inside "Multilingual sites", if possible with a separator or only one at a time.
That is Q56085233, for instance, has…
I am using the OpenRefine client: https://github.com/opencultureconsulting/openrefine-client
I need to automate processes and for this I need to be able to extract/export OpenRefine transformation history (undo/redo) in JSON format from the client,…
I'm trying to extract an array of industry code descriptions from the OpenCorporates.com JSON output using OpenRefine. I've extracted the industry_codes array from the JSON body into a new column. Some records have a full array, some just have [ ].…
I've been using OpenRefine very successfully for a couple of years, working solely with CSV (and TSV) source files. Recently I had some tables from an SQL database that I wanted to bring into OpenRefine, so I exported them (from SQL) as JSON and then…
I'd like to write my own OpenRefine extension.
Before starting any implementation, I want to build the sample extension from OpenRefine to get me started.
However, I'm getting the Maven error:
Could not resolve dependencies for project…
I have my own Python library that I would like to use in OpenRefine as described here.
However, it seems that all the Python code in OpenRefine goes through Jython, which supports only Python 2.
Is there a way to run Python 3 code in OpenRefine?
cheers
I have a column of values with "//" as the separator between them. For example, one cell might contain: September 17 2021 // September 18 2021.
I want to check whether what comes before and after the separator is the same, and if so, to delete the…
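A minimal sketch of the comparison in plain Python. The question is cut off at "to delete the…", so this assumes the goal is to drop the repeated value and keep a single copy; that is an assumption, not something the excerpt states. The function name `collapse_if_same` is hypothetical. Inside an OpenRefine Python/Jython expression the cell content would normally arrive as `value` and the result would be handed back with `return`, but the code below is written as a standalone function.

```python
def collapse_if_same(cell, sep="//"):
    """If every part around `sep` is identical (ignoring surrounding
    whitespace), return that single part; otherwise return the cell
    unchanged. Assumes "delete" means dropping the repeated value."""
    parts = [p.strip() for p in cell.split(sep)]
    if len(parts) > 1 and all(p == parts[0] for p in parts):
        return parts[0]
    return cell


print(collapse_if_same("September 17 2021 // September 17 2021"))
print(collapse_if_same("September 17 2021 // September 18 2021"))
```

Cells whose parts differ, or that contain no separator at all, pass through untouched, so the transform could be applied to the whole column without first filtering it.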