Questions tagged [openrefine]

OpenRefine is the new name for the data cleaning tool formerly called Google Refine (and originally released as Freebase Gridworks).

400 questions
0
votes
1 answer

Is it possible to run an OpenRefine script in the background?

Can I trigger an OpenRefine script to run in the background without user interaction? Possibly use a Windows service to load an OpenRefine config file, or start the OpenRefine web server with parameters and save the output? We parse various data…
Brett
  • 786
  • 1
  • 8
  • 10
0
votes
1 answer

How can I join two datasets using a key in OpenRefine, with the secondary table having more than one value?

I have a dataset X like this:

Code | Name
------------
123 | AAA
456 | BBB

And the other, Y, like this:

Code | Level
------------
123 | A
123 | B
456 | B
456 | C

I want to join them using OpenRefine to something like this:

Code | Name | Level…
Bruno
  • 87
  • 9
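The one-to-many join described in this question can be pictured with a small Python sketch. This is not an OpenRefine recipe: the function name `join_one_to_many` is hypothetical, and emitting one output row per matching Level is an assumption, since the desired output in the excerpt is truncated.

```python
def join_one_to_many(primary, secondary):
    """Join (Code, Name) rows to (Code, Level) rows on Code.

    Assumption: each Code appears once in `primary` and may appear
    many times in `secondary`; one output row is produced per match.
    """
    names = {code: name for code, name in primary}
    return [
        (code, names[code], level)
        for code, level in secondary
        if code in names  # skip levels whose Code has no Name
    ]


# Sample data copied from the question excerpt.
x = [("123", "AAA"), ("456", "BBB")]
y = [("123", "A"), ("123", "B"), ("456", "B"), ("456", "C")]
joined = join_one_to_many(x, y)
```

Here `joined` holds four rows, two for each Code, with the Name repeated on every row.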
0
votes
1 answer

RDFRefine - can't access imported ontology?

I'm using OpenRefine and its extension RDFRefine. I'm trying to import an ontology which has its URI at http://purl.org/vocab/participation/schema# and suggested prefix: participation. When I add prefix and insert the URI, it gives me no problem.…
nicki
  • 187
  • 1
  • 2
  • 11
0
votes
1 answer

Lost all my files on OpenRefine

I tried the beta version of OpenRefine and now I have lost all my previous files from version 2.5. Do you know where the files are located? I am on a Mac. Thanks!
0
votes
1 answer

freebaseapps reconciliation stuck in Open Refine 2.6

I have some small test data with the names of 10 actors, that I would like to expand with some additional data. This used to work fine using this reconciliation service: http://reconcile.freebaseapps.com/reconcile. However, now it is stuck there.…
Luis Miguel
  • 5,057
  • 8
  • 42
  • 75
0
votes
0 answers

How to mix regex and string in value.replace in OpenRefine / GoogleRefine?

I'm just trying to add "+33 " and remove the first "0" in a phone number like 04 35 73 84 93 (in France) to get +33 4 35 73 84 93, in a database of contacts where a field contains only the phone number. I tried: value.replace(/^'0'/,'+33 ') There is…
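The pattern in the attempt above, `/^'0'/`, looks for a quote, a zero, and another quote at the start of the value, so it cannot match a bare leading 0. A minimal Python sketch of the intended substitution follows; the function name `to_international` is hypothetical. In GREL the equivalent would likely be `value.replace(/^0/, '+33 ')`, which is an assumption here rather than something the excerpt confirms.

```python
import re


def to_international(phone):
    # Replace only a single leading "0" with the French country code.
    # The regex has no quotes around the 0: it matches the digit itself.
    return re.sub(r"^0", "+33 ", phone)


# Sample number taken from the question excerpt.
converted = to_international("04 35 73 84 93")
```

Numbers that do not start with 0 pass through unchanged, because the pattern is anchored to the start of the string.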
0
votes
2 answers

Remove content inside parentheses

I have several cells in a column where the sentence ends with text inside parentheses. Ex.: Hello World (wwfx fgty jkilo). The desired output is Hello World ()
Chapin
  • 31
  • 1
  • 2
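As a rough illustration of the transformation requested in this question, the following Python sketch empties each pair of parentheses while keeping the parentheses themselves. The function name `empty_parentheses` is hypothetical, and the sketch assumes parentheses are not nested.

```python
import re


def empty_parentheses(text):
    # Match "(", then any run of characters that are not ")", then ")",
    # and replace the whole match with an empty pair of parentheses.
    return re.sub(r"\([^)]*\)", "()", text)


# Sample value taken from the question excerpt.
cleaned = empty_parentheses("Hello World (wwfx fgty jkilo)")
```

Replacing with an empty string instead of `"()"` would drop the parentheses as well, if that is what is actually wanted.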
0
votes
1 answer

Import columns to existing OpenRefine project

How do I add a column from an external .csv file to an existing project? I tried to find the solution online, but I wasn't successful.
kimizito
  • 45
  • 7
0
votes
1 answer

Google Refine - pull out identical values in cell

I have data in a column that looks like this: ["Lymore Cottages", "Lymore Cottages", "Lymore Cottages", "Lymore Cottages", "Lymore Cottages", "Lymor Cottages"] It's essentially the same thing multiple times, but as these are entered by users they…
Paul M
  • 3,937
  • 9
  • 45
  • 53
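The excerpt is truncated, so the exact goal is not stated. Assuming the aim is to keep the spelling that occurs most often in the list, a minimal Python sketch could look like this (the function name `most_common_value` is hypothetical):

```python
from collections import Counter


def most_common_value(values):
    # Count each distinct spelling and return the most frequent one.
    return Counter(values).most_common(1)[0][0]


# Sample list copied from the question excerpt.
names = [
    "Lymore Cottages", "Lymore Cottages", "Lymore Cottages",
    "Lymore Cottages", "Lymore Cottages", "Lymor Cottages",
]
winner = most_common_value(names)
```

A simple majority vote like this treats the rarer spelling as the typo, which may not hold when two variants are equally common.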
0
votes
1 answer

Appending a specific string in GREL

I'm cleaning data in Open Refine, using GREL. I'm trying to append the string OOL to the string SCH, only if SCH is placed at the bottom of any cell. I'd appreciate any tips. Miquel Centelles
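Reading "at the bottom of any cell" as "at the very end of the cell value" (an assumption), the conditional append can be sketched in Python with an end-anchored regular expression. The function name `complete_school` is hypothetical.

```python
import re


def complete_school(cell):
    # \Z anchors the match to the very end of the string, so "SCH"
    # elsewhere in the cell is left alone.
    return re.sub(r"SCH\Z", "SCHOOL", cell)


at_end = complete_school("HIGH SCH")
elsewhere = complete_school("SCH BOARD")
```

In this sketch only the first call changes its input; the second value is returned as-is because SCH is not at the end.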
0
votes
1 answer

How can I encode an integer into Base64 in Jython for Open Refine?

I want to encode an integer into a short string using Base64 and return the value to Open Refine (Google Refine). I found examples but they always give me an error. import base64; foo = base64.b64encode('1'); return foo works, returning "MQ==". But I…
Dizzley
  • 487
  • 6
  • 16
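`base64.b64encode` operates on a byte string rather than on an integer, so one plausible approach is to convert the integer to its decimal text first. The sketch below is written to run on both Python 3 and Jython 2.7; the function name `encode_int` is hypothetical, and encoding the decimal digits (rather than the integer's raw bytes) is an assumption about what the asker wants.

```python
import base64


def encode_int(n):
    # Turn the integer into its decimal digits as ASCII bytes,
    # because b64encode does not accept an int directly.
    digits = str(n).encode("ascii")
    # Decode the result so a text string is returned, not bytes.
    return base64.b64encode(digits).decode("ascii")


encoded = encode_int(1)
```

For the integer 1 this gives the same "MQ==" that the question reports for the string '1'. Inside an OpenRefine Jython expression the last line would be a `return` of the encoded value, as in the asker's snippet.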
0
votes
2 answers

Can OpenRefine/Google Refine be used for pages behind a login?

And if not, is there a comparable alternative that can? I looked in the documentation but couldn't see anything on this subject, though it must be a common requirement. And to be clear I mean a web-based login form, not basic HTTP auth.
Codemonkey
  • 4,455
  • 5
  • 44
  • 76
0
votes
2 answers

Parse and remove HTML tags using Google Refine/OpenRefine & Jsoup/BeautifulSoup

I use Google Refine for dealing with messy product data sheets in order to format them for upload into Magento stores using Magmi/Dataflow profiles. I am still using Google Refine 2.5 as it is the latest stable release. The descriptions from…
0
votes
1 answer

Replace null cell with space character

In a certain column, I am trying to replace any cell with a null value with a single space. I am trying to combine multiple rows where each row only contains one letter. When I join multi-valued cells, Refine will (understandably) throw out the…
cornking
  • 25
  • 8
0
votes
1 answer

Using OpenRefine to clean addresses - Cluster Exact Match

I am using Open Refine to clean up some addresses in a sheet (.csv). How do I cluster exact matches in a column? It seems that the clustering algorithm will cluster addresses that are nearly identical and differ only in the number. For example: 56,…
antsemot
  • 61
  • 1
  • 1
  • 6