Questions tagged [openrefine]

OpenRefine is the new name for the data cleaning tool which used to be called Google Refine (and was born as Freebase Gridworks)

Resources

400 questions
0
votes
1 answer

Google Refine: Can I use a variable in a regular expression?

I'm trying to do something like this in Open/Google Refine: forEach(["foo", "bar"], regex, value.match(/.*(regex).*/)) That is loop an array of keywords to check if the cell contains any of them. How can I use these keywords to build regular…
Jens
  • 121
  • 1
  • 9
0
votes
1 answer

Use google-refine on csv without headers and with various number of columns per record

I'm attempting to import in open-refine a csv extracted from a NoSQL database (Cassandra) without headers and with different number of columns per record. For instance, fields are comma separated and could look like below: 1 - userid:100456,…
benr
  • 1
0
votes
1 answer

Populate value of a property of a column in another column

Column 1 Column 2 Column1 -> foaf:name -> Column 2 Lets say I want to display the name of the person pointed to by the Column 1 URI in the Column 2. Is there a way to accomplish this with openrefine and rdf extension?
trailblazer
  • 1,421
  • 5
  • 20
  • 43
0
votes
1 answer

How do I fetch text blurbs using open refine 2.6?

I am running a very simple exercise where I have a list of people's names that have been already reconciled via freebase, from within Open Refine. The Github repository for Open Refine clearly indicates that fetching Properties against a reconciled…
Luis Miguel
  • 5,057
  • 8
  • 42
  • 75
0
votes
2 answers

Using groups in OpenRefine regex

I'm wondering if it is possible to use "groups" in ReGeX used in Open Refine GREL syntax. I mean, I'd like to replace all the dots followed and preceded by a character WITH the same character and dot but followed by a space and then the…
mellin
  • 307
  • 5
  • 13
-1
votes
1 answer

How can I filter large amount of JSON in OpenRefine?

I'm using OpenRefine to pull in information on publisher policies using the Sherpa Romeo API (Sherpa Romeo is a site that aggregates publisher policies). I've got that. Now I need to parse the returned JSON so that those with certain pieces of…
-1
votes
1 answer

Open Refine regex for alphabets

i want to edit only alphabetic charcter from my cell . what i have done value.match(/.*?(\^[a-zA-Z]*$).*?/) but it returns null i am try to clean address column in my data set following are the sample address H3656 GALI#4 BLOCK-D, AREA 1 H#36/17…
Usman Asif
  • 320
  • 2
  • 12
-1
votes
1 answer

How do i tell Openrefine to fetch some data trough the google and facebook API (Grel)

I have a list of around 500 foundations and companies and i need to fetch more data like address, description, lat long, email or phone number, from the google and FB APIs I have found something more or less like this, but it won't work in…
He11bran
  • 3
  • 3
-1
votes
1 answer

group the results of a matching function

i have three columns in excel, one that holds the row id, the next one a product serial number, and the last one again some serial numbers. 098 11079851 11079851 110 11089385 25853201 118 11089385 22412115 798 11079851 22412115 what i need…
user11517014
-1
votes
1 answer

making columns from a line separated file

I just started learning openrefine 20 minutes ago. I have a text file with data that is separated by a consistent header ("JP") at the beginning of each chunk of data. The chunks of data are not all the same number of lines. I want each chunk of…
-1
votes
1 answer

Trying to retrieve place details but getting error. Google places api

Ive been trying to extract Google place details using a place id column but it keeps throwing an error if i try to reference the placeid column. However, when i put in the placeid column manually it works perfectly. Can someone highlight whats wrong…
-1
votes
3 answers

How to add numbers to a string with Python or GREL

I have >4000 numbers in a column that need to be manipulated.. They look like this: 040 413 560 89 or 0361 223240 How dow I put it into the folllowing format: +49 (040) 41356089 or +49 (0361) 223240 They all need to have the same country dialling…
Mac
  • 1
  • 2
-1
votes
1 answer

Coupling values on columns

I've a spreadsheet with values in several columns like this scheme value1 value2 value3 And I'd like to couple every value with each other like this scheme: value1 value2 value1 value3 value2 value3 Is there any excel funtion or python code…
Andrea Angeli
  • 131
  • 1
  • 16
-1
votes
1 answer

How to save only specific JSON elements in a new OpenRefine column

{ "business_id": "SQ0j7bgSTazkVQlF5AnqyQ", "full_address": "214 E Main St\nCarnegie\nCarnegie, PA 15106", "hours": {}, "open": true, ** "categories": ["Chinese", "Restaurants"] ** , "city": "Carnegie", "review_count": 9, …
-1
votes
1 answer

OpenRefine undefined/language error

New OpenRefine install: I see "Extensions: Undefined", related to the Freebase extension no longer working, plus other texts don't appear when creating the project. I read somewhere there is a workaround, by setting the language to English (or…
kimizito
  • 45
  • 7
1 2 3
26
27