Questions tagged [google-refine]

OpenRefine (formerly Google Refine) is a free, open source, data cleaning tool.

[Google Refine] is a free, open source, data cleaning tool. It was originally called Freebase Gridworks and was developed by Metaweb before Metaweb's acquisition by Google. In 2012 support from Google have been removed and code moved to GitHub 1.

44 questions
1
vote
1 answer

openrefine/googlerefine - reconcile two datasets

i'm in a situation with two json files: file A contains different datas, one of them contains numeric id. File B contains all the possible numeric ids linked with a vat number.In the rdf final file i'd like to replace the column of ids of the first…
heartlex
  • 103
  • 1
  • 13
1
vote
2 answers

Transpose survey response dataset with Open Refine (previously Google Refine)

I’m looking for some help to reshape a survey response dataset, exported as a csv, using Open Refine (previously Google Refine). Some context on the survey Collector and responder ID are collected in the background - ID1 ID2 Users select tasks from…
1
vote
1 answer

Refine with RDF extension: literals or resources?

I am using Refine with RDF Extension to produce triples from a CSV. I have imported two vocabularies and am now using them to describe the columns: How can I change the nodes to describe resources rather than literals? For example, when I preview…
Sara Lafia
  • 135
  • 1
  • 1
  • 8
1
vote
1 answer

OpenRefine split in multiple cells

I have a simple table like this : id | name ------------------- 1 | Jack, Jeff, Win ------------------- 2 | Jonhy, chin I want to split the cell name by "," and want to preserve the id, so I after the…
nacho c
  • 311
  • 1
  • 3
  • 15
1
vote
0 answers

Open Refine: Open Project Issue

I am an Open Refine user. I use this tools with RDF extension. I have created a number of RDF projects. Each time I open this tool, I find my existing projects under Open Refine's "Open Project" option. Two days ago I updated my computer's Java…
Beautiful Mind
  • 5,828
  • 4
  • 23
  • 42
1
vote
1 answer

How to get HTTP status codes from URLs using Google Refine?

I have a file that contains a long list of URLs. I want to use Google Refine to get HTTP status codes that appear when each URL is open. The URLs are stored in 1 column, 1 URL per 1 cell. The HTTP status codes should be stored in a new column. There…
M Novakova
  • 11
  • 2
1
vote
1 answer

Parsing XML using Google Refine

I'm using Google-refine to download information from Google Maps. More specifically, I want the lat/long associated with a given address. I have successfully downloaded the XML information, but I'm having a horrible time parsing it. The typical…
user950899
  • 63
  • 5
1
vote
1 answer

google refine: use facet tools to infer map between two columns

I've been searching but haven't found how to do this in refine. I've got two columns of unique IDS. For each a in A, I want to find the top 10 closest matches in B. My backup plan is to just use Levenshtein to iterate ... but Refine has such a nice…
mathtick
  • 6,487
  • 13
  • 56
  • 101
1
vote
1 answer

Pulling Freebase data in Google Refine

I'm fairly new to Google Refine and Frebase. I want to use use the internet/websites list, but the dump doesn't contain the website URL, it's however accessible in related "official website" object. In Refine, how is it possible to fetch this data…
Félix Ménard
  • 453
  • 4
  • 10
1
vote
1 answer

Google Refine Reconciliation: How to auto-replace with result text

Let's say I have cell like this: "Nat. Taiwan Normal Univ., Taipei" Once I do reconciliation, I get this: Nat. Taiwan Normal Univ., Taipei V V Create new topic Search for match Then I click on "Search for match" and get a drop down result: "National…
1
vote
1 answer

How to deal with large dataset when fetching url in google refine?

So, I have an excel sheet with movie names around 190000 titles from freebase, I'd like to get information from wikipedia from fetching url by title, that would take a long time, I left my computer running for 8 hours and it just got to 2%.…
toy
  • 11,711
  • 24
  • 93
  • 176
0
votes
2 answers

Create column in OpenRefine using a logic test on an existing column?

I have an original data set with over 4,500 row entries representing land transactions. One of the existing columns is "Place," and in each cell value I've put the locality name usually as Some Neighborhood, but sometimes when there is a place I…
grad student
  • 111
  • 1
  • 4
0
votes
1 answer

How convert a table format or it structure with Google Refine

I have a table with the following format: ID Estation Y M D H N Nh h Cl 1 78357 2017 5 1 1 0 0 -9001 0 2 78357 2017 5 1 2 0 0 -9001 0 3 78357 2017 5 1 3 1 1 750 5 I want…
armando85
  • 5
  • 4
0
votes
1 answer

How to generate random numbers with GoogleRefine or OpenRefine in GREL?

I'd like to generate random numbers in GREL. Is there any way to do this? I'd like to use a GREL phrase like: "http://example.org/id/" + random + ".html"
CH_
  • 685
  • 1
  • 7
  • 18
0
votes
2 answers

add numbers down a column in OpenRefine

I'd like to automatically number a column. Similar to Excel, where I can type "1" in one cell and the cells below it automatically get numbered 2, 3, 4, 5, etc. I don't know why I'm having so much trouble figuring out this function on Openrefine but…
Gail G
  • 1