3

I understand that live.dbpedia.org is closer to a real time version of the dbpedia.org data, but that invites the question, how often is the regular dbpedia extraction/update process run? How often are the data dumps updated? Also, it's been said that the main endpoint incorporates other datasets in addition to what is extracted from Wikipedia.

What are the differences in data between dbpedia.org, live.dbpedia.org, and the data dumps?

Joshua Taylor
  • 84,998
  • 9
  • 154
  • 353
Kristian
  • 21,204
  • 19
  • 101
  • 176
  • 1
    Can you clarify the question? In "some of it doesn't behave the same" doesnt' tell us what differences you're seeing. [DBpedia Live](http://wiki.dbpedia.org/DBpediaLive) says, "DBpedia is considered the Semantic Web mirror of Wikipedia. Over time, Wikipedia articles are revised, which makes the data in DBpedia outdated. The main objective of DBpedia Live is to keep DBpedia always in synchronization with Wikipedia." Is there more to it than that? – Joshua Taylor Oct 03 '14 at 17:55
  • @JoshuaTaylor there is certainly more to the question. so I've tried to rewrite the entire question. perhaps now that it has been updated, you can see what I'm confused about. – Kristian Oct 03 '14 at 18:28

1 Answers1

4

I did some research on DBpedia for a project and I am going to share what I found out:

Q: The live-updates of DBpedia (changesets) have the structure year/month/day/hour/xxxx.nt.gz. What does it mean if there are some gaps in between, e.g. a folder for some hour is missing?

A: This means that the service was down at that time.

And DBpedia live - 3. new features (WayBackMachineLink)says:

5.Development of synchronization tool: The synchronization tool enables a DBpedia Live mirror to stay in synchronization with our live endpoint. It downloads the changeset files sequentially, decompresses them, and integrates them with another DBpedia Live mirror.

So I think if you're synchron with the live-endpoint when applying the changeset, the live endpoint is also applying the changeset.

Community
  • 1
  • 1
Fabian
  • 223
  • 4
  • 15