I am currently reading up on Data Vault 2.0 in the book "Building a Scalable Data Warehouse with Data Vault 2.0".
I have a question about Links which I find a little hard to wrap my head around. In the book, the author states that Links do not capture time-lines like "valid from" and "valid to" or "begin date" and "end date".
The direct quote on why this is, goes like this:
"Adding temporality to link structures, for example by adding a Begin and End Date, bounds the relationship to a single timeline and forces the data warehouse to start and stop this relationship only once."
From my understanding, the reasoning in the quote can be translated to an example:
Lets imagine that we are a firm that sells gold, silver and cobber and we choose to hold all of these in separate hubs. We will have some kind of links from each of these hubs to other hubs. Now lets say that we stop selling cobber which would result in the death of the Link from the cobber hub.
2 years goes by, and we now choose to sell cobber again so our Link fires back up and is live once again.
In this example, we have 2 periods where the Link is active:
- from beginning of the Link to when we stop selling cobber
- and again from when we start selling cobber once again and until we do anything further
This would seem to be two time lines, so why is this a bad idea to store in the link? From what I gather, this does not change the fact that we can see which period is the latest, so I am missing something.