Data Vault is a methodology and architecture created to address the business of designing, implementing, and managing a data warehouse.
Questions tagged [data-vault]
73 questions
1
vote
0 answers
Modifying Parameter in Table Value Function - Datetime - Local to UTC
I am working in a type 2 data vault and currently working on a function to try and simulate a data cube for certain users to do fast data pulls from specific snapshots.
At the moment i can modify the code to make it work by convert the paramerter…

Dheebs
- 398
- 1
- 6
- 19
1
vote
2 answers
Data vault: Hash keys in staging table - advanced
I'm looking into using the datavault 2.0 methodology. I understand the reasons for hashing and trying to apply it. I'd like to apply this in the "staging" phase of the datavault rather than loading it into the DV.
If a table has a business key in…

radialmind
- 279
- 2
- 15
1
vote
0 answers
DataVault modelling for Domain Reference Table
Folks,
Quick Version:
How should I model HUBs, SATs and LINKs when I have multiple domain lookup references in my HUB_SAT?
If you are were to generically model these from the source schema, how would you differentiate between FKs that should be…

Aaron Reese
- 544
- 6
- 18
1
vote
1 answer
Data Vault, Links between hubs and lookup tables
I have an entity in my data vault that has a hub, satellite tables, and links to other entities with hubs. I have a separate entity in my data vault that only serves a singular purpose of being a lookup tables. The lookup table is a simple…

Nicholas Morrison
- 31
- 4
1
vote
2 answers
How to use cross reference table in a database modeled using Data Vault principles?
I have a Person Satellite with a Gender attribute. From source systems the values for this attribute can be: F, M, FEMALE, or MALE. Which of the two following approaches is the correct one for Data Vault modeling?
Store data in Gender as it comes…

Cesar Vinas
- 343
- 1
- 10
- 20
1
vote
1 answer
Complex Join to Return SuperSet
Evening All,
I have been chipping away at this one for a while and for some reason i just can't seem to get my logic to return the way I expect it to.
I have 3 Data tables as well as 3 business concept linking tables.
Table1…

Dheebs
- 398
- 1
- 6
- 19
1
vote
5 answers
Data Vault in Redshift and ETL Strategy
I have a bunch of data in files stored in Amazon S3 and am planning to use it to build a Data Vault in Redshift. My first question is if the right approach is to build the DV and Data Marts all in Redshift or if I should consider the S3 as my Data…

Cesar Vinas
- 343
- 1
- 10
- 20
1
vote
1 answer
Data Vault 2.0: non-reference identification for LINK
I wish to represent an entity that is identified both by the the identities of other entities (foreign keys), and an extra bit of information. Below is a specific scenario.
Entities:
Products
Customers
When a customer purchases a product, an order…

Khanetor
- 11,595
- 8
- 40
- 76
1
vote
1 answer
why using sequence number against version number in dimension table by datawarehouse modeling
In the context of dimension modeling, as typical case, it is great to have surrogate key in dimension table to track the change of rows(http://www.kimballgroup.com/2006/07/design-tip-81-fact-table-surrogate-key/).
There are three common ways to…

Hello lad
- 17,344
- 46
- 127
- 200
0
votes
1 answer
Datavault modeling where source data is slightly heterogenous
I am working with a database table that can have multiple types of data within it.
Here is a trivial example (Orders can be either b2c, b2b or internal transfers):
| OrderId | B2C Customer | B2B Customer | Internal | ShippingDate | OrderTotal | ...…

Erik
- 898
- 2
- 8
- 28
0
votes
1 answer
Absence of temporality to Links in Data Vault 2.0
I am currently reading up on Data Vault 2.0 in the book "Building a Scalable Data Warehouse with Data Vault 2.0".
I have a question about Links which I find a little hard to wrap my head around. In the book, the author states that Links do not…

mbih
- 5
- 5
0
votes
2 answers
Add Valid_From & Valid_To Columns to Table w/ only load_timestamp - where relationships can revert back
I have the table below and I want to write a query that add's two columns:
Valid From = Indicating when a relationship between account & opp started
Valid To = Indicating when a relationship between account & opp ended
**NOTE THAT THE INSERT…

0004
- 1,156
- 1
- 14
- 49
0
votes
1 answer
Data Vault 2.0: Creating an Effectivity Satellite off link table that only has insert_date for each record
I want to create an Effectivity Satellite off the link table below ( you can open and create in http://sqlfiddle.com/ via the code below).
CREATE TABLE l_opportunity_account (
opportunity_account_hash_id VARCHAR(100),
load_timestamp TIMESTAMP,
…

0004
- 1,156
- 1
- 14
- 49
0
votes
1 answer
How can I specify a table name in the DBT merge process?
I'm trying to create a merge statement using dbt. Here is my code:
{{
config(
materialized='incremental',
unique_key=(['five9_calls_hk', 'effective_from']),
incremental_strategy='merge'
)
}}
WITH
using_clause AS…

Scott Wood
- 1,077
- 3
- 18
- 34
0
votes
0 answers
Generate Metric on how frequently an individual table attribute changes , ideally relative to another
I am building out a data vault & specifically working through when to breakout a satellite table into sub-satellite's by attributes that change at a different rate
That being said, I only have the modified date at the record level ie when any…

0004
- 1,156
- 1
- 14
- 49