Questions tagged [data-vault]

Data Vault is a methodology and architecture created to address the business of designing, implementing, and managing a data warehouse.

73 questions
1
vote
0 answers

Modifying Parameter in Table Value Function - Datetime - Local to UTC

I am working in a type 2 data vault and currently working on a function to try and simulate a data cube for certain users to do fast data pulls from specific snapshots. At the moment i can modify the code to make it work by convert the paramerter…
Dheebs
  • 398
  • 1
  • 6
  • 19
1
vote
2 answers

Data vault: Hash keys in staging table - advanced

I'm looking into using the datavault 2.0 methodology. I understand the reasons for hashing and trying to apply it. I'd like to apply this in the "staging" phase of the datavault rather than loading it into the DV. If a table has a business key in…
radialmind
  • 279
  • 2
  • 15
1
vote
0 answers

DataVault modelling for Domain Reference Table

Folks, Quick Version: How should I model HUBs, SATs and LINKs when I have multiple domain lookup references in my HUB_SAT? If you are were to generically model these from the source schema, how would you differentiate between FKs that should be…
Aaron Reese
  • 544
  • 6
  • 18
1
vote
1 answer

Data Vault, Links between hubs and lookup tables

I have an entity in my data vault that has a hub, satellite tables, and links to other entities with hubs. I have a separate entity in my data vault that only serves a singular purpose of being a lookup tables. The lookup table is a simple…
1
vote
2 answers

How to use cross reference table in a database modeled using Data Vault principles?

I have a Person Satellite with a Gender attribute. From source systems the values for this attribute can be: F, M, FEMALE, or MALE. Which of the two following approaches is the correct one for Data Vault modeling? Store data in Gender as it comes…
Cesar Vinas
  • 343
  • 1
  • 10
  • 20
1
vote
1 answer

Complex Join to Return SuperSet

Evening All, I have been chipping away at this one for a while and for some reason i just can't seem to get my logic to return the way I expect it to. I have 3 Data tables as well as 3 business concept linking tables. Table1…
Dheebs
  • 398
  • 1
  • 6
  • 19
1
vote
5 answers

Data Vault in Redshift and ETL Strategy

I have a bunch of data in files stored in Amazon S3 and am planning to use it to build a Data Vault in Redshift. My first question is if the right approach is to build the DV and Data Marts all in Redshift or if I should consider the S3 as my Data…
Cesar Vinas
  • 343
  • 1
  • 10
  • 20
1
vote
1 answer

Data Vault 2.0: non-reference identification for LINK

I wish to represent an entity that is identified both by the the identities of other entities (foreign keys), and an extra bit of information. Below is a specific scenario. Entities: Products Customers When a customer purchases a product, an order…
Khanetor
  • 11,595
  • 8
  • 40
  • 76
1
vote
1 answer

why using sequence number against version number in dimension table by datawarehouse modeling

In the context of dimension modeling, as typical case, it is great to have surrogate key in dimension table to track the change of rows(http://www.kimballgroup.com/2006/07/design-tip-81-fact-table-surrogate-key/). There are three common ways to…
Hello lad
  • 17,344
  • 46
  • 127
  • 200
0
votes
1 answer

Datavault modeling where source data is slightly heterogenous

I am working with a database table that can have multiple types of data within it. Here is a trivial example (Orders can be either b2c, b2b or internal transfers): | OrderId | B2C Customer | B2B Customer | Internal | ShippingDate | OrderTotal | ...…
Erik
  • 898
  • 2
  • 8
  • 28
0
votes
1 answer

Absence of temporality to Links in Data Vault 2.0

I am currently reading up on Data Vault 2.0 in the book "Building a Scalable Data Warehouse with Data Vault 2.0". I have a question about Links which I find a little hard to wrap my head around. In the book, the author states that Links do not…
mbih
  • 5
  • 5
0
votes
2 answers

Add Valid_From & Valid_To Columns to Table w/ only load_timestamp - where relationships can revert back

I have the table below and I want to write a query that add's two columns: Valid From = Indicating when a relationship between account & opp started Valid To = Indicating when a relationship between account & opp ended **NOTE THAT THE INSERT…
0004
  • 1,156
  • 1
  • 14
  • 49
0
votes
1 answer

Data Vault 2.0: Creating an Effectivity Satellite off link table that only has insert_date for each record

I want to create an Effectivity Satellite off the link table below ( you can open and create in http://sqlfiddle.com/ via the code below). CREATE TABLE l_opportunity_account ( opportunity_account_hash_id VARCHAR(100), load_timestamp TIMESTAMP, …
0004
  • 1,156
  • 1
  • 14
  • 49
0
votes
1 answer

How can I specify a table name in the DBT merge process?

I'm trying to create a merge statement using dbt. Here is my code: {{ config( materialized='incremental', unique_key=(['five9_calls_hk', 'effective_from']), incremental_strategy='merge' ) }} WITH using_clause AS…
Scott Wood
  • 1,077
  • 3
  • 18
  • 34
0
votes
0 answers

Generate Metric on how frequently an individual table attribute changes , ideally relative to another

I am building out a data vault & specifically working through when to breakout a satellite table into sub-satellite's by attributes that change at a different rate That being said, I only have the modified date at the record level ie when any…
0004
  • 1,156
  • 1
  • 14
  • 49