Currently we have 12 different databases and 7 of them are dimensional. We are a non-profit knowledge based org where we have databases based on kind of disease the person has.
eg. our databases look like
- HIV
- Hepatitis C
- Meningitis
and so on...
Each of these would have data with tables such as:
Patient
Sample( blood samples)
location
diagnosis
Gender
Provider
We dont keep track on how much money was spent as we just keep track of +ve and -ve samples.
Now, question has come into upper management that we should build a Datawarehouse from the silo Data marts.
But, business users have never asked a question where they would need data from two different databases. Do we still need DW if users have not even thought about it?
Some more questions which came in my mind were:
- What kind of granularity for each of those datamarts?
- Which dimension could act as Conformed Dimension?
- How would the ETL flow?
- Achieve the single version of truth across all the DM's?
I am just doing an initiative to understand what could be a solution to the situation we are in. Any help is appreciated.
Thanks