Questions tagged [data-warehouse]

A data warehouse (DW) is a database specifically structured/designed to aid in querying, analyzing and reporting (generating reports out of) of current and historical data. DWs are central repositories of integrated data from one or more disparate sources. Basic difference between a data warehouse and a set of DB tables is how the data is organized/structured.

A data warehouse is a subject-oriented, integrated, time-variant, and non-volatile collection of data used for querying, analyzing and reporting for decision-support purposes.

Data Mart is the access layer of the data warehouse. It serves a particular department like Marketing, HR, etc. Dedicated to business function and unit specifications, data marts make the information more focused and faster to find.

Some differences between Data mart and Data warehouse:

  • Data Warehouses have multiple subject areas with more detailed information. They are integrating all sources of data. Dimensional modelling is not necessary, but it feeds dimensional models
  • Data Marts usually hold one subject area with not that detailed information - often summarized. Concentrate on integrating information from one subject area or source system. Built on dimensional models like star-schema.

There are many products readily available that provide data warehousing capability e.g. MSAccess, Essbase (Hyperion, now Oracle), Cognos, Business Objects, MicroStrategy, ...

Basics of Data Warehousing:

  • Dimensional Modelling - Consist of the identify the measurements, or facts, that are given the context by their related dimensions. The grain of the fact table describes the level of detail at which the facts are recorded.

Main steps of relational modelling:

  • Choose the business process
  • Declare the grain
  • Identify the dimensions
  • Identify the fact

Online Analytical Processing (OLAP) and it's types (ROLAP, MOLAP, HOLAP, ...): Describes basics of the DB designs and pros/cons of each way. - A variety of different design patterns are used in a data warehouse environment. Some common approaches include: Normalized (5NF); DataVault; Anchor Modelling; Dimensional (5,6); other temporal (e.g. 6NF). - SQL: Describes how a Data Warehouse can be queried. Following is a list of basic keywords that every data warehouse developer must know: - JOIN - GROUPBY

At a high level the Data Warehousing can be divided into:

  • Tools (IBM Cognos, Microsoft Business Intelligence, Oracle Business Intelligence Enterprise, dition(OBIEE), Business Objects Enterprise XI, Jaspersoft, Talend Open studio, Pentaho, Qlikview etc) readily available and how to use them. Used for small to medium sized data sets. This usually requires [at least] knowledge of tool's:
    • data model and
    • user interface
  • Building your own data warehouse for specific usecases. Used when dealing with really huge data sets (e.g. the data collected by Google, Yahoo, Facebook or a couters/performance-management-data from a large telecommunication network. This usually requires [at least] knowledge of:
    • scalability, high availability and clustering concepts.
    • data warehouse (schema, queries, data model, ...) design.
    • available databases (Oracle, Clustra, Greenplum, MySQL, DB2, ...)
    • problem domain (implicit).
    • relevant GUI/UI (SWING, JSP, ...) and business logic (J2EE, C++, ...) technologies
2778 questions
0
votes
2 answers

Extract/Get the table name and their respective columns used in a view from a query in BigQuery

Let us take the following MYSQL query written in BigQuery for creating a view. SELECT a.col1, a.col2, a.col3, b.col1, b.col2, b.col3, c.col1, c.col2 FROM project_name_dataset_table_a a INNER JOIN…
Danish Bansal
  • 608
  • 1
  • 7
  • 25
0
votes
1 answer

How to change BI Answers user's password in OBIEE12c

BI Answers users manage in WebLogic 12c. I did find any options in BI answers that allow users to change his/her password. BI answers users need to communicate with WebLogic administrator to change his/her password. In OBIEE10g, there is an option…
Reja
  • 534
  • 1
  • 9
  • 18
0
votes
0 answers

How to run ODI12c mapping from bash to automate ODI mapping process

We have created ODI12c(12.2.1.2.6) mapping to load data from external table to staging table and staging table to fact table. Our database is oracle 19c. Now, we would like to automate ETL process by running ODI mapping from AIX 7.2 crontab to do…
Reja
  • 534
  • 1
  • 9
  • 18
0
votes
1 answer

Configuration Assistant (biee.py) Failed with Exit Value 1 at OBIEE12c

During installation and configuration of obiee12c(12.2.1.4), configuration assistant failed with an error. The following is the installation log. [2021-09-19T16:31:35.218+09:00] [bi] [NOTIFICATION] [] [oracle.bi.install.config.basesteps] [tid: 42]…
Reja
  • 534
  • 1
  • 9
  • 18
0
votes
0 answers

How would I merge these two tables into one table and create a recursive relationship?

I am totally confused on how I would merge these two tables and create a recursive relationship. I have attached a picture of the two tables.
0
votes
1 answer

OBIEE12c configuration assistant failed with Cannot find identity keystore file

I am getting the following error while run the obiee12c configuration assistant. weblogic.nodemanager.common.ConfigException: Identity key store file not found DemoIdentity.jks The following is the error log:
Reja
  • 534
  • 1
  • 9
  • 18
0
votes
1 answer

Implicit conversion on hash key column is causes very slow insert

I am trying to insert some data into some stage tables where the insert is taking far too long. For example a table containing up 600000 records is taking nearly an hour to complete. In the select part of the query we are are creating a hash of the…
Eseosa Omoregie
  • 175
  • 2
  • 11
0
votes
1 answer

Repository creation fails while upgrading OWB11gR1(11.1.0.7) to OWB11gR2(11.2.0.4)

I need to a new workspace in OWB11gR2(11.2.0.4) to upgrade OWB11gR1(11.1.0.7). Repository Assistant fails after processing 64%. The following is the error log. main.TaskScheduler timer[5]20200714@08:45:58.058: 00>…
Reja
  • 534
  • 1
  • 9
  • 18
0
votes
1 answer

What's the difference between master data and dimension data

I am kind of confused with the difference between master data and dimension data. Both of them are said to be relatively stable data, eg, organization information, employee information, producti information, compared with transactional data,such as…
Tom
  • 5,848
  • 12
  • 44
  • 104
0
votes
1 answer

Loading a .docx file into ETL/ELT tool?

Could someone please guide me on how to extract a .docx file and load it onto a database using an ETL(Extract-Transform-Load) or ELT(Extract-Load-Transform) tool? Assuming that the .docx file contains mostly unstructured data, isn't it an ELT tool I…
0
votes
1 answer

SSAS and calculated dimention from multiple other dims

Given 3 dimentions DimA and DimB and DimC with some DimSk, DimId and DimName as attributes the problem is defined as "add to cube new attribute which is calculated as": NewAttr = CASE WHEN DimC.DimId IN (1, 2, 3) THEN 'A ' + DimA.DimName WHEN…
SKK
  • 39
  • 1
  • 5
0
votes
2 answers

No active warehouse selected in the current session - Select an active warehouse with the 'use warehouse' command

I am following this tutorial: https://quickstarts.snowflake.com/guide/data_engineering_with_dbt/#4 when I try to run this in a worksheet: SELECT * FROM "KNOEMA_ECONOMY_DATA_ATLAS"."ECONOMY"."DATASETS" WHERE "DatasetName" ILIKE 'US Stock%' OR…
x89
  • 2,798
  • 5
  • 46
  • 110
0
votes
0 answers

Measures (values) showing only with specific rows selected

So I have an OLAP cube with many dimensions and measures defined, some of which are linked from another cube. In Excel, when I try to insert pivot table connected to that cube, in some cases a measures wouldn't show (will retrieve empty values).…
0
votes
1 answer

Oracle SQL, Combine Update or Merge with Join, Insert Surrogate Keys into Fact table using 2 dimensions tables

I have populated my dimension tables (Oracle SQL Dev.) For the next step I created the facts table (F_Orders), I loaded quantity, price , the Order_ID's and surrogate keys into the facts table. I want to Insert the latest Surrogate_ID's (customer,…
Deans
  • 3
  • 2
0
votes
1 answer

many to many relationship in oracle analytics

I'm using Oracle Analytics v5.9 , I have a datawarehouse including a bridge table for implementing a hierarchical dimension. I cannot find a way to join this table with my fact table and my dimension table. i even created a view joining bridge table…
1 2 3
99
100