Questions tagged [star-schema]

Star schema is the most basic data warehousing dimensional structure and database schema, consisting of one or more fact tables referencing any number of dimension tables.

Star schema is the most basic data warehousing (data mart) dimensional structure and database schema, consisting of one or more fact tables referencing any number of dimension tables. It is organized like:

  • Facts - event that is counted or measured. They can be at a very atomic level. Generally assigned a surogate key to ensure every row can be identified.
  • Dimensions - reference information about facts. The records contain detailed information and attributes describing the fact data. Usually assigned a surrogate primary key.

Within the data warehouse and data mart the dimension table is associated with fact tables using foreign key relationship.

Advantages

In well-designed schemas multidimensional data can be analyzed easily. They are good in decision-support environments. Some of the advantages are query speed, data load and administration. The built-in referential integrity is good but not as in highly normalized databases though.

Examples

An example star schema could have a Sales Fact with Date, Product, and Store Dimensions. Any numerical performance measurements related to Sales would also be stored in the fact, like "Quantity Sold" and "Dollar Amount of Sale". The remaining columns in the Sales Fact would be foreign keys linking to the Dimensions.

More information

354 questions
0
votes
2 answers

Is the user table considered a dimension or part of fact

I am designing a star scheme db where the fact is user login. My dimensions are time (up to hour of day) and location (based on IP). I have about 300k users in my system. This size grows by 5k/year. Should the user details be part of the fact table…
Itay Moav -Malimovka
  • 52,579
  • 61
  • 190
  • 278
0
votes
1 answer

What is the Database impact: Querying a table with filter on two colums OR table with twice the records with filter on one single column?

Which has more database performance impact? Querying a short table (say 20 Million records) with filter on two columns or querying a long table twice as more records than the short table with filter on one column? To give some context: I have…
0
votes
2 answers

SSIS Error - Load data into Fact Table - The system reports 80 percent memory load

I've the following package in order to load data from dimensions to Fact Table: An when I execute this packages I getting the following error: Error: The system reports 80 percent memory load. There are 25769267200 bytes of physical memory with…
John_Rodgers
  • 181
  • 1
  • 11
0
votes
2 answers

Aggregation of 2 colums in result

i am trying to aggregate two columns in my result table but I can't get my head around it... I have a star schema and the following query: select 1und1.SubCategory,count(industry_facts.NACE_ID), statistics_2016.businesses from industry_facts,…
0
votes
1 answer

Data Warehousing - Snowflake Schema Normalization

To start, I am trying to differentiate from Star Schema and Snowflake Schema by illustrating them. But am having trouble trying to normalizing the table to create the snowflake schema. The attached image is the Star Schema enter image description…
AdrianAndrews
  • 33
  • 1
  • 6
0
votes
1 answer

Dimensions without any link to the fact table

I am currently designing a dimensional model where there are dimensions like orders, product, shipment, returns, items. My goal is to calculate metrics at day level and populate the fact table. The metrics are count of orders per day, total…
Teja
  • 13,214
  • 36
  • 93
  • 155
0
votes
2 answers

Data Warehouse schema design - how to improve schema model

I have to create Data Warehouse for travel agency. I'm doing it for the first time. I've learned all the basics about star, snowflake and constellation schema and about creating data warhouses. I would like to ask what could be changed for better…
0
votes
2 answers

should PAX be in Flighth Dimension or Fact Sales table?

I need to build a data mart using power pivot for a duty free shop at Airport. Sales manager is analying sales data using by flight number and by PAX, number of people per flight. So, I don't know where to put PAX. In DimFlight or FactSales. It is…
0
votes
1 answer

How to model an OLTP audit table in dimensional schema?

We have an audit table which we get from OLTP system, it records any activity done by the user including if he downloaded some attachment, or read some note or written some note , or any change for an incident etc.How do we include these audit table…
Jack
  • 1
  • 1
0
votes
1 answer

What happens to surrogate keys of transactional system , when converting it to dimensioanl schema?

Our OLTP systems use several surrogate keys .Now we want to create a dimensional model for our system for analysis. Should we keep OLTP system surrogate keys and natural keys and also create one more datamart surrogate key? or shall we ignore the…
0
votes
1 answer

Collapsing Rarely Changing Diminsions to a Fact Column

In star-schema design, dimension tables for fact table(s) are almost a must. In many business cases, the modeler can assume with acceptable confidence that certain dimension values will not change without control. For example the Gender is sometimes…
0
votes
1 answer

Can non-additive facts be part of transactional fact table

I know there are three types of facts, and I've read that Transactional-Fact tables have fully additive facts which are the most useful type, but can non-additive facts be there as well? Or even semi-additive facts for that matter. I'm asking this…
jruivo
  • 447
  • 1
  • 8
  • 22
0
votes
1 answer

How to create Star Schema Benchmark(SSB) tables?

I am trying to generate SSB tables by getting github. https://github.com/electrum/ssb-dbgen but, it can't run the `dbgen' program from it. Please how to create the tables Correctly.
0
votes
1 answer

SSIS Surrogate Key incrementation

I'm using SSIS to create a star schema for a data warehouse with surrogate keys (sg). My process goes like this: find max sg (using SQL) in data flow: data source-> c# script that adds +1 to the max sg -> write to destination. Now, with fixed…
mRiddle
  • 214
  • 1
  • 7
  • 22
0
votes
1 answer

Main concept of dimension and DMQL?

I am creating a test DWH for personal purpose. I have read about star,snowflake and constallation schemas concepts, and for initial purposes I will use star schema. The RDBMS I am using is postgreSQL. Throug the bibliography I read I have some…
mrc
  • 2,845
  • 8
  • 39
  • 73