Questions tagged [star-schema]

Star schema is the most basic data warehousing dimensional structure and database schema, consisting of one or more fact tables referencing any number of dimension tables.

Star schema is the most basic data warehousing (data mart) dimensional structure and database schema, consisting of one or more fact tables referencing any number of dimension tables. It is organized like:

  • Facts - event that is counted or measured. They can be at a very atomic level. Generally assigned a surogate key to ensure every row can be identified.
  • Dimensions - reference information about facts. The records contain detailed information and attributes describing the fact data. Usually assigned a surrogate primary key.

Within the data warehouse and data mart the dimension table is associated with fact tables using foreign key relationship.

Advantages

In well-designed schemas multidimensional data can be analyzed easily. They are good in decision-support environments. Some of the advantages are query speed, data load and administration. The built-in referential integrity is good but not as in highly normalized databases though.

Examples

An example star schema could have a Sales Fact with Date, Product, and Store Dimensions. Any numerical performance measurements related to Sales would also be stored in the fact, like "Quantity Sold" and "Dollar Amount of Sale". The remaining columns in the Sales Fact would be foreign keys linking to the Dimensions.

More information

354 questions
1
vote
0 answers

Teradata Aggregation operation of a Billion row fact table with some 14 dimensions. SERIOUS SQL Tuning Stuff. Interested ? get TGI card from me

I am trying to optimize this query which is patterned as below . There is a 1 Trillion row fact (F1) table ( supposedly fork lifted out of another propriety into TD and no one even has taken the hassle of changing the Data Model to a more TD…
1
vote
0 answers

Star schema for target and actual comparison Kimball

I am going to model one of the star schemas for a university data warehousing project. We need to compare the actual application count with a target. There are target counts (set by the colleges every year) associated with Departments, Course…
user1254579
  • 3,901
  • 21
  • 65
  • 104
1
vote
1 answer

How to solve this? Use a bridge table?

I am designing a star schema and come across this problem. I have two dimension table - a Product (Key = Product Id) and a Customer (Key = Customer Id). In another table there are three fields - Business Unit, Product Id and Customer Id.In this…
Arun.K
  • 103
  • 2
  • 4
  • 21
1
vote
1 answer

modelling "questions" & "answers" dimensions where answer can be multiple choice or freetext

I'm building a star schema to allow reporting against usage of an application, but have hit a stumbling block on two points: Some answers can be free text. I need to report on what questions were not answered by what users. To…
wilson_smyth
  • 1,202
  • 1
  • 14
  • 39
1
vote
3 answers

Difference between sql query aggregation and aggegration and querying an OLAP cube

I have a query with respect to the advantages of building a OLAP cube vs aggregating data in database table for querying ,data of say 6 months and then archiving the sql table later for analytics purpose. Which one is better, table or OLAP cube? and…
Argho Chatterjee
  • 579
  • 2
  • 9
  • 26
1
vote
1 answer

Handling change of grain for a snapshot fact table in a star-schema

The question How do you handle a change in grain (from weekly measurement to daily measurement) for a snapshot fact table. Background info For a star-schema design I want to incorporate the results of a survey as a fact (e.g. in week 2 of 2015 80%…
1
vote
2 answers

Name Value pairs and fact tables

I'm working on a star schema for analysis of posted form data. The site that the form data will be posted to is actually external to the site hosting the form, so only the data in the form will be available. I'm going to give the option to include…
1
vote
2 answers

Dimensional data warehouse customer with multiple accounts

I'm designing and building a sales fact table in a star schema and I can't seem to work out how to go about the following problem: A customer can have 1 or 2 accounts, but an account can only belong to 1 customer. So this is a 1 to many…
1
vote
2 answers

Designing a Data Warehouse/ Star Schema - Choosing facts

Consider a crowdfunding system whereby anyone in the world can invest in a project. I have the normalized database design in place and now I am trying to create a data warehouse it (OLAP). I have come up with the following: This has been…
Jack
  • 319
  • 6
  • 16
1
vote
1 answer

Hierarchies and levels (Pentaho schema workbench)?

I'm new in BI world and I have a lot of questions. I have to do a BI home work project, so I decided to use: MYSQL (database) Pentaho Kettle (ETL) Pentaho schema workbench (star schema) QlikView (reporting) I have a dimension table which is…
user3480925
  • 43
  • 2
  • 6
1
vote
1 answer

Star Schema Structure - To many Dimensions

I have a star schema warehouse (MS SQL Server, accessed via MS Report Builder with OLAP) which has a lot of tiny dimensions - by this I mean the dimensions are built from two columns (Id and Description) with several hundred linked from the Fact…
user3644997
  • 89
  • 1
  • 9
1
vote
1 answer

hierarchical modeling data warehouse - snowflake or star?

Hi I am doing a project on datawarehouse and i am not sure whether i correctly modeling my datawarehouse. My datawarehouse is not on business process thus i find very little information about that. Basically I have alot of library file and each…
xvi_16
  • 115
  • 1
  • 1
  • 10
1
vote
0 answers

Scalability of OLAP Cubes - Impact of Rows/Columns, hierarchical order of attributes, empty/redundant attributes

In order to properly redesign some legacy OLAP cubes, I need to understand the general scalability and some specific drivers of OLAP cube speed: General: How do OLAP cubes approximately scale for rows and columns (attributes)? e.g., I would assume…
ateich
  • 203
  • 3
  • 5
  • 16
1
vote
1 answer

select all values from a dimension for which there are facts in all other dimensions

I've tried to simplify for the purposes of asking this question. Hopefully, this will be comprehensible. Basically, I have a fact table with a time dimension, another dimension, and a hierarchical dimension. For the purposes of the question, let's…
ideasculptor
  • 1,226
  • 3
  • 14
  • 20
1
vote
2 answers

Fact table organization

I am participating in creation of reporting software which utilizes Kimball star schema methodology. Entire team (including me) hasn't worked with this technology so we are new in this. There are couple of dimension and fact tables in or system so…