Questions tagged [star-schema]

Star schema is the most basic data warehousing dimensional structure and database schema, consisting of one or more fact tables referencing any number of dimension tables.

Star schema is the most basic data warehousing (data mart) dimensional structure and database schema, consisting of one or more fact tables referencing any number of dimension tables. It is organized like:

  • Facts - event that is counted or measured. They can be at a very atomic level. Generally assigned a surogate key to ensure every row can be identified.
  • Dimensions - reference information about facts. The records contain detailed information and attributes describing the fact data. Usually assigned a surrogate primary key.

Within the data warehouse and data mart the dimension table is associated with fact tables using foreign key relationship.

Advantages

In well-designed schemas multidimensional data can be analyzed easily. They are good in decision-support environments. Some of the advantages are query speed, data load and administration. The built-in referential integrity is good but not as in highly normalized databases though.

Examples

An example star schema could have a Sales Fact with Date, Product, and Store Dimensions. Any numerical performance measurements related to Sales would also be stored in the fact, like "Quantity Sold" and "Dollar Amount of Sale". The remaining columns in the Sales Fact would be foreign keys linking to the Dimensions.

More information

354 questions
2
votes
2 answers

Solve one-to-many relationship in Data warehouse?

I'm a new member of Data warehouse. I'm designing model alter Star Schema. I have a 2 table: Contract table & Complain table. One Contract can have one or more value in Complain table. You can see 2 photo below. in facttable, 1 contract only have…
2
votes
3 answers

Is there a name for this type of data structure?

Is there a name for a type of Star Schema in which there is a single Fact table which has a single column for value, and the type of value (the measure) is defined by a dimension? In other words, a table like this: Dim1ID Dim2ID MeasureID …
codeulike
  • 22,514
  • 29
  • 120
  • 167
2
votes
2 answers

star schema for implementing near real time analytics with changing data

I implementing analytics for a medical software. The data to be processed is mainly appointment related. I'm planing to implement star schema for generating reports. I have a few doubts My data can change like a appointment can be marked as…
abhinavlal
  • 88
  • 2
  • 10
2
votes
1 answer

What is the best way to model baselines, projections, or goals in dimensional modeling?

I'm designing on my first real Kimball-style star schema dimensional model. I've slogged through a lot of his books, and I'm part-way through Star Schema - The Complete Reference by Adamson (which I find much more practical and straightforward than…
2
votes
1 answer

Snowflaking Date dimension

In my star schema, I have a project dimension which has columns such as start_date, finish_date, service_date, onhold_date, resume_date etc. Should I introduce foreign keys for all the dates in the fact table and connect them to a date dimension or…
Ravi
  • 2,470
  • 3
  • 26
  • 32
2
votes
4 answers

Application insight -> export -> Power BI Data Warehouse Architecture

Our team have just recently started using Application Insights to add telemetry data to our windows desktop application. This data is sent almost exclusively in the form of events (rather than page views etc). Application Insights is useful only up…
2
votes
1 answer

NoSQL as a data mining solution?

In what ways are NoSQL databases be more useful in data mining than say OLAP databases or how is it less useful? Is there an advantage in having a fast data-retrieval from gigantic volume of data but also having a schema-less database?
DazedNConfused
  • 189
  • 2
  • 13
2
votes
1 answer

What is a good web application SQL Server data mart implementation in ElasticSearch?

Coming from a RDBMS background and trying to wrap my head around ElasticSearch data storage patterns... Currently in SQL Server, we have a star schema data mart, RecordData. Rows are organized by user ID, geographic location that pertains to the…
ElHaix
  • 12,846
  • 27
  • 115
  • 203
2
votes
0 answers

Star schema role in the Pentaho Mondrian OLAP server

I'm trying to understand the OLAP paradigm. More specifically I would like to do some OLAP queries on our 'OLTP' data using the Pentaho Mondrian OLAP server. I have read the online documntation, but I don't understand the data flow being used by…
Jan Krakora
  • 2,500
  • 3
  • 25
  • 52
2
votes
1 answer

Best practice for natural keys in a dimension that includes data from multiple source table

I am designing a dimension for a data warehouse that includes several related attributes from various tables. When loading the fact tables I generally like to look up the surrogate keys from the dimension tables based on keys from from the source…
wshato
  • 513
  • 3
  • 16
2
votes
2 answers

Postgresql - window functions within window functions

Facing a query design issue and not sure about whether my approach to the problem is unnecessarily complicated: I have a fact table: Column | Type | Modifiers …
Slania
  • 91
  • 1
  • 7
2
votes
0 answers

Drill Across Functionality in Tabular Model

We've a Tabular model which has 3 facts and few dimensions. Out of the 3 facts 2 are about Account & Products. Facts realated to acocunt of a customer. How many are active, how much is the current balance, how much was the balance at the start etc.…
2
votes
2 answers

Data warehouse multivalued attributes

Disclaimer: I have never created a data warehouse before. I have read several chapters of Kimball's Data Warehouse Toolkit. Background: Plant (factory) management team needs to be able to slice and dice production information in various ways, and we…
tonyapolis
  • 201
  • 2
  • 12
2
votes
1 answer

Nulls in dimension table for numeric attributes

What is the best way to handle missing values in a dimension table? In the case of a textual column, it is easy to write "NA: Missing," but what should be done for numeric columns where it is important to retain the specific values. Note: I do not…
d_a_c321
  • 533
  • 1
  • 11
  • 23
2
votes
0 answers

OLAP Cube design issue - how to remove many to one relation

I had include my projectdetail in my previous question OLAP Cube design issue for Telecommunication Data. I had one question Is it necessary to remove one to many relationship on OLAP. What I heard is that OLAP should be able to give answer for zero…
nKandel
  • 2,543
  • 1
  • 29
  • 47