Dimensional Data Modeling - Lecture 3

The document discusses dimensional data modeling, focusing on strategies for managing large dimension tables and the concept of degenerate dimensions. It explains different types of facts in data warehousing, including additive, semi-additive, non-additive, and factless facts, as well as the importance of transaction and snapshot facts. The lecture emphasizes the need to design data models that support specific business processes and analysis requirements.

Uploaded by

jhoncvivas

Dimensional Data Modeling

Lecture 3 – Dimensional Modeling Considerations

Monster Dimensions

• Case where the dimension table is very large (millions of rows), changes frequently, and contains a large number of attributes
• Example: Customer
• Assume there is a small set of variables that change often

Customer_Dim (joined to Sales_Fact)
•Name
•Address
•etc.

•Income
•Education
•Marital Status
•etc.

Monster Dimensions

• Strategy: split the variables that change frequently into their own table.
• Need to ‘band’ continuous variables like salary into ranges (20000-40000, 40000-60000, etc.).
• Each variable in the new table needs to have a small number of values.
• Need to create one row in the new table for each combination of values. Assume 2 variables with 5 values each. How many rows would be in the table?

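The banding and row-per-combination steps can be sketched in a few lines of Python. This is only an illustration: the band edges, the education values, and the names `INCOME_BANDS`, `EDUCATION`, `band_income` and `demo_rows` are assumptions, not part of the lecture. Half-open ranges are used so that a boundary value such as 40000 falls into exactly one band.

```python
from itertools import product

# Hypothetical band edges and category values, chosen only for illustration.
INCOME_BANDS = [(0, 20000), (20000, 40000), (40000, 60000),
                (60000, 80000), (80000, None)]
EDUCATION = ["None", "High school", "Bachelor", "Master", "Doctorate"]


def band_income(income):
    """Map a raw income to a band label; each band is [low, high)."""
    for low, high in INCOME_BANDS:
        if high is None:
            if income >= low:
                return f"{low}+"
        elif low <= income < high:
            return f"{low}-{high}"
    raise ValueError(f"income out of range: {income}")


# One label per band, then one row per combination of the two variables.
income_labels = [band_income(low) for low, _ in INCOME_BANDS]
demo_rows = [
    {"demo_key": key, "income_range": inc, "education": edu}
    for key, (inc, edu) in enumerate(product(income_labels, EDUCATION), start=1)
]

print(len(demo_rows))  # number of rows = product of the value counts
```

Because the row count is the product of the value counts, every extra variable or extra band multiplies the size of the new table, which is why each variable should have only a few values.
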
Monster Dimensions

• Answer: 25 (5*5)
• The ‘base’ Customer Dimension table contains the key to the new table. Keys of both the new table and the base table are placed as foreign keys on the fact table.

Customer_Dim
•Cust_key
•Name
•Address
•Demo_key

Demo_Dim
•Demo_key
•Income_Range
•Education
•Marital Status

Sales_Fact
•Cust_key
•Demo_Key

• What happens when customer income changes?

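One possible answer to the income question can be sketched with Python's built-in `sqlite3` module. The table and key names follow the slide; the column types, the extra `sale_date` and `amount` columns, and all sample rows are assumptions made for illustration. In this sketch the customer simply points to a different Demo_Dim row after the change, new fact rows carry the new key, and older fact rows keep the key that was valid when they were loaded.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Demo_Dim (
    demo_key INTEGER PRIMARY KEY,
    income_range TEXT, education TEXT, marital_status TEXT);
CREATE TABLE Customer_Dim (
    cust_key INTEGER PRIMARY KEY,
    name TEXT, address TEXT,
    demo_key INTEGER REFERENCES Demo_Dim(demo_key));
CREATE TABLE Sales_Fact (
    cust_key INTEGER REFERENCES Customer_Dim(cust_key),
    demo_key INTEGER REFERENCES Demo_Dim(demo_key),
    sale_date TEXT, amount REAL);

-- Hypothetical sample rows.
INSERT INTO Demo_Dim VALUES
    (1, '20000-40000', 'Bachelor', 'Single'),
    (2, '40000-60000', 'Bachelor', 'Single');
INSERT INTO Customer_Dim VALUES (100, 'A. Customer', '1 Main St', 1);
INSERT INTO Sales_Fact VALUES (100, 1, '2024-01-15', 50.0);
""")

# The customer's income moves into the next band: no new customer row is
# needed, only a different demo_key on the customer and on later facts.
conn.execute("UPDATE Customer_Dim SET demo_key = 2 WHERE cust_key = 100")
conn.execute("INSERT INTO Sales_Fact VALUES (100, 2, '2024-06-15', 80.0)")

# Each sale is still reported under the band that applied at the time.
sales_by_band = conn.execute("""
    SELECT d.income_range, SUM(f.amount)
    FROM Sales_Fact AS f
    JOIN Demo_Dim AS d ON d.demo_key = f.demo_key
    GROUP BY d.income_range
    ORDER BY d.income_range
""").fetchall()
print(sales_by_band)
```

The design choice here is that the large customer table is never versioned for demographic changes; the history lives in the fact table's demo key.
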
Degenerate Dimensions

• Consider an order. What is the grain of information we would want in a fact table?
• What information is left at the order level, other than the information that we place in the fact table (date, customer, product, quantity, etc.)?

Degenerate Dimensions

• The answer: probably only the order number.
• A degenerate dimension is one that has no attributes other than the key value.
• Strategy: make the order number an attribute of the fact table. It will look like a dimension key, but will not join to anything; it is just an attribute.
• This allows us to perform analysis at the order level (GROUP BY).
• What are some other degenerate dimensions?

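The order-level GROUP BY can be illustrated with a small `sqlite3` sketch. The table name `Order_Fact`, its columns, and the sample rows are hypothetical; the point is only that `order_number` sits on the fact table and is grouped on directly, with no dimension table to join to.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Grain: one row per order line. order_number is the degenerate dimension.
CREATE TABLE Order_Fact (
    date_key INTEGER, cust_key INTEGER, product_key INTEGER,
    order_number TEXT,
    quantity INTEGER, amount REAL);

-- Hypothetical sample rows.
INSERT INTO Order_Fact VALUES
    (20240115, 100, 1, 'ORD-1', 2, 20.0),
    (20240115, 100, 2, 'ORD-1', 1, 15.0),
    (20240116, 101, 1, 'ORD-2', 3, 30.0);
""")

# Order-level analysis: lines per order and order value.
order_totals = conn.execute("""
    SELECT order_number, COUNT(*) AS line_count, SUM(amount) AS order_amount
    FROM Order_Fact
    GROUP BY order_number
    ORDER BY order_number
""").fetchall()
print(order_totals)
```
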
Different types of facts

• There are multiple types: additive, semi-additive and non-additive.
• Additive: can ‘add’ the values across all dimensions (e.g., sales revenue).
• Semi-additive: certain types of facts are not ‘perfectly’ additive but represent a snapshot at a point in time (account balances, inventory balances). These cannot be treated the same as perfectly additive facts; in particular, they should not be summed across time.
• Non-additive: some facts can be textual. Basically, these can only be counted.
• Example of a non-additive fact?

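A short sketch of what semi-additive means in practice, using account balances. The sample data and the names `balances`, `total_by_month` and `avg_by_account` are assumptions for illustration. Balances are added across accounts within one month, but averaged (not added) across months for one account.

```python
from collections import defaultdict

# Hypothetical month-end balances: (account, month, balance).
balances = [
    ("A", "2024-01", 100.0),
    ("A", "2024-02", 150.0),
    ("B", "2024-01", 200.0),
    ("B", "2024-02", 250.0),
]

# Valid: add balances across accounts within a single month.
total_by_month = defaultdict(float)
for account, month, balance in balances:
    total_by_month[month] += balance

# Across time a sum is meaningless (January + February is not a balance),
# so an average over the months is used instead.
avg_by_account = {}
for account in sorted({acct for acct, _, _ in balances}):
    values = [bal for acct, _, bal in balances if acct == account]
    avg_by_account[account] = sum(values) / len(values)

print(dict(total_by_month))
print(avg_by_account)
```
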
Families of facts

• When designing a data warehouse, need to think of the process to be supported.
• It's important to realize that this translates into a set of related facts: a value chain. Example:
• Inquiry → Order → Shipment → Invoice → Return Credit

Transaction and Snapshot Facts

• When the operations of an organization are examined, it's important to realize that most organizations want to look at their information on both a transactional and a snapshot basis.
• Transaction basis: look at individual transactions (inventory movement, sales, ATM transactions, etc.). Allows analysis of patterns of behavior (time of day analysis, market basket analysis, etc.).

Transaction and Snapshot Facts

• Snapshot basis: typically created in addition to a transaction fact.
• Typically, create ‘snapshots’ at the end of specific reporting periods (month-end, etc.).
• Rolling snapshot: continuously update, then ‘publish’. The advantage is that this spreads the work.
• Examples: monthly sales; bank month-end account balance and monthly transaction counts, etc.

Snapshot Example

A snapshot for ATM usage, by account, by month.

ATM Activity Snapshot
•(Foreign_Keys)
•...
•Transaction count
•Account balance
•Revenue_earned
•Average_daily_balance
•...

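A minimal sketch of deriving such a snapshot from a transaction fact, by account and by month. The transaction layout, the opening balances, and the function name `build_monthly_snapshot` are assumptions; only the transaction count and the month-end balance are computed, and a month with no transactions gets no row in this simplified version.

```python
# Hypothetical ATM transactions: (account, ISO date, signed amount).
transactions = [
    ("A", "2024-01-05", -40.0),
    ("A", "2024-01-20", -60.0),
    ("A", "2024-02-03", 200.0),
    ("B", "2024-01-11", -20.0),
]
opening_balance = {"A": 500.0, "B": 300.0}


def build_monthly_snapshot(transactions, opening_balance):
    """Roll transactions up to one row per (account, month)."""
    running = dict(opening_balance)
    snapshot = {}
    # Process in date order so the last balance seen in a month is month-end.
    for account, date, amount in sorted(transactions, key=lambda t: (t[1], t[0])):
        month = date[:7]  # 'YYYY-MM'
        running[account] = running.get(account, 0.0) + amount
        row = snapshot.setdefault(
            (account, month), {"transaction_count": 0, "account_balance": 0.0}
        )
        row["transaction_count"] += 1                # additive measure
        row["account_balance"] = running[account]    # semi-additive measure
    return snapshot


monthly_snapshot = build_monthly_snapshot(transactions, opening_balance)
for key in sorted(monthly_snapshot):
    print(key, monthly_snapshot[key])
```

Running the same function repeatedly during the month and exposing the result only at month-end would correspond to the rolling snapshot described earlier.
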
Factless Facts

• Type of fact where there is no real measure (additive or otherwise)
• Typically, factless facts are related to events
• Attendance in class, for example
• Typically, add a dummy variable with a value of 1 for the purpose of counting