MultiDimensional Data Model

Uploaded by

ap.itpcbt


A multidimensional data model organizes data so that it can be viewed along multiple dimensions,
such as product, time, and location.

Features of multi-dimensional data model

• It allows users to ask analytical questions that span multiple dimensions, which helps
reveal market or business trends.

• OLAP (online analytical processing) and data warehousing use multidimensional databases.

• It represents data in the form of data cubes. Data cubes allow data to be modeled and viewed
from many dimensions and perspectives.

• It is defined by dimensions and facts and is centered on a fact table. Facts are numerical
measures, and the fact table contains the names of the facts (measures) together with keys to
each of the related dimension tables.

Multidimensional Data Representation

Working on a Multidimensional Data Model

Every project that builds a multidimensional data model should follow these stages:

Stage 1: Assembling data from the client

In the first stage, accurate data is collected from the client. Software professionals typically
explain to the client the range of data that can be obtained with the selected technology, and
then collect the complete data in detail.

Stage 2: Grouping different segments of the system

In the second stage, all the data is recognized and classified into the respective sections it
belongs to, which also makes it easier to apply step by step.

Stage 3: Noticing the different proportions

The third stage is the basis on which the design of the system rests. In this stage, the main
factors are recognized according to the user's point of view. These factors are also known as
"dimensions".

Stage 4: Preparing the actual-time factors and their respective qualities

In the fourth stage, the factors recognized in the previous stage are used to identify their
related qualities. These qualities are also known as "attributes" in the database.

Stage 5: Finding the actuality of factors which are listed previously and their qualities

In the fifth stage, the facts (the "actuality") are separated and distinguished from the factors
collected so far. These facts play a significant role in the arrangement of a multidimensional
data model.

Stage 6: Building the schema to place the data, with respect to the information collected from
the steps above

In the sixth stage, a schema is built on the basis of the data collected in the previous stages.

Example to Understand Multidimensional Data Model

1. Let us take the example of a firm. The revenue of a firm can be analyzed on the basis of
different factors, such as the geographical location of the firm's workplace, the products of the
firm, the advertisements run, the time taken to develop a product, etc.

Example 1

2. Let us take the example of a factory that sells products per quarter in Bangalore. The data is
represented in the table given below:

2D factory data

In the table above, the factory's sales for Bangalore are shown with respect to the time
dimension, which is organized into quarters, and the item dimension, which is organized by the
kind of item sold. The facts are given in thousands of rupees.

Now suppose we want to view the sales data in three dimensions, according to item, time, and
location (for example Kolkata, Delhi, and Mumbai). The three-dimensional data can still be laid
out as a two-dimensional table, as shown below:

3D data representation as 2D

Conceptually, this data can be represented in three dimensions, as shown in the image below:
3D data representation
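
As a rough illustration of the two views described in this example, the sketch below stores sales
facts keyed by (location, item, quarter) in plain Python. Fixing the location gives back a
two-dimensional item-by-quarter table, and summing along one axis gives totals per dimension
value. The item names and all figures are invented placeholders, not the values from the tables
in this document, and the helper names are hypothetical.

```python
from collections import defaultdict

# Hypothetical sales facts in thousands of rupees, keyed by
# (location, item, quarter). The numbers are invented for illustration.
facts = {
    ("Kolkata", "Keyboard", "Q1"): 100,
    ("Kolkata", "Mouse", "Q1"): 50,
    ("Delhi", "Keyboard", "Q1"): 80,
    ("Delhi", "Mouse", "Q2"): 40,
    ("Mumbai", "Keyboard", "Q2"): 120,
}


def slice_by_location(cube, location):
    """Fix the location dimension and return a 2D (item, quarter) table."""
    return {
        (item, quarter): value
        for (loc, item, quarter), value in cube.items()
        if loc == location
    }


def total_by(cube, axis):
    """Sum the measure per value of one dimension (0=location, 1=item, 2=quarter)."""
    totals = defaultdict(int)
    for key, value in cube.items():
        totals[key[axis]] += value
    return dict(totals)


kolkata_table = slice_by_location(facts, "Kolkata")
sales_per_item = total_by(facts, 1)
print(kolkata_table)
print(sales_per_item)
```

A dictionary keyed by tuples is only one possible way to hold a small cube; real OLAP engines use
specialized storage, which this sketch does not attempt to model.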

Features of multidimensional data models

• Measures: Measures are numerical values like sales or revenue that can be analyzed. They
are stored in fact tables in a multidimensional model.

• Dimensions: Dimensions are descriptive attributes like time, location, or product that give
context to measures. They are stored in dimension tables.

• Cubes: Cubes organize data into multiple dimensions, linking measures and dimensions
for fast and flexible analysis.

• Aggregation: Aggregation summarizes data (e.g., total sales by month), allowing users to
view data at different levels of detail.

• Drill-down: View data in more detail (e.g., from year → month).

• Roll-up: View data in summary (e.g., from day → quarter).

These help explore data across levels.

• Hierarchies: Hierarchies arrange dimensions into levels (e.g., Year > Quarter > Month >
Day), supporting drill-down and roll-up.

• OLAP (Online Analytical Processing): OLAP tools allow quick analysis of large data sets
using cubes, hierarchies, and aggregation for complex queries.
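
The aggregation, hierarchy, roll-up, and drill-down features listed above can be sketched in a
few lines of Python. This is a minimal illustration under stated assumptions: the daily sales
figures are invented, and `aggregate` and `LEVELS` are hypothetical names, not part of any OLAP
tool.

```python
from collections import defaultdict

# Hypothetical daily sales facts: (year, quarter, month, day) -> amount.
daily_sales = {
    (2024, "Q1", "Jan", 5): 10,
    (2024, "Q1", "Jan", 20): 15,
    (2024, "Q1", "Feb", 3): 20,
    (2024, "Q2", "Apr", 9): 30,
}

# Time hierarchy from the coarsest to the finest level.
LEVELS = ("year", "quarter", "month", "day")


def aggregate(facts, level):
    """Sum the measure at the given hierarchy level.

    Keys are truncated to the prefix that ends at `level`, so choosing a
    coarser level is a roll-up and choosing a finer level is a drill-down.
    """
    depth = LEVELS.index(level) + 1
    totals = defaultdict(int)
    for key, amount in facts.items():
        totals[key[:depth]] += amount
    return dict(totals)


by_quarter = aggregate(daily_sales, "quarter")  # roll-up: day -> quarter
by_month = aggregate(daily_sales, "month")      # drill-down: quarter -> month
print(by_quarter)
print(by_month)
```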

Advantages and Disadvantages of the Multidimensional Data Model

Advantage | Disadvantage
Easy to handle | Requires skilled professionals
Simple to maintain | Complex structure
Better performance than relational databases | System performance drops if cache fails
More intuitive data representation (multi-viewed) | Dynamic and harder to design
Handles complex systems and applications well | Longer path to final output

Schemas for multidimensional data

A schema is a logical description of the entire database. It includes the name and description
of records of all record types, including all associated data items and aggregates. Much like a
database, a data warehouse also needs to maintain a schema. A database uses the relational
model, while a data warehouse uses the Star, Snowflake, and Fact Constellation schemas. This
section discusses the schemas used in a data warehouse.

Star Schema

• Each dimension in a star schema is represented by only one dimension table.

• This dimension table contains the set of attributes.

• The following diagram shows the sales data of a company with respect to four dimensions,
namely time, item, branch, and location.

• There is a fact table at the center. It contains the keys to each of the four dimensions.

• The fact table also contains the measures, namely dollars sold and units sold.

Note − Each dimension has only one dimension table, and each table holds a set of attributes. For
example, the location dimension table contains the attribute set {location_key, street, city,
province_or_state, country}. This constraint may cause data redundancy. For example, the cities
"Vancouver" and "Victoria" are both in the Canadian province of British Columbia, so the entries
for these cities repeat the same values for the attributes province_or_state and country.
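
The star layout described above can be sketched with Python's built-in `sqlite3` module. This is
a minimal sketch under stated assumptions: only the location attribute set comes from the text;
the table name `time_dim`, the attributes of the time and item dimensions, and all rows are
assumptions made for illustration, and the branch dimension is left out to keep the example
short.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Location dimension with the attribute set given in the text.
CREATE TABLE location (
    location_key INTEGER PRIMARY KEY,
    street TEXT, city TEXT, province_or_state TEXT, country TEXT
);
-- Assumed attribute sets for the time and item dimensions.
CREATE TABLE time_dim (time_key INTEGER PRIMARY KEY, quarter TEXT, year INTEGER);
CREATE TABLE item (item_key INTEGER PRIMARY KEY, item_name TEXT, brand TEXT);
-- Fact table at the center: one key per dimension plus the two measures.
CREATE TABLE sales (
    time_key INTEGER REFERENCES time_dim(time_key),
    item_key INTEGER REFERENCES item(item_key),
    location_key INTEGER REFERENCES location(location_key),
    dollars_sold REAL,
    units_sold INTEGER
);
""")

# Invented rows. Both cities repeat the same province and country values,
# which is the redundancy the note above describes.
conn.executemany("INSERT INTO location VALUES (?, ?, ?, ?, ?)", [
    (1, "1 Main St", "Vancouver", "British Columbia", "Canada"),
    (2, "2 Oak St", "Victoria", "British Columbia", "Canada"),
])
conn.execute("INSERT INTO time_dim VALUES (1, 'Q1', 2024)")
conn.execute("INSERT INTO item VALUES (1, 'Keyboard', 'Acme')")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?, ?)", [
    (1, 1, 1, 500.0, 5),
    (1, 1, 2, 300.0, 3),
])

# A single join from the fact table reaches every location attribute.
totals = conn.execute("""
    SELECT l.province_or_state, SUM(s.dollars_sold)
    FROM sales AS s
    JOIN location AS l ON s.location_key = l.location_key
    GROUP BY l.province_or_state
""").fetchall()

redundant_rows = conn.execute(
    "SELECT COUNT(*) FROM location WHERE province_or_state = 'British Columbia'"
).fetchone()[0]
print(totals, redundant_rows)
```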

Snowflake Schema

• Some dimension tables in the Snowflake schema are normalized.

• The normalization splits up the data into additional tables.

• Unlike the Star schema, the dimension tables in a Snowflake schema are normalized. For
example, the item dimension table of the star schema is normalized and split into two
dimension tables, namely the item table and the supplier table.

• The item dimension table now contains the attributes item_key, item_name, type, brand,
and supplier_key.

• The supplier key is linked to the supplier dimension table. The supplier dimension table
contains the attributes supplier_key and supplier_type.

Note − Due to normalization in the Snowflake schema, redundancy is reduced; the schema therefore
becomes easier to maintain and saves storage space.
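
The item and supplier split described above can be sketched with Python's built-in `sqlite3`
module. The column names follow the attributes listed in the text; the rows are invented for
illustration. The sketch is meant to show two things: the supplier type is stored once rather
than once per item, and reading it back costs an extra join.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Sub-dimension table produced by normalizing the item dimension.
CREATE TABLE supplier (
    supplier_key INTEGER PRIMARY KEY,
    supplier_type TEXT
);
-- The item dimension now carries only a key that points to supplier.
CREATE TABLE item (
    item_key INTEGER PRIMARY KEY,
    item_name TEXT,
    type TEXT,
    brand TEXT,
    supplier_key INTEGER REFERENCES supplier(supplier_key)
);
""")

# Invented rows: two items share one supplier row.
conn.execute("INSERT INTO supplier VALUES (1, 'wholesale')")
conn.executemany("INSERT INTO item VALUES (?, ?, ?, ?, ?)", [
    (1, "Keyboard", "peripheral", "Acme", 1),
    (2, "Mouse", "peripheral", "Acme", 1),
])

# Reading the supplier type for an item needs a join that a
# denormalized star-schema item table would not need.
rows = conn.execute("""
    SELECT i.item_name, s.supplier_type
    FROM item AS i
    JOIN supplier AS s ON i.supplier_key = s.supplier_key
    ORDER BY i.item_key
""").fetchall()

supplier_count = conn.execute("SELECT COUNT(*) FROM supplier").fetchone()[0]
print(rows, supplier_count)
```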

Fact Constellation Schema

• A fact constellation has multiple fact tables. It is also known as a galaxy schema.

• The following diagram shows two fact tables, namely sales and shipping.

• The sales fact table is the same as that in the star schema.

• The shipping fact table has five dimensions, namely item_key, time_key, shipper_key,
from_location, and to_location.

• The shipping fact table also contains two measures, namely dollars cost and units shipped.

• It is also possible to share dimension tables between fact tables. For example, the time, item,
and location dimension tables are shared between the sales and shipping fact tables.
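
A shared dimension between two fact tables can be sketched with Python's built-in `sqlite3`
module. This is a simplified sketch, not the full schema from the diagram: only the item
dimension is created as a table, the other keys are plain integers, the shipping measure names
`dollars_cost` and `units_shipped` are assumptions, and all rows are invented.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimension table shared by both fact tables (item_name is assumed).
CREATE TABLE item (item_key INTEGER PRIMARY KEY, item_name TEXT);
-- First fact table: sales.
CREATE TABLE sales (
    item_key INTEGER REFERENCES item(item_key),
    time_key INTEGER,
    dollars_sold REAL,
    units_sold INTEGER
);
-- Second fact table: shipping, with its own keys and measures.
CREATE TABLE shipping (
    item_key INTEGER REFERENCES item(item_key),
    time_key INTEGER,
    shipper_key INTEGER,
    from_location INTEGER,
    to_location INTEGER,
    dollars_cost REAL,
    units_shipped INTEGER
);
""")

conn.execute("INSERT INTO item VALUES (1, 'Keyboard')")
conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", [
    (1, 1, 500.0, 5),
    (1, 1, 300.0, 3),
])
conn.execute("INSERT INTO shipping VALUES (1, 1, 7, 1, 2, 40.0, 8)")

# Each fact table is aggregated on its own and lined up through the
# shared item dimension, so rows from one fact table are not
# multiplied by rows from the other.
report = conn.execute("""
    SELECT i.item_name,
           (SELECT SUM(s.dollars_sold) FROM sales AS s
             WHERE s.item_key = i.item_key),
           (SELECT SUM(sh.dollars_cost) FROM shipping AS sh
             WHERE sh.item_key = i.item_key)
    FROM item AS i
    ORDER BY i.item_key
""").fetchall()
print(report)
```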

Schema Definition

A multidimensional schema can be defined using the Data Mining Query Language (DMQL). Its two
primitives, cube definition and dimension definition, can be used to define data warehouses and
data marts.

Difference between Star Schema and Snowflake Schema

The Star Schema and Snowflake Schema are two approaches to data warehouse design. In the
Star Schema, a central fact table is connected to dimension tables, forming a star-like structure.
This design is simpler and faster for querying. On the other hand, the Snowflake Schema
normalizes dimension tables into multiple related tables, resembling a snowflake. While it
reduces data redundancy, it can make queries more complex. The Star Schema prioritizes query
speed and simplicity, while the Snowflake Schema focuses on data normalization and storage
efficiency.

Star Schema

Star Schema is a type of multidimensional model used for data warehouses. A star schema consists
of fact tables and dimension tables. It uses fewer foreign-key joins and forms a star structure,
with a central fact table connected to the surrounding dimension tables.

Snowflake Schema

Snowflake Schema is also a type of multidimensional model used for data warehouses. A snowflake
schema consists of fact tables, dimension tables, and sub-dimension tables, which together form
a snowflake structure.

Difference Between Star and Snowflake Schema

Feature | Star Schema | Snowflake Schema
Structure | Central fact table connected to dimension tables | Fact table connected to normalized dimension tables
Data Normalization | Denormalized dimension tables | Normalized dimension tables
Performance | Faster query execution due to fewer joins | Slower query performance due to multiple joins
Design Complexity | Simple and easy to understand | Complex design with multiple levels of relationships
Space Usage | Uses more storage due to denormalization | Uses less storage due to normalization
Data Redundancy | Higher data redundancy | Lower data redundancy
Foreign Keys | Fewer foreign keys | More foreign keys
Use Cases | Best for large datasets and quick ad-hoc queries | Best for structured, predictable queries
Query Complexity | Low query complexity | High query complexity due to multiple joins
Maintainability | Easier to maintain due to simple design | More difficult to maintain due to complexity
Scalability | Scalable but may encounter performance issues with large data volumes | More scalable for very large data sets due to normalization
Suitability for BI Tools | Ideal for BI tools and quick reporting | Better for systems that require detailed reporting and data analysis
Data Integrity | Lower data integrity due to redundancy | Higher data integrity due to normalization
Updates and Modifications | More difficult to update due to denormalization | Easier to update as data is normalized
Learning Curve | Easier to learn and implement | More complex to learn and implement

Choosing Between Star Schema and Snowflake Schema

When selecting between Star Schema and Snowflake Schema, it’s important to align our choice
with our organization’s needs, data characteristics, and performance expectations. Here’s a quick
guide to help us decide:

1. Star Schema

• Best for Simplicity and Speed: If we need a straightforward, easy-to-implement solution
with fast query execution, the Star Schema is ideal. It works well for small to medium
datasets where quick, simple queries are essential.

• Use Case: Perfect for scenarios with fewer dimensions and limited hierarchy levels, such
as sales data warehouses in small businesses. It allows for fast data retrieval with minimal
joins, making it suitable for quick reporting and analytics.

• Storage Considerations: Suitable when redundancy isn’t a significant issue and storage
requirements are manageable.

2. Snowflake Schema

• Best for Flexibility and Data Integrity: If we need to handle large datasets with multiple
levels of hierarchy and a high degree of normalization, the Snowflake Schema offers
greater flexibility. It’s perfect for maintaining data integrity across complex datasets.

• Use Case: Ideal for large organizations dealing with large, normalized datasets or those
with frequent updates, like customer or inventory management systems. It minimizes
redundancy and improves storage efficiency.

• Storage Considerations: Snowflake is more storage-efficient due to its normalized
structure, making it a great choice for scenarios with complex, high-volume data.

Which Schema is Right for You?

• If simplicity and speed are our priorities, the Star Schema is a better fit.

• If we need to handle complex data with frequent updates while minimizing storage, the
Snowflake Schema is more suitable.
