Brief introduction to
Data Warehousing
Eng. Anas M.Mouaddeb
Introduction
A Data Warehouse is built by combining data from multiple diverse
sources to support analytical reporting, structured and
ad hoc queries, and decision making for the organization.
Data Warehousing is a step-by-step approach for constructing and
using a Data Warehouse. Many data scientists get their data in raw
formats from various sources of data and information.
Introduction
For many data scientists and business decision-makers, however,
particularly in big enterprises, the main sources of data and
information are corporate data warehouses. A data warehouse
holds data from multiple sources, including internal databases and
Software as a Service (SaaS) platforms. After the data is loaded, it is
often cleaned, transformed, and checked for quality before it is used
for analytics reporting, data science, machine learning, or other purposes.
What is a Data Warehouse
A Data Warehouse is a collection of software tools that facilitates
analysis of a large set of business data used to help an organization
make decisions. The large amount of data in a data warehouse comes
from numerous sources, such as internal applications like marketing,
sales, and finance; customer-facing apps; and external partner systems,
among others. It is a centralized data repository that analysts can
query whenever required for business benefit. A data warehouse is
mainly a data management system designed to enable and
support business intelligence (BI) activities, particularly analytics. Data
warehouses are meant to support querying, cleaning, manipulating,
transforming, and analysing data, and they also contain large
amounts of historical data.
What is Data Warehousing
The process of creating data warehouses to store a large amount of
data is called Data Warehousing. Data Warehousing helps to
improve the speed and efficiency of accessing different data sets
and makes it easier for company decision-makers to obtain insights
that help the business and support marketing tactics that set
it apart from its competitors.
What is Data Warehousing
Data warehousing can be described as a blend of technologies and
components that aids the strategic use of data and information. The
main goal of data warehousing is to build a rich store of historical
data that can be retrieved and analyzed to provide helpful insight
into the organization’s operations.
Need for Data Warehousing
Data Warehousing is an increasingly essential tool for business
intelligence. It allows organizations to make quality business
decisions. A data warehouse improves data analytics, and it can
also help an organization increase revenue and compete more
strategically in the market. By efficiently providing systematic,
contextual data to the organization's business intelligence tools,
a data warehouse helps it identify more practical business strategies.
Need for Data Warehousing
1. Business User: Business users or customers need a data warehouse to
look at summarized data from the past. Since these people often come
from a non-technical background, the data should be presented
to them in an uncomplicated way.
2. Maintains consistency: Data warehouses are designed to apply a
uniform format to all data collected from different sources, which
makes it easier for company decision-makers to analyse and share
data insights with their colleagues around the globe. Standardizing
the data also reduces the risk of errors in interpretation and
improves overall accuracy.
Need for Data Warehousing
3. Store historical data: Data warehouses are also used to store historical
data, that is, time-variant data from the past, which can
be used for various purposes.
4. Make strategic decisions: Data warehouses contribute to making better
strategic decisions. Some business strategies may depend on the
data stored within the data warehouse.
5. Fast response time: A data warehouse has to be prepared for somewhat
sudden loads and varied types of queries, which demands a high degree of
flexibility and low latency.
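Point 2 above (maintaining consistency) can be illustrated with a small Python sketch. It assumes, purely for illustration, that three source systems report dates in three different formats and that the warehouse standard is ISO 8601 (YYYY-MM-DD). The format list and the function name `to_standard_date` are hypothetical and not taken from any particular tool.

```python
from datetime import datetime

# Hypothetical date formats used by different source systems (assumed for illustration).
SOURCE_DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%Y%m%d")


def to_standard_date(raw: str) -> str:
    """Return the date as an ISO 8601 string, the assumed warehouse format."""
    for fmt in SOURCE_DATE_FORMATS:
        try:
            return datetime.strptime(raw.strip(), fmt).date().isoformat()
        except ValueError:
            continue  # not this format, try the next known one
    raise ValueError(f"Unrecognized date format: {raw!r}")


# The same day as three different sources might report it.
raw_dates = ["2023-03-05", "05/03/2023", "20230305"]
standardized = [to_standard_date(d) for d in raw_dates]
print(standardized)
```

Once every source is converted to the same representation, analysts can compare and share figures without first asking which system a date came from.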
Characteristics of Data Warehouse
Subject Oriented: A data warehouse is subject-oriented
because it delivers information on a particular theme, which
means the data warehousing process is designed to handle a
particular, well-defined theme. These themes can be sales,
distribution, marketing, etc.
Characteristics of Data Warehouse
Time-Variant: The data is maintained over different intervals of
time, such as weekly, monthly, or annually. It establishes various time
limits that are structured between the large datasets and are held in
online transaction processing (OLTP). The time horizon of the data
warehouse is longer than that of operational systems. The data residing
in the data warehouse is associated with a particular interval of time and
delivers information from a historical perspective. It contains an element
of time, either explicitly or implicitly.
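As a minimal sketch of the time-variant property, the Python example below stores every row with an explicit time element and then summarizes the history by period. It uses the standard-library `sqlite3` module as a stand-in for a warehouse. The table name `sales_history`, its columns, and the figures are made up for illustration.

```python
import sqlite3

# An in-memory SQLite database stands in for the warehouse (assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales_history (period TEXT, region TEXT, revenue REAL)")

# Every row carries the period it describes, so history is never overwritten.
rows = [
    ("2023-01", "North", 120.0),
    ("2023-01", "South", 80.0),
    ("2023-02", "North", 150.0),
    ("2023-02", "South", 95.0),
]
conn.executemany("INSERT INTO sales_history VALUES (?, ?, ?)", rows)

# Summarize the stored history month by month.
monthly_totals = conn.execute(
    "SELECT period, SUM(revenue) FROM sales_history GROUP BY period ORDER BY period"
).fetchall()
print(monthly_totals)
```

Because the period is part of each record, the same table can answer weekly, monthly, or yearly questions by changing only the grouping.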
Characteristics of Data Warehouse
Non-volatile: The data residing in the data warehouse is permanent, as
the name implies. This means that data in the data warehouse
is not erased or deleted when new data is inserted into it. In the
data warehouse, data is read-only and is refreshed only at particular
intervals. Operations such as delete, update, and insert that are performed
in a software application are not available in the data warehouse
environment. Only two types of data operations can be done in the
data warehouse:
• Data Loading
• Data Access
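The two operations listed above can be sketched as a toy Python class that exposes only loading and access. The class name `AppendOnlyStore` and its methods are hypothetical. Real warehouses enforce non-volatility through their load processes and permissions rather than through a class like this.

```python
from typing import Callable, Dict, List


class AppendOnlyStore:
    """A toy store that supports only the two warehouse operations: loading and access."""

    def __init__(self) -> None:
        self._rows: List[Dict] = []

    def load(self, batch: List[Dict]) -> int:
        """Data loading: append a batch of rows; existing rows are never changed."""
        self._rows.extend(dict(row) for row in batch)
        return len(self._rows)

    def query(self, predicate: Callable[[Dict], bool]) -> List[Dict]:
        """Data access: return copies of matching rows so callers cannot alter stored data."""
        return [dict(row) for row in self._rows if predicate(row)]


store = AppendOnlyStore()
store.load([{"order_id": 1, "amount": 40}, {"order_id": 2, "amount": 75}])
store.load([{"order_id": 3, "amount": 20}])  # a later periodic refresh

large_orders = store.query(lambda row: row["amount"] >= 40)
large_orders[0]["amount"] = 0  # changes only the returned copy, not the store
print(store.query(lambda row: row["order_id"] == 1))
```

There is deliberately no update or delete method: new data arrives through `load`, and everything else is read-only access.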
Characteristics of Data Warehouse
Integrated: A data warehouse is created by integrating data from
numerous different sources, such as mainframe computers
and relational databases. It must also have consistent
naming conventions, formats, and codes. Integrating the data
in this way supports effective analysis. Consistency
in naming conventions, column scaling, encoding structure, etc.
needs to be ensured. Integration also spans the
various subject-oriented warehouses.
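A short Python sketch of what consistent naming conventions and codes can mean in practice follows. The two source systems, their column names, and the code mappings are invented for illustration. The point is only that records describing the same kind of fact end up with the same names and encodings.

```python
# Hypothetical records from two sources that name and encode the same facts differently.
crm_rows = [{"cust_id": 7, "gender": "F", "country": "DE"}]
billing_rows = [{"CustomerNo": 9, "sex": 1, "country_name": "Germany"}]

# Assumed mappings from each source's conventions to the warehouse's conventions.
CRM_COLUMNS = {"cust_id": "customer_id", "gender": "gender", "country": "country"}
BILLING_COLUMNS = {"CustomerNo": "customer_id", "sex": "gender", "country_name": "country"}
GENDER_CODES = {"F": "female", "M": "male", 0: "female", 1: "male"}
COUNTRY_CODES = {"DE": "Germany", "Germany": "Germany"}


def integrate(row, column_map):
    """Rename columns and translate codes into the warehouse's single convention."""
    renamed = {column_map[name]: value for name, value in row.items()}
    renamed["gender"] = GENDER_CODES[renamed["gender"]]
    renamed["country"] = COUNTRY_CODES[renamed["country"]]
    return renamed


integrated = [integrate(r, CRM_COLUMNS) for r in crm_rows]
integrated += [integrate(r, BILLING_COLUMNS) for r in billing_rows]
print(integrated)
```

After this step both records share the same column names and the same value encodings, so a single query can cover customers from either source.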
Architecture & Components of Data Warehouse
The architecture of a data warehouse mainly consists of the
proper arrangement of its elements, so that an efficient data
warehouse can be built from software and hardware components. The
elements and components may vary with the organization’s
requirements and circumstances.
Data Warehousing Life Cycle
As noted earlier, a data warehouse is built by combining data from
multiple diverse sources, together with the tools that support analytical
reporting, structured and ad hoc queries, and decision making
for the organization. Building and successfully implementing a Data
Warehouse requires a step-by-step approach.
How does a Data Warehouse work?
A Data Warehouse is like a central repository into which data arrives
from different data sources. The data flows in from transactional
systems and relational databases. A data warehouse periodically pulls
data from various apps and systems; the data then goes through
processing and formatting that puts it in a format matching the data
already in the warehouse. The processed data is stored in the data
warehouse, ready for further analysis and decision making. The data
formatting and processing depend on the needs of the organization.
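The flow described above (pull data from sources, process and format it, store it) can be sketched in Python as three small functions. This is an illustrative sketch under stated assumptions, not a production pipeline. The source records, the `orders` table, and the function names `extract`, `transform`, and `load` are all hypothetical, and an in-memory SQLite database stands in for the warehouse.

```python
import sqlite3


def extract():
    """Pull rows from two hypothetical source systems (hard-coded here)."""
    web_orders = [{"id": "W-1", "total": "19.90", "currency": "usd"}]
    store_orders = [
        {"id": "S-1", "total": "5", "currency": "USD"},
        {"id": "S-2", "total": "", "currency": "USD"},  # missing amount
    ]
    return web_orders + store_orders


def transform(rows):
    """Clean and format rows so they match the warehouse table layout."""
    cleaned = []
    for row in rows:
        if not row["total"]:
            continue  # basic quality check: skip rows without an amount
        cleaned.append((row["id"], float(row["total"]), row["currency"].upper()))
    return cleaned


def load(conn, rows):
    """Store the processed rows in the warehouse table."""
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE orders (order_id TEXT, total REAL, currency TEXT)")
load(warehouse, transform(extract()))

loaded = warehouse.execute(
    "SELECT order_id, total, currency FROM orders ORDER BY order_id"
).fetchall()
print(loaded)
```

Which checks and conversions belong in `transform` depends on the organization, which is the point made in the last sentence above.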
How does a Data Warehouse work?
The data could be in one of the following formats:
1. Structured
2. Semi-structured
3. Unstructured
The data is processed and transformed so that users and analysts can
access it in the Data Warehouse through Business Intelligence tools,
SQL clients, and spreadsheets. A data warehouse merges information
coming from various sources into one comprehensive database. With all
of this information in one place, it becomes easier for an organization
to analyse its customers more comprehensively.
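To make the idea of analysing customers from one merged database concrete, here is a small sketch using Python's standard `sqlite3` module in the role of a SQL client. The three tables, their suggested origins (a CRM, a sales system, a support tool), and all values are assumptions made for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER, name TEXT);     -- e.g. from a CRM
    CREATE TABLE purchases (customer_id INTEGER, amount REAL);   -- e.g. from a sales system
    CREATE TABLE tickets   (customer_id INTEGER, subject TEXT);  -- e.g. from a support tool
    INSERT INTO customers VALUES (1, 'Acme'), (2, 'Globex');
    INSERT INTO purchases VALUES (1, 100.0), (1, 50.0), (2, 30.0);
    INSERT INTO tickets   VALUES (1, 'Late delivery');
""")

# One query combines facts that originally lived in three separate systems.
customer_view = conn.execute("""
    SELECT c.name,
           (SELECT COALESCE(SUM(p.amount), 0) FROM purchases p
             WHERE p.customer_id = c.customer_id) AS total_spent,
           (SELECT COUNT(*) FROM tickets t
             WHERE t.customer_id = c.customer_id) AS ticket_count
    FROM customers c
    ORDER BY c.name
""").fetchall()
print(customer_view)
```

Without a shared repository, producing this view would mean exporting from each system and matching the records by hand.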
Latest Tools and Technologies for Data Warehousing
Data warehousing has improved access to information,
reduced query-response time, and allowed businesses to get
deep insights from big data. Earlier, companies had to build
lots of infrastructure for data warehousing. Today, cloud
technology has remarkably reduced the cost and effort of data
warehousing for businesses.
Latest Tools and Technologies for Data Warehousing
The field of data warehousing is evolving rapidly, and various cloud data
warehousing tools and technologies have been developed for better decision making.
Cloud-based data warehousing tools are fast, highly scalable, and available
on a pay-per-use basis. The following are some data warehousing tools:
1. Amazon Redshift
2. Microsoft Azure
3. Google BigQuery
4. Snowflake
5. Micro Focus Vertica
6. Teradata
7. Amazon DynamoDB
8. PostgreSQL
9. Amazon RDS
10. Amazon S3
These are ten widely used data warehousing tools.