Data Warehousing Essentials Guide

1. A data warehouse is a collection of tools and technologies used to combine data from multiple sources to support analysis and decision making. 2. Data is extracted from sources like databases and software platforms, cleaned, transformed, and loaded into the data warehouse for analytics and reporting. 3. A data warehouse centralizes historical data to enable business intelligence activities and queries to generate insights for strategic decisions.


Brief Introduction to Data Warehouse

Eng. Anas M. Mouaddeb


Introduction

• A data warehouse is built by combining data from multiple diverse
sources. It supports analytical reporting, structured and ad hoc
queries, and decision making for the organization.

• Data warehousing is a step-by-step approach to constructing and
using a data warehouse. Many data scientists get their data in raw
formats from various sources.


• For many data scientists, as well as business decision-makers,
particularly in large enterprises, the main source of data and
information is the corporate data warehouse. A data warehouse
holds data from multiple sources, including internal databases and
Software as a Service (SaaS) platforms. After the data is loaded, it is
often cleaned, transformed, and checked for quality before it is used
for analytics, reporting, data science, machine learning, or other purposes.
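
The clean, transform, and quality-check steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the field names (`customer`, `amount`, `country`), the sample rows, and the quality rule are invented for the example and do not come from any particular warehouse product.

```python
# Hypothetical raw rows as they might arrive from a source system.
raw_rows = [
    {"customer": "  Alice ", "amount": "120.50", "country": "us"},
    {"customer": "Bob", "amount": "n/a", "country": "DE"},
    {"customer": "", "amount": "75", "country": "fr"},
]


def clean(row):
    """Trim whitespace and normalise the letter case of the country code."""
    return {
        "customer": row["customer"].strip(),
        "amount": row["amount"].strip(),
        "country": row["country"].strip().upper(),
    }


def transform(row):
    """Convert the amount to a number; None marks an unparseable value."""
    try:
        amount = float(row["amount"])
    except ValueError:
        amount = None
    return {**row, "amount": amount}


def passes_quality_check(row):
    """Assumed rule: a usable row has a customer name and a numeric amount."""
    return bool(row["customer"]) and row["amount"] is not None


prepared = [transform(clean(r)) for r in raw_rows]
usable = [r for r in prepared if passes_quality_check(r)]
rejected = [r for r in prepared if not passes_quality_check(r)]

print(usable)
print(len(rejected), "rows rejected")
```

Keeping the rejected rows in a separate list, rather than silently dropping them, makes it possible to report on data quality later.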


What is a Data Warehouse

• A data warehouse is a collection of software tools that facilitates the
analysis of a large set of business data to help an organization
make decisions. Much of the data in a data warehouse comes
from numerous sources, such as internal applications for marketing,
sales, and finance; customer-facing apps; and external partner systems,
among others. It is a centralized data repository that analysts can
query whenever required. A data warehouse is
mainly a data management system designed to enable and
support business intelligence (BI) activities, particularly analytics. Data
warehouses are meant to support querying, cleaning, manipulating,
transforming, and analysing data, and they also contain large
amounts of historical data.




What is Data Warehousing

• The process of creating data warehouses to store large amounts of
data is called data warehousing. Data warehousing helps
improve the speed and efficiency of accessing different data sets
and makes it easier for company decision-makers to obtain insights
that help the business and support marketing tactics that set
it apart from its competitors.


• Data warehousing can be described as a blend of technologies and
components that aids the strategic use of data and information. Its
main goal is to build a store of historical data that can be retrieved
and analyzed to provide useful insight into the organization’s
operations.


Need of Data Warehousing

• Data warehousing is an increasingly essential tool for business
intelligence. It allows organizations to make quality business
decisions. By improving data analytics, a data warehouse also helps
an organization gain revenue and compete more strategically in the
market. By efficiently providing systematic, contextual data to the
organization's business intelligence tools, a data warehouse helps it
find more practical business strategies.




1. Business users: Business users need a data warehouse to
look at summarized data from the past. Since many of these users come
from a non-technical background, the data should be presented
to them in an uncomplicated way.

2. Maintains consistency: Data warehouses apply a uniform format to
all data collected from different sources, which makes it easier for
company decision-makers to analyse data and share insights with their
colleagues around the globe. Standardizing the data also reduces the
risk of errors in interpretation and improves overall accuracy.


3. Stores historical data: Data warehouses are also used to store historical
data, that is, time-variant data from the past, which can
be used for various purposes.

4. Supports strategic decisions: Data warehouses contribute to making better
strategic decisions. Some business strategies may depend on the
data stored in the data warehouse.

5. Fast response time: A data warehouse has to be prepared for somewhat
sudden loads and varied types of queries, which demands a high degree of
flexibility and low latency.


Characteristics of a Data Warehouse

• Subject-oriented: A data warehouse is subject-oriented
because it delivers information on a particular theme, which
means the data warehousing process is designed to handle a
specific, well-defined theme. These themes are often sales,
distribution, selling, etc.


• Time-variant: Data in the warehouse is maintained over different
intervals of time, such as weekly, monthly, or annually. It spans
various time ranges across large datasets that originate in online
transaction processing (OLTP) systems. The time horizon of a data
warehouse is longer than that of operational systems. Data in the
warehouse is associated with a particular interval of time and provides
information from a historical perspective. Every record contains an
element of time, directly or indirectly.
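
A small Python sketch can make the time-variant idea concrete: every stored row carries a time element, so the warehouse can answer "what was the value at that point in time?". The table layout, the `valid_from` field, and the prices are assumptions made up for this illustration.

```python
from datetime import date

# Hypothetical monthly price snapshots: each row carries its own time element.
price_history = [
    {"product": "widget", "price": 9.99, "valid_from": date(2023, 1, 1)},
    {"product": "widget", "price": 10.49, "valid_from": date(2023, 2, 1)},
    {"product": "widget", "price": 10.99, "valid_from": date(2023, 3, 1)},
]


def price_as_of(history, product, when):
    """Return the most recent price recorded on or before `when`, else None."""
    candidates = [
        row for row in history
        if row["product"] == product and row["valid_from"] <= when
    ]
    if not candidates:
        return None
    return max(candidates, key=lambda row: row["valid_from"])["price"]


print(price_as_of(price_history, "widget", date(2023, 2, 15)))
print(price_as_of(price_history, "widget", date(2022, 12, 31)))
```

An operational system would typically keep only the current price; keeping every snapshot is what allows the historical lookup.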



• Non-volatile: The data residing in the data warehouse is permanent,
as the name implies. This means that data in the data warehouse
is not erased or deleted when new data is inserted into it. In the
data warehouse, data is read-only and is refreshed only at particular
intervals. Operations such as delete, update, and insert that are
performed on data in an operational application are not performed in
the data warehouse environment. Only two types of data operations
are done in the data warehouse:

• Data loading

• Data access
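
The two operations named above can be illustrated with a toy append-only table in Python. This is a sketch under assumptions, not how any real warehouse engine is built: the class name `AppendOnlyTable` and its `load` and `query` methods are hypothetical names chosen for the example.

```python
class AppendOnlyTable:
    """Toy table that supports only data loading and data access."""

    def __init__(self):
        self._rows = []

    def load(self, rows):
        """Data loading: append a batch; existing rows are never changed."""
        self._rows.extend(dict(row) for row in rows)

    def query(self, predicate=lambda row: True):
        """Data access: return copies so callers cannot modify stored rows."""
        return [dict(row) for row in self._rows if predicate(row)]


sales = AppendOnlyTable()
sales.load([{"day": "2024-01-01", "units": 5}])
sales.load([{"day": "2024-01-02", "units": 8}])

result = sales.query(lambda row: row["units"] > 6)
result[0]["units"] = 0  # changing the copy does not touch the table

print(sales.query())
```

Because the class exposes no update or delete method, later loads can only add to the history that is already stored.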

• Integrated: A data warehouse is created by integrating data from
numerous different sources, such as mainframe computers
and relational databases. It should also use consistent
naming conventions, formats, and codes. Integration makes
successful analysis of the data possible. Consistency
in naming conventions, column scaling, encoding structures, etc.
needs to be ensured. An integrated data warehouse also brings together
numerous subject-oriented warehouses.
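
As a rough illustration of integration, the Python sketch below brings two sources with different column names and value encodings into one consistent shape. The source names (`crm`, `billing`), the column mappings, and the gender codes are all assumptions invented for this example.

```python
# Two hypothetical sources that describe customers differently.
crm_rows = [{"cust_id": 1, "gender": "M", "joined": "2023-05-01"}]
billing_rows = [{"CustomerNo": 2, "Sex": "female", "SignupDate": "2023-06-15"}]

# Assumed mappings from each source's column names to warehouse names.
COLUMN_MAP = {
    "crm": {"cust_id": "customer_id", "gender": "gender", "joined": "signup_date"},
    "billing": {
        "CustomerNo": "customer_id",
        "Sex": "gender",
        "SignupDate": "signup_date",
    },
}

# Assumed single encoding for gender values in the warehouse ("U" = unknown).
GENDER_CODES = {"m": "M", "male": "M", "f": "F", "female": "F"}


def integrate(row, source):
    """Rename columns and recode values so both sources look the same."""
    renamed = {COLUMN_MAP[source][key]: value for key, value in row.items()}
    renamed["gender"] = GENDER_CODES.get(str(renamed["gender"]).lower(), "U")
    return renamed


warehouse_rows = (
    [integrate(r, "crm") for r in crm_rows]
    + [integrate(r, "billing") for r in billing_rows]
)
print(warehouse_rows)
```

Keeping the mappings in plain dictionaries means that adding a new source is a matter of adding one more entry rather than changing the function.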


Architecture & Components of Data Warehouse

• The architecture of a data warehouse consists mainly of the
proper arrangement of its elements, so that an efficient data
warehouse can be built from software and hardware components. The
elements and components vary with the organization’s requirements
and circumstances.




Data Warehousing Life Cycle

• As noted earlier, a data warehouse is built by combining data from
multiple diverse sources, together with the tools that support analytical
reporting, structured and ad hoc queries, and decision making
for the organization. Building and successfully implementing a data
warehouse requires a step-by-step approach.




How does a Data Warehouse work?

• A data warehouse is like a central repository that receives data from
different data sources. Data flows into it from transactional systems
and relational databases. The warehouse periodically pulls data from
various apps and systems; the data then goes through processing and
formatting so that it matches the data already in the warehouse. The
processed data is stored in the warehouse, ready for further
analysis and decision making. How the data is formatted and processed
depends on the needs of the organization.
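
The pull, format, and store flow described above can be sketched with Python's built-in `sqlite3` module standing in for the warehouse. The table name `fact_orders`, the source field names, and the cents-to-currency conversion are assumptions made for the illustration; a real warehouse would use its own schema and loading tools.

```python
import sqlite3

# Hypothetical extract from an operational system (amounts stored in cents).
source_orders = [
    {"order_id": 1, "amount_cents": 1999, "ordered": "2024-03-01"},
    {"order_id": 2, "amount_cents": 500, "ordered": "2024-03-02"},
]

# An in-memory SQLite database stands in for the warehouse.
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE fact_orders ("
    "order_id INTEGER PRIMARY KEY, amount REAL, order_date TEXT)"
)


def run_batch(conn, rows):
    """Format rows to match the warehouse table and load them in one transaction."""
    formatted = [
        (r["order_id"], r["amount_cents"] / 100, r["ordered"]) for r in rows
    ]
    with conn:  # commits on success, rolls back if an error is raised
        conn.executemany("INSERT INTO fact_orders VALUES (?, ?, ?)", formatted)


run_batch(warehouse, source_orders)

total = warehouse.execute("SELECT SUM(amount) FROM fact_orders").fetchone()[0]
print(total)
```

In practice `run_batch` would be called on a schedule, once per extract, which corresponds to the periodic pull described in the text.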


• The data can be in one of the following formats:

1. Structured

2. Semi-structured

3. Unstructured

• The data is processed and transformed so that users and analysts can
access it in the data warehouse through business intelligence tools,
SQL clients, and spreadsheets. A data warehouse merges
information coming from various sources into one comprehensive
database. With all of this information in one place, it becomes easier
for an organization to analyse its customers more comprehensively.
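
To show what the three formats listed above can look like in practice, the Python sketch below reads one record of each kind and merges them into a single common shape. The sample inputs, the field names, and the text pattern are all assumptions invented for this example; real unstructured data usually needs far more elaborate processing.

```python
import csv
import io
import json
import re

# Hypothetical inputs, one per format.
structured_csv = "customer,rating\nalice,5\n"
semi_structured_json = '{"customer": "bob", "feedback": {"rating": 3}}'
unstructured_text = "carol wrote: I would rate the service 4 out of 5."


def from_csv(text):
    """Structured: fixed columns, read directly."""
    return [
        {"customer": row["customer"], "rating": int(row["rating"])}
        for row in csv.DictReader(io.StringIO(text))
    ]


def from_json(text):
    """Semi-structured: nested fields have to be flattened."""
    doc = json.loads(text)
    return [{"customer": doc["customer"], "rating": doc["feedback"]["rating"]}]


def from_text(text):
    """Unstructured: a pattern (assumed wording) pulls out the fields."""
    match = re.search(r"^(\w+) wrote:.*?rate the service (\d) out of 5", text)
    if match is None:
        return []
    return [{"customer": match.group(1), "rating": int(match.group(2))}]


rows = (
    from_csv(structured_csv)
    + from_json(semi_structured_json)
    + from_text(unstructured_text)
)
print(rows)
```

Once every source is reduced to the same two fields, the rows can be loaded into one table and analysed together.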

Latest Tools and Technologies for Data Warehousing

• Data warehousing has improved access to information,
reduced query response time, and allowed businesses to get
deep insights from very large data sets. Earlier, companies had to build
a lot of infrastructure for data warehousing. Today, cloud
technology has remarkably reduced the cost and effort of data
warehousing for businesses.


• The field of data warehousing is evolving rapidly, and various cloud data
warehousing tools and technologies have been developed for better decision
making. Cloud-based data warehousing tools are fast, highly scalable, and
available on a pay-per-use basis. The following are some tools used in data
warehousing:

1. Amazon Redshift
2. Microsoft Azure
3. Google BigQuery
4. Snowflake
5. Micro Focus Vertica
6. Teradata
7. Amazon DynamoDB
8. PostgreSQL
9. Amazon RDS
10. Amazon S3

• These ten tools are commonly listed for data warehousing, although some of
them (Amazon DynamoDB, PostgreSQL, Amazon RDS, and Amazon S3) are
general-purpose databases or storage services rather than dedicated data
warehouses.
