0% found this document useful (0 votes)

12 views5 pages

DWM Questions

The document is a question bank for a Data Warehousing and Mining course at Bharati Vidyapeeth College of Engineering, Navi Mumbai. It includes questions for unit tests, assignments, and module-wise topics covering various aspects of data warehousing, data mining, and related concepts. Key topics include schema construction, OLTP vs OLAP, ETL processes, data preprocessing, clustering, and market basket analysis.

Uploaded by

delta1504120229

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views5 pages

DWM Questions

Uploaded by

delta1504120229

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

BHARATI VIDYAPEETH COLLEGE OF ENGINEERING,

NAVI MUMBAI

Department of Computer Engineering

CLASS -TE SEM-V

Subject : Data Warehousing and Mining
UT-I Question Bank
1. Costruction of Snowflake and Star Schema
Given a problem statement, construct both a Star Schema and a Snowflake Schema with
appropriate dimension tables and a fact table.

2. Explain the differences between OLTP and OLAP.

Explain the key differences between Online Transaction Processing (OLTP) and Online
Analytical Processing (OLAP) with suitable examples.

3. Explain various ETL operations.

Discuss the Extract, Transform, Load (ETL) process in detail, and explain its role in data
warehousing.

4. Explain the architecture of a data warehouse.

Describe the typical architecture of a data warehouse and its main components.

5. Discuss various issues and applications of Data Mining.

Elaborate on the major issues in data mining and highlight its real-world applications.

6. Short note on data preprocessing and phases of data cleaning.(Handling missing

data noisy data)
Describe the steps involved in data preprocessing and explain different phases of data cleaning.

7. Dimensionality reduction and data discretization.

Discuss the methods used for dimensionality reduction and explain the concept of data
discretization.

8. Numerical problem based on Decision Tree.

Solve a numerical problem using a given dataset to construct a Decision Tree.

9. Short note on classification and clustering accuracy.

Briefly explain how the accuracy of classification and clustering algorithms is measured and
compared.

10. Write a short note on data pruning.

Define data pruning and explain its role in decision tree construction and model optimization.
BHARATI VIDYAPEETH COLLEGE OF ENGINEERING,
NAVI MUMBAI

Department of Computer Engineering

CLASS -TE SEM-V

Subject : Data Warehousing and Mining

Assignment 01

1. What are the basic building blocks of data warehouse?

2. Compare OLTP and OLAP.
3. Differentiate between star schema and snowflake schema. Design star schema
4. Differentiate between top down and bottom-up approaches for building data warehouse.
5. Discuss the data visualization technique.
6. Explain issues in data mining.
7. Explain data pre-processing.
8. Explain the steps involved in data mining when viewed as a process of knowledge
discovery.
9. Explain decision tree-based classification approach with example. Discuss
metrics for evaluating classifier performance.

Assignment 02

1. What are the different types of data handled in cluster analysis? Give examples.
2. Explain agglomerative hierarchical clustering with an example dendrogram.
3. What is market basket analysis? Give one real-world application.
4. Explain the concept of an association rule with an example and define support and
confidence.
Describe the steps of the Apriori algorithm for frequent itemset generation.
5. Compare web content mining, web structure mining, and web usage mining in tabular
form.
BHARATI VIDYAPEETH COLLEGE OF ENGINEERING,
NAVI MUMBAI
Department of Computer Engineering

CLASS -TE SEM-V

Subject : Data Warehousing and Mining

Module wise Question Bank

Module-1
1. Define a data warehouse and explain its key characteristics.
2. Draw and explain a typical data warehouse architecture.
3. Differentiate between a data warehouse and a data mart with examples.
4. Compare E-R modeling and dimensional modeling in the context of data
warehousing.
5. What is an information package diagram? Explain its use in dimensional modeling.
6. Differentiate between star schema, snowflake schema, factless fact table, and fact
constellation schema with neat diagrams.
7. What is meant by updating dimension tables? Explain slowly changing dimensions
(SCD) with types.
8. List and briefly describe the major steps in the ETL process.
9. Compare OLTP and OLAP systems in terms of purpose, design, and usage.
10. Explain slice, dice, roll-up, drill-down, and pivot operations in OLAP with
examples.

Module-2
1. What are data mining task primitives? Give examples for each type.
2. Draw and explain the architecture of a data mining system.
3. List and explain the main steps of the KDD (Knowledge Discovery in Databases)
process.
4. What are the major issues in data mining? Explain any four in detail.
5. Give at least five applications of data mining in different domains.
6. List the types of attributes in data mining and give one example of each.
7. Explain statistical description of data using mean, median, mode, variance, and
histogram.
8. Describe at least three data visualization techniques used in data mining.
9. Explain the steps of data preprocessing, including cleaning, integration,
transformation, reduction, and discretization.
10. What is concept hierarchy generation? Explain its role in data discretization.
Module3:-

1. Define classification in data mining and give two real-life applications.

2. Explain the basic concepts of decision tree induction.
3. Draw and explain the working of a decision tree using a small dataset example.
4. Describe the Naïve Bayesian classification algorithm with an example.
5. What are accuracy and error measures in classification? Explain any two.
6. Explain the holdout method for evaluating the accuracy of a classifier.
7. What is random subsampling? How does it differ from the holdout method?
8. Describe the process of k-fold cross-validation with an example.
9. Explain the bootstrap method for classifier evaluation.
10. Compare cross-validation and bootstrap in terms of advantages and limitations

Module-4
1. What are the different types of data handled in cluster analysis? Give examples.
2. Explain the Euclidean distance and Manhattan distance measures used in clustering.
3. Describe the k-means clustering algorithm with steps.
4. What are the limitations of the k-means algorithm?
5. Explain the k-medoids clustering method and compare it with k-means.
6. Differentiate between partitional and hierarchical clustering methods.
7. Explain agglomerative hierarchical clustering with an example dendrogram.
8. Explain divisive hierarchical clustering and compare it with agglomerative.
9. What is the role of a proximity (similarity/dissimilarity) matrix in hierarchical
clustering?
10. Compare k-means, k-medoids, agglomerative, and divisive methods in a tabular
format

Module-5

1. What is market basket analysis? Give one real-world application.

2. Define frequent itemset and closed itemset with examples.
3. Explain the concept of an association rule with an example and define support and
confidence.
4. Describe the steps of the Apriori algorithm for frequent itemset generation.
5. How are association rules generated from frequent itemsets?
6. List and explain at least three techniques for improving the efficiency of Apriori.
7. Explain the concept of frequent pattern mining without candidate generation (FP-
Growth method).
8. What are multilevel association rules? Give an example.
9. Explain multidimensional association rules with a suitable example.
10. Compare Apriori and FP-Growth in terms of working and efficiency.
Module-6
1. Define web mining and list its three main categories.
2. What is web content mining? Give one real-life application.
3. Explain the role of web crawlers in content mining.
4. What is a harvest system in web mining?
5. Describe the concept of a virtual web view and its importance.
6. What is personalization in the context of web content mining? Give an example.
7. Explain web structure mining and the working of the PageRank algorithm.
8. Describe the CLEVER algorithm and how it differs from PageRank.
9. What is web usage mining? List any two techniques used for it.
10. Compare web content mining, web structure mining, and web usage mining in
tabular form.

Refer University question papers for numericals

Data Warehousing and Data Mining Important Question
No ratings yet
Data Warehousing and Data Mining Important Question
7 pages
DWM NOTES
No ratings yet
DWM NOTES
118 pages
Brocade Fabric OS v9.2.1 Release Notes
No ratings yet
Brocade Fabric OS v9.2.1 Release Notes
54 pages
Data Warehousing and Data Mining Unit - I Data Warehousing, Business Analysis and On-Line Analytical Processing (Olap) PART A (2 Marks)
No ratings yet
Data Warehousing and Data Mining Unit - I Data Warehousing, Business Analysis and On-Line Analytical Processing (Olap) PART A (2 Marks)
5 pages
R23!3!1 DWDM Final Syllabus On 21-06-2025
No ratings yet
R23!3!1 DWDM Final Syllabus On 21-06-2025
5 pages
Security Firms Directory
No ratings yet
Security Firms Directory
17 pages
PG - M.sc. - Computer Science - 34141 Data Mining and Ware Housing
No ratings yet
PG - M.sc. - Computer Science - 34141 Data Mining and Ware Housing
192 pages
Associate Cloud Engineer
No ratings yet
Associate Cloud Engineer
6 pages
Data Mining - GDi Techno Solutions
No ratings yet
Data Mining - GDi Techno Solutions
145 pages
DWDM Unitwise Qns
No ratings yet
DWDM Unitwise Qns
3 pages
DWM Questions
No ratings yet
DWM Questions
5 pages
DWM PYQs
No ratings yet
DWM PYQs
7 pages
Data Mining
No ratings yet
Data Mining
7 pages
QB Data Mining
No ratings yet
QB Data Mining
5 pages
Data Warehousing & Mining Course
No ratings yet
Data Warehousing & Mining Course
2 pages
PG DataMiningR Practicals
No ratings yet
PG DataMiningR Practicals
2 pages
DWDM QB
No ratings yet
DWDM QB
6 pages
SKP Engineering College: A Course Material On
No ratings yet
SKP Engineering College: A Course Material On
212 pages
DMBI-Viva Sample Questions
No ratings yet
DMBI-Viva Sample Questions
2 pages
DWDM Unitwise Questions
No ratings yet
DWDM Unitwise Questions
3 pages
QUESTION BANK FOR DM & W (3rd Sem) 2023-2024
No ratings yet
QUESTION BANK FOR DM & W (3rd Sem) 2023-2024
2 pages
Important Questions From All Units
No ratings yet
Important Questions From All Units
3 pages
Data Warehouse and Mining
No ratings yet
Data Warehouse and Mining
4 pages
Data Preprocessing in Data Mining
No ratings yet
Data Preprocessing in Data Mining
105 pages
Data Mining and Warehousing
No ratings yet
Data Mining and Warehousing
7 pages
CSE Data Warehousing Q&A Guide
No ratings yet
CSE Data Warehousing Q&A Guide
3 pages
Software
No ratings yet
Software
93 pages
Data Mining & Warehouse Q&A
No ratings yet
Data Mining & Warehouse Q&A
4 pages
Data Warehousing & Mining Course
No ratings yet
Data Warehousing & Mining Course
2 pages
DWDM
No ratings yet
DWDM
2 pages
DWDM Questions Bank (BCS058)
No ratings yet
DWDM Questions Bank (BCS058)
9 pages
Question Bank SY
No ratings yet
Question Bank SY
3 pages
DMBI All Pyqs
No ratings yet
DMBI All Pyqs
4 pages
DataWarehousing DataMining Question Bank
No ratings yet
DataWarehousing DataMining Question Bank
3 pages
SEM 5 - Comps, IOT, CYBER, CS - Data Warehousing & Mining - 2024 MAY To 2022 DEC PYQ - Aeraxia - in
No ratings yet
SEM 5 - Comps, IOT, CYBER, CS - Data Warehousing & Mining - 2024 MAY To 2022 DEC PYQ - Aeraxia - in
10 pages
DWM QB Cyse
No ratings yet
DWM QB Cyse
8 pages
Question Bank DWM 2022-23 Vii Semester B.E. Cse
No ratings yet
Question Bank DWM 2022-23 Vii Semester B.E. Cse
3 pages
Data Warehousing and Data Mining
No ratings yet
Data Warehousing and Data Mining
2 pages
Data Mining & BI for Engineering Students
No ratings yet
Data Mining & BI for Engineering Students
3 pages
Book Exercises NayelliAnswers
No ratings yet
Book Exercises NayelliAnswers
3 pages
Mining 2720209
No ratings yet
Mining 2720209
3 pages
DM Important Questions
100% (1)
DM Important Questions
2 pages
Lesson Plan: Unit Topic Books For Reference No. of Hours Required Teaching Methodology
No ratings yet
Lesson Plan: Unit Topic Books For Reference No. of Hours Required Teaching Methodology
6 pages
Data Mining & Database Systems Guide
No ratings yet
Data Mining & Database Systems Guide
6 pages
Consolidated Cse Question Bank1
No ratings yet
Consolidated Cse Question Bank1
170 pages
CSE602 - Data Warehousing & Data Mining
No ratings yet
CSE602 - Data Warehousing & Data Mining
6 pages
Data Warehousing and Data Minining Answer Key - Anna University (16M & 2M With Answers)
No ratings yet
Data Warehousing and Data Minining Answer Key - Anna University (16M & 2M With Answers)
139 pages
Gandhinagar Institute of Technology: Computer Engineer Ing Department Question Bank
No ratings yet
Gandhinagar Institute of Technology: Computer Engineer Ing Department Question Bank
3 pages
DMDW Lab Oral Question Bank
No ratings yet
DMDW Lab Oral Question Bank
4 pages
CEUC502 - DMBI - Question - Bank
No ratings yet
CEUC502 - DMBI - Question - Bank
12 pages
Matsonic MS8127C
No ratings yet
Matsonic MS8127C
80 pages
Data Warehousing & Mining Guide
No ratings yet
Data Warehousing & Mining Guide
3 pages
Data Mining Syllabus and Question
No ratings yet
Data Mining Syllabus and Question
6 pages
Multimedia Systems Overview
No ratings yet
Multimedia Systems Overview
2 pages
Data Mining & Warehousing Guide
No ratings yet
Data Mining & Warehousing Guide
17 pages
Data Mining Unitwise Imp Questions
No ratings yet
Data Mining Unitwise Imp Questions
3 pages
A Linguagem Da Paz Num Mundo de Conflitos
No ratings yet
A Linguagem Da Paz Num Mundo de Conflitos
181 pages
Internet and Web Technologies - Notes X2023 April-1
No ratings yet
Internet and Web Technologies - Notes X2023 April-1
90 pages
E Cat Jobs
No ratings yet
E Cat Jobs
3 pages
Data Warehousing Exam Prep
No ratings yet
Data Warehousing Exam Prep
59 pages
CS-DM Module - 1
No ratings yet
CS-DM Module - 1
27 pages
Data Warehouse Scheme and Syllabus
No ratings yet
Data Warehouse Scheme and Syllabus
2 pages
ARIS PPM System Architecture
100% (1)
ARIS PPM System Architecture
84 pages
Data Warehousing and Data Mining - Handbook
0% (2)
Data Warehousing and Data Mining - Handbook
27 pages
Fiori Front Server 4.0 Implementation Guide
No ratings yet
Fiori Front Server 4.0 Implementation Guide
20 pages
IECEx Certificate for Honeywell Devices
No ratings yet
IECEx Certificate for Honeywell Devices
17 pages
Mathematics
No ratings yet
Mathematics
2 pages
Cs 2032 Data Warehousing and Data Mining Question Bank by Gopi
No ratings yet
Cs 2032 Data Warehousing and Data Mining Question Bank by Gopi
6 pages
Biodata Etrio Widodo
No ratings yet
Biodata Etrio Widodo
3 pages
Indoor Video Intercom for Installers
No ratings yet
Indoor Video Intercom for Installers
3 pages
Osp-P300/P300A Osp-P200/P200A: Okuma Mtconnect Adapter Software
No ratings yet
Osp-P300/P300A Osp-P200/P200A: Okuma Mtconnect Adapter Software
21 pages
SRS Exp3
No ratings yet
SRS Exp3
23 pages
SE Assign Front Page
No ratings yet
SE Assign Front Page
1 page
Software Engineer Resume
No ratings yet
Software Engineer Resume
3 pages
CN Coverpage
No ratings yet
CN Coverpage
1 page
Chatbot (Api)
No ratings yet
Chatbot (Api)
17 pages
ChatBot (Groq)
No ratings yet
ChatBot (Groq)
10 pages
Full Stack Java (Springboot)
No ratings yet
Full Stack Java (Springboot)
6 pages
Module 4
No ratings yet
Module 4
15 pages
5.b SWP
No ratings yet
5.b SWP
3 pages
Exp2 DWM
No ratings yet
Exp2 DWM
7 pages
Exp4 CN
No ratings yet
Exp4 CN
4 pages
Informatica: The Powercenter/Powermart
No ratings yet
Informatica: The Powercenter/Powermart
3 pages
Solarmax Manual 2015
No ratings yet
Solarmax Manual 2015
24 pages
Software Reuse for Developers
No ratings yet
Software Reuse for Developers
9 pages
Minecraft Launcher Debug Log
No ratings yet
Minecraft Launcher Debug Log
14 pages
Emo Aesthetic Computer Wallpapers
No ratings yet
Emo Aesthetic Computer Wallpapers
1 page
5.1 Using Network Configuration Tools: Unit V:Networking and TCP/IP
No ratings yet
5.1 Using Network Configuration Tools: Unit V:Networking and TCP/IP
20 pages
Invoice - Bitrefill
No ratings yet
Invoice - Bitrefill
2 pages
Hospital IT System Overview
No ratings yet
Hospital IT System Overview
31 pages
SDR and NFV Extensions in The Ns-3 LTE Module For 5G Rapid Prototyping
No ratings yet
SDR and NFV Extensions in The Ns-3 LTE Module For 5G Rapid Prototyping
6 pages
Supply Chain Flowchart
No ratings yet
Supply Chain Flowchart
8 pages
Silverland Oil Piracy Script Role Play
No ratings yet
Silverland Oil Piracy Script Role Play
8 pages
Salinan Dari Copy of Genshin Impact Materials Tracker (By Oble)
No ratings yet
Salinan Dari Copy of Genshin Impact Materials Tracker (By Oble)
242 pages
Latitude 5490: Owner's Manual
No ratings yet
Latitude 5490: Owner's Manual
89 pages

DWM Questions

Uploaded by

DWM Questions

Uploaded by

BHARATI VIDYAPEETH COLLEGE OF ENGINEERING,

Department of Computer Engineering

CLASS -TE SEM-V

2. Explain the differences between OLTP and OLAP.

3. Explain various ETL operations.

4. Explain the architecture of a data warehouse.

5. Discuss various issues and applications of Data Mining.

6. Short note on data preprocessing and phases of data cleaning.(Handling missing

7. Dimensionality reduction and data discretization.

8. Numerical problem based on Decision Tree.

9. Short note on classification and clustering accuracy.

10. Write a short note on data pruning.

Department of Computer Engineering

CLASS -TE SEM-V

Subject : Data Warehousing and Mining

1. What are the basic building blocks of data warehouse?

CLASS -TE SEM-V

Subject : Data Warehousing and Mining

1. Define classification in data mining and give two real-life applications.

1. What is market basket analysis? Give one real-world application.

Refer University question papers for numericals

You might also like