The document is an examination booklet for a B.C.A. (Fourth Semester) course on Data Warehousing and Data Mining, containing instructions for candidates and a series of questions related to the subject. It covers various topics such as data warehouse design, data mining techniques, and data analysis processes. The maximum marks for the examination are 75, and candidates are instructed on how to handle the question booklet and answer sheets.
Serial No. of Booklet : 1097382
Code No. : 1297
B.C.A. (Fourth Semester) Examination, 2023-24
AFFIX PRESCRIBED RUBBER STAMP
Roll No.
Paper Third
DATA WAREHOUSING AND DATA MINING
Course Code—BCA-403T (Major)
In Figures :
In Words :
Signature of Invigilator
Max. Marks : 75
Important Instructions :
1. The candidate will write his/her Roll Number only at the places provided for, i.e., on the cover page and on the OMR answer sheet at the end and nowhere else.
2. Immediately on receipt of the question booklet, the candidate should check the booklet and ensure that it contains all the pages and that no question is missing. If the candidate finds any discrepancy in the question booklet, he/she should report it to the invigilator within 10 minutes of the issue of this booklet and obtain a fresh question booklet without any discrepancy. No second question booklet shall be given to a candidate under any circumstances after 10 minutes. The candidate should be careful in handling the question booklet and in filling the OMR answer sheet given separately with this booklet.
(Contd. on the last page)
1. What is a Data Warehouse primarily designed for ?
(A) Real-time transaction processing
(B) Historical data analysis and reporting
(C) Online gaming
(D) Social media interaction
2. What are the components of a Data Warehouse ?
(A) Web servers, databases and applications
(B) Input devices, processing units and output devices
(C) Tables, queries and forms
(D) Data sources, ETL processes [illegible]
3. What is the process of building a Data Warehouse ?
(A) Data Mining
(B) Data Analysis
(C) [illegible]
(D) Data Visualization
4. What is the difference between a Database System and a Data Warehouse ?
(A) A database is used to store and [illegible]
(B) A database is used for data analytics, while a data warehouse is used to store and capture data.
(C) A database is used for data analytics, while data warehouses are designed for transactional data storage.
(D) None of the above
5. What is a multi-dimensional data model in Data Warehousing ?
(A) A model that stores data in a single dimension
(B) A model that stores data in two dimensions
(C) [illegible]
(D) A model that stores data in a non-relational format
6. What are data cubes, stars, snowflakes and fact constellations in Data Warehousing ?
(A) [illegible]
(B) Types of data storage devices
(C) Data encryption techniques
(D) Data visualization tools
7. What is the first step in the Data Warehouse process and technology ?
(A) Data Analysis
(B) [illegible]
(C) Data Mining
(D) Data Visualization
8. What is the strategy behind Data Warehousing ?
(A) Storing data in a flat file format
(B) Deleting historical data
(C) Performing real-time data analysis
(D) [illegible] single repository
9. What are the management and support processes involved in Data Warehousing ?
(A) Hardware maintenance
(B) Data backup and recovery
(C) Network security
(D) Web development
10. What is involved in the planning and implementation of a Data Warehouse ?
(A) Setting up social media accounts
(B) Creating data visualizations
(C) Designing data models and schemas
(D) Building mobile applications
11. .......... is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management decisions.
(A) Data Mining
(B) Data Warehousing
(C) Web Mining
(D) Text Mining
12. What is the primary purpose of a Data Warehouse ?
(A) [illegible]
(B) Real-time transaction processing
(C) Online gaming
(D) Social media interaction
13. The data is stored, retrieved and updated in ..........
(A) OLAP
(B) [illegible]
(C) SMTP
(D) FTP
14. What is the process of selecting only relevant data from a larger dataset called ?
(A) Sampling
(B) [illegible]
(C) Data reduction
(D) Data integration
15. Which of the following is not a commonly used data warehouse architecture ?
(A) Inmon
(B) Kimball
(C) Snowflake
(D) [illegible]
16. Which of the following is not a common data mining application ?
(A) Fraud detection
(B) Customer segmentation
(C) Sales forecasting
(D) Inventory management
17. The star schema is composed of .......... fact table.
(A) one
(B) two
(C) three
(D) four
18. The modern CASE tools belong to .......... category.
(A) [illegible]
(B) development
(C) coding
(D) delivery
19. Data scrubbing is :
(A) a process to reject data from the data warehouse and to create the necessary indexes.
(B) a process to load the data in the data warehouse and to create the necessary indexes.
(C) a process to upgrade the quality of data after it is moved into a data warehouse.
(D) [illegible]
20. What is the primary goal of data mining ?
(A) Real-time data processing
(B) Storing historical data for reporting
(C) Conducting online transactions
(D) [illegible]
21. What is the main motivation behind data mining ?
(A) Generating random data
(B) [illegible]
(C) Data visualization
(D) Network security
22. What is data pre-processing in the context of data mining ?
(A) [illegible]
(B) Encrypting data for security purposes
(C) Storing data in a database
(D) Analyzing data patterns
23. How is noisy data typically addressed in data cleaning ?
(A) By using clustering techniques
(B) Through human inspection
(C) By ignoring it
(D) None of the above
24. What does data reduction involve in data mining ?
(A) Increasing the volume of data for analysis
(B) [illegible]
(C) Randomly sampling data points
(D) Ignoring data outliers
25. What technique is used for data cube aggregation ?
(A) Clustering
(B) Regression analysis
(C) Dimensionality reduction
(D) [illegible]
26. What is dimensionality reduction in data mining ?
(A) Increasing the number of dimensions in a dataset
(B) Decreasing the number of dimensions in a dataset
(C) Storing data in a hierarchical structure
(D) Grouping similar data points together
27. What is data compression in data mining ?
(A) Reducing the size of the dataset
(B) Increasing the size of the dataset for better analysis
(C) Encrypting data for security purposes
(D) Storing data in a compressed format
28. Which technique is used for numerosity reduction ?
(A) Clustering
(B) Regression analysis
(C) [illegible]
(D) Random sampling
29. What is concept hierarchy generation in data mining ?
(A) [illegible]
(B) Removing irrelevant data from the dataset
(C) Applying statistical techniques to analyze data patterns
(D) Predicting future trends based on historical data
30. What is one form of data cleaning technique used for handling inconsistent data ?
(A) Clustering
(B) Regression analysis
(C) Computer inspection
(D) Random sampling
31. What is the main objective of data mining ?
(A) Real-time data processing
(B) [illegible] meaningful patterns
(C) Storing historical data for reporting
(D) Conducting online transactions
32. What drives the motivation behind data mining ?
(A) [illegible]
(B) Data visualization
(C) Network security
(D) Predictive modeling
33. To represent any n-Dimension data we need a series of .......... Dimension cubes.
(A) [illegible]
(B) n
(C) [illegible]
(D) n+2
34. Which technique is commonly used to handle missing values in data cleaning ?
(A) Clustering
(B) Regression analysis
(C) [illegible]
(D) Computer inspection
35. What is data integration in data mining ?
(A) [illegible] sources into a single repository
(B) Separating data into different categories
(C) Encrypting data for security
(D) Deleting irrelevant data
36. The .......... operation performs a selection on one dimension of the given cube, resulting in a subcube.
(A) Pivot
(B) [illegible]
(C) Roll-up
(D) Drill-down
37. What is classification in the context of data mining ?
(A) Sorting data into different categories
(B) Combining multiple datasets into a single repository
(C) Storing historical data for analysis
(D) Randomly sampling data points
38. What is data generalization ?
(A) Adding noise to data for privacy protection
(B) Removing irrelevant attributes from the dataset
(C) Analyzing the statistical significance of data
(D) [illegible]
39. What is analytical characterization in data mining ?
(A) Sorting data into different categories
(B) Removing duplicates from the dataset
(C) [illegible]
(D) Adding noise to data for privacy protection
40. What does analysis of attribute relevance involve ?
(A) Sorting data into different categories
(B) Removing outliers from the dataset
(C) Generating association rules between attributes
(D) [illegible]
41. Which type of algorithms use statistical measures in large databases for classification ?
(A) [illegible]
(B) Distance-based algorithms
(C) Decision tree-based algorithms
(D) Clustering algorithms
42. What do distance-based algorithms rely on for classification ?
(A) Statistical measures
(B) Decision trees
(C) Association rules
(D) Similarity or distance measures
43. Which algorithm is a decision tree-based algorithm commonly used for classification ?
(A) K-means
(B) [illegible]
(C) CURE
(D) DBSCAN
44. What is clustering in the context of data mining ?
(A) Sorting data into different categories
(B) [illegible]
(C) Storing historical data for analysis
(D) Randomly sampling data points
45. What are similarity and distance measures used for in clustering ?
(A) Analyzing the distribution of data values
(B) Sorting data into different categories
(C) [illegible] the similarity between data points
(D) Adding noise to data for privacy protection
46. Which algorithm is an example of a hierarchical clustering algorithm ?
(A) K-means
(B) DBSCAN
(C) OPTICS
(D) [illegible]
47. Which clustering method is density-based ?
(A) DBSCAN
(B) Hierarchical clustering
(C) Partitional clustering
(D) STING
48. What is the goal of grid-based clustering methods ?
(A) [illegible] data points based on their density
(B) Separating data into different categories
(C) Analyzing the statistical significance of data
(D) Summarizing data at a higher level of abstraction
49. What is the model-based method for clustering based on ?
(A) Similarity measures
(B) Decision trees
(C) Statistical approach
(D) Distance measures
50. What is association rules mining ?
(A) Sorting data into different categories
(B) [illegible] interesting relationships between variables in large datasets
(C) Analyzing the distribution of data values
(D) Randomly sampling data points
51. What do large item sets represent in association rules mining ?
(A) Groups of similar data points
(B) Outliers in the data
(C) Missing values
(D) [illegible] patterns in the dataset
52. Which algorithm is commonly used for generating association rules ?
(A) K-means
(B) Apriori
(C) CURE
(D) DBSCAN
53. What is the objective of parallel and distributed algorithms in association rules mining ?
(A) Analyzing the distribution of data values
(B) Sorting data into different categories
(C) Speeding up the mining process for large datasets
(D) Randomly sampling data points
54. What approach does the neural network approach take in association rules mining ?
(A) Finding patterns in the data based on neuron activations
(B) Analyzing the distribution of data values
(C) Sorting data into different categories
(D) Randomly sampling data points
55. What is the primary purpose of data visualization in data warehousing ?
(A) Presenting data in a visual format for easy understanding
(B) Generating random data
(C) Summarizing data at a higher level of abstraction
(D) Analyzing the statistical significance of data
56. What does aggregation involve in data visualization ?
(A) Sorting data into different categories
(B) Summarizing data at a higher level of abstraction
(C) Removing duplicates from the dataset
(D) Adding noise to data for privacy protection
57. What type of information does historical information provide in data visualization ?
(A) Real-time data processing
(B) Current trends and patterns in the data
(C) Past trends and patterns in the data
(D) Future predictions based on the data
58. What function does OLAP serve in data warehousing ?
(A) Online Transaction Processing
(B) Online Data Mining
(C) Online Data Backup
(D) Online Analytical Processing
59. Which type of OLAP server stores data in a relational database ?
(A) [illegible]
(B) HOLAP
(C) DOLAP
(D) ROLAP
60. What does the acronym MOLAP stand for ?
(A) Massive Online Analytical Processing
(B) [illegible]
(C) Mixed Online Analytical Processing
(D) Mobile Online Analytical Processing
61. What is the purpose of a data mining interface ?
(A) Analyzing the distribution of data values
(B) Sorting data into different categories
(C) Adding noise to data for privacy protection
(D) [illegible]
62. What aspect of data warehousing does security address ?
(A) [illegible]
(B) Sorting data into different categories
(C) Presenting data in a visual format
(D) Adding noise to data for privacy protection
63. What is the purpose of backup and recovery in data warehousing ?
(A) Analyzing the distribution of data values
(B) Storing data in a visual format
(C) [illegible]
(D) Adding noise to data for privacy protection
64. What does tuning data warehouse involve ?
(A) [illegible]
(B) Sorting data into different categories
(C) Presenting data in a visual format
(D) Adding noise to data for privacy protection
65. What are some types of warehousing applications ?
(A) [illegible]
(B) [illegible] media interaction
(C) Data encryption and decryption
(D) Statistical analysis and regression modeling
66. What does web mining focus on ?
(A) Sorting data into different categories
(B) Presenting data in a visual format
(C) [illegible]
(D) Adding noise to data for privacy protection
67. What is the goal of spatial mining ?
(A) Sorting data into different categories
(B) [illegible]
(C) Presenting data in a visual format
(D) Adding noise to data for privacy protection
68. What does temporal mining focus on ?
(A) [illegible]
(B) Sorting data into different categories
(C) Presenting data in a visual format
(D) Adding noise to data for privacy protection
69. What type of OLAP server stores data in a multidimensional cube ?
(A) ROLAP
(B) [illegible]
(C) DOLAP
(D) MOLAP
70. What is the main purpose of data warehousing ?
(A) Real-time data processing
(B) [illegible]
(C) Generating random data
(D) Conducting online transactions
71. Which component is essential for building a data warehouse ?
(A) Operating system
(B) Web browser
(C) Database management system (DBMS)
(D) Programming language
72. What is the primary function of OLAP servers in data warehousing ?
(A) [illegible]
(B) Data encryption
(C) Data visualization
(D) Database backup
73. Which type of data processing involves summarizing data at a higher level of abstraction ?
(A) [illegible]
(B) Segmentation
(C) Classification
(D) Regression
74. What is the purpose of data cleaning in data warehousing ?
(A) Adding noise to the dataset
(B) Increasing the size of the dataset
(C) Enhancing data security
(D) [illegible]
75. What does OLAP allow users to do ?
(A) Conduct online transactions
(B) [illegible]
(C) Encrypt data for security purposes
(D) Create random datasets
76. Which of the following is not a component of data warehousing ?
(A) OLAP server
(B) Database management system
(C) [illegible]
(D) Data warehouse
77. Data warehouse contains .......... data that is never found in operational environment.
(A) normalized
(B) informational
(C) summary
(D) denormalized
78. Which of the following is an essential process in which the intelligent methods are applied to extract data patterns ?
(A) Warehousing
(B) Text Mining
(C) Data Selection
(D) Data Mining
79. What is KDD in data mining ?
(A) [illegible]
(B) Knowledge Data Definition
(C) Knowledge Discovery Database
(D) Knowledge Data House
80. For what purpose do the analysis tools pre-compute the summaries of the huge amount of data ?
(A) To obtain the queries response
(B) [illegible]
(C) For authentication
(D) For data access
81. The Data Warehouse is ..........
(A) write only
(B) [illegible]
(C) read only
(D) None of the above
82. Expansion for DSS in Data Warehouse is ..........
(A) Decision Single System
(B) Data Storable System
(C) Decision Support System
(D) [illegible]
83. .......... describes the data contained in the data warehouse.
(A) Relational data
(B) [illegible]
(C) Operational data
(D) Informational data
84. .......... predicts future trends and behaviours, allowing business managers to make proactive, knowledge-driven decisions.
(A) Data warehouse
(B) Datamarts
(C) [illegible]
(D) Metadata
85. .......... is the heart of the [illegible]
(A) [illegible]
(B) Datamining database servers
(C) Datamart database servers
(D) Relational database servers
86. .......... is the specialized data warehouse database.
(A) [illegible]
(B) Redbrick
(C) DB2
(D) Informix
87. .......... defines the structure of the data held in operational databases and used by operational applications.
(A) [illegible]
(B) [illegible]
(C) Data warehouse metadata
(D) Data mining metadata
88. .......... is held in the catalog of the warehouse database system.
(A) Application level metadata
(B) Departmental level metadata
(C) [illegible]
(D) Algorithmic level metadata
89. .......... maps the core warehouse metadata to business concepts, familiar and useful to end users.
(A) [illegible]
(B) End user level metadata
(C) Application level metadata
(D) Core level metadata
90. What is the difference between OLTP and OLAP ?
(A) OLTP is optimized for analytical processing, while OLAP is optimized for transaction processing.
(B) [illegible] analytical processing.
(C) OLTP and OLAP are the same thing.
(D) OLTP and OLAP are both optimized for transaction processing.
91. What is a dimension table ?
(A) A table that stores transactional data.
(B) [illegible]
(C) A table that stores metadata.
(D) A table that stores details about customers.
92. Which of the following is not a basic data mining task ?
(A) Prediction
(B) Classification
(C) [illegible]
(D) Clustering
93. Which of the following is not an issue in data mining ?
(A) Overfitting
(B) Outliers
(C) High dimensionality
(D) [illegible]
94. Which of the following is a subset of data warehouse focused on a specific functional area ?
(A) Association rules
(B) [illegible]
(C) Flat files
(D) Database
95. What do data warehouses support ?
(A) OLTP
(B) [illegible]
(C) OLAP and OLTP
(D) Operational databases
96. Which one of the following statements is not correct about data cleaning ?
(A) It refers to the process of data cleaning.
(B) It refers to the transformation of wrong data into correct data.
(C) It refers to correcting [illegible]
(D) [illegible]
97. The issues, like efficiency and scalability of data mining algorithms, come under :
(A) Diverse data type issues
(B) Performance issues
(C) Mining methodology and user interaction
(D) [illegible]
98. Which one of the following can be considered as the correct application of data mining ?
(A) Fraud detection
(B) Corporate analysis and Risk management
(C) Management and Market analysis
(D) [illegible]
99. .......... is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management decisions.
(A) [illegible]
(B) Data mining
(C) Web mining
(D) Text mining
100. Record cannot be updated in ..........
(A) files
(B) [illegible]
(C) RDBMS
(D) None of the above