0 ratings0% found this document useful (0 votes) 45 views13 pagesData Mining - 015542
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here.
Available Formats
Download as PDF or read online on Scribd
r
ANAND ENGG. COLLEGE AGRA
R SATION
B.TECH EIGHTH SEMESTER EXAMINA TIC
DATA MINING AND WAREHOUSING
inne? 60 mii Total Marks :30
(i) Attempt any FIVE question
Note:
GiiJAM questions carry equal marks.
1.Define the data warchousing and its basie characteristics.
2.Describe operational data store and informational data store.
3.Deseribe development of clint server systems
4.Explain in short distributed memory architecture.
€ 5. What is the data architecture of data warehouse operation?
6.Write E.F.Codd’s 12 guidelines for OLAP
OR
Describe features of ORACLE system.
TWrite 10 mistakes for data warchousit
OR
Describe multiprocessor systems
managers to avoid-
ANAND ENGG COLLAG
COMPUTER SCIENCE FINAL YEAR
DATAMINING AND W AREHOUSING
SUB CODE - CS031
Attempt any five questions
QI. Is Data Warehouse a Database ? Explain.
Q2.What are Data Warehouse design issuses and decisions required ?
Q3.Describe various categories of tools used in data warehouse 2
Q4, Explain benefits of datawarehousing ?
Q5 . Describe importance of parallelism in Data Warehouse
Q6. Discuss DBMS schemas for Decision support ?
Q7. Describe about metadata repository 2£82000
Printed Pages—2 Bon
(Following Paper ID and Roll No. tobe filled in your Answer Book)
De R RET
B.Tech.
EIGHTH SEMESTER EXAMINATION, 2004-2005
DATA MINING AND WAREHOUSING
Time : 3 Hours Total Marks : 100
Note: (i) Attempt ALL questions.
(ii) All questions carry equal marks.
1. Attempt any four of the following : (5x4=20)
(a) What are the characteristics of the data in a data
warehouse ?
(b) What is difference between physical and a logical
data warehouse ?
(c) Explain Data warehouse life cycle ?
(d) What is data sourcing ? Explain.
(e) What are the uses of data warehouse ?
(9 What is the data architecture of data warehouse
operations ?
2. Attempt any two of the following : (10x2=20)
(a) Describe the structure of a data warehouse with the
help of a diagram.
(b) What are Data Warehouse design issues ?. Explain
with examples.
Discuss DBMS schemas for Decision support.CT cee
No. of Printed Pages—2 CS-031
B. TECH.
EIGHTH SEMESTER EXAMINATION, 2003-2004
DATA MINING AND WAREHOUSING
Total Marks : 100
Time : 3 Hours
Note = (1) Attempt ALL questions.
(2) All questions carry equal marks.
1. Attempt any FOUR parts of the following :—
Define the Data Warehousing and its basic
characteristics.
@) Briefly describe about the historical
development of client/server systems.
(©) Describe about the multi-processor systems.
- (@ Describe about the DBMS connectivity.
Discuss about the Reliability and Availability
of Relational Database systems.
Describe the features of Sybase systems.
@)
©
"
‘Attempt any FOUR parts of the following —
2
(@) Is Data ‘Warehouse a Database ? Explain.
(What are the design considerations for data
warehouse ?
oO Describe about the Metadata and state how
: it is useful.
(@ Explain the use of Data Partitioning.
(© Describe about the data cardinality in data
= warehousing.
Describe about the Metadata Repository:
Turn OverWT
[5]
No. of Printed Pages—2
Time : 3 Hours
EIG
Roll No,
B. TECH.
HTH SEMESTER EXAMINATION, 2003-2004
DATA MINING AND WAREHOUSING
Note: (1) Attempt ALL questions.
15
Re
(2) All questions carry equal marks.
Attempt any FOUR parts of the following :—
(a)
(b)
©
@)
@
ff)
Define the Data Warehousing and its basic
characteristics.
Briefly describe about the historical
development of client/server systems.
Describe about the multi-processor systems.
Deseribe about the DBMS connectivity.
Discuss about the Reliability and Availability
of Relational Database systems.
Describe the features of Sybase systems.
Attempt any FOUR parts of the following :—
@
(b)
©
@
©)
/
CS-031
Is Data Warehouse a Database ? Explain.
What are the design considerations for data
warehouse ?
Describe about the Metadata
it is useful.
Explain the use of Data Partitioning.
ry in data
and state how
Describe about the data cardinalit
warehousing.
Describe about the Metadata Repository.
Turn Over
CS-031
Total Marks : 100ANAND ENGINEERING COLLEGE AGRA
; B.TECH.
FIGHT SEMESTER PRE-UNIVERSITY TEST 2006-07
: DATAMINING & WAREHOUSING
Max Time : 2hrs Max Marks :60
PAPER ID-:1042 PAPER CODE: CS 031
4. Attempt any FOUR of the following: (5x4)
(a) Differentiate between OLTP and PLAP with examples.
(b) What is Pattern? Describe about visualizing of pattern.
(€) State 12 rules for evaluating OLAP products developed by E F.Codd.
(d) Describe about the Reporting and Managed Query tools
{e) What is Model? Describe about Selection and Acquisition
2. Attempt any TWO of the following (10x2)
(a) Discuss most commonly used techniques in Data Mining
(b)What is @ Neural Network? What are the advantages and applications of
Artificial Neural Networks?
{c)How decision trees are useful in Data mining? Explain.
3. Attempt any TWO of the following: (10x2)
(a) Write a note on Data Visualization and overall perspective
(0) What is Mutation? How the mutation is useful in Data Mining, Explain.
(©) Describe about the Data Visualization principles.RING COLLEGE AGRA
ser B.TECH.
EIGHT SEMESTER PRE-UNIVERSITY TE!
DATAMINING & WAREHOUSING
2006-07
‘Max Time : 2hrs Max Marks :60
PAPER ID-:1042 PAPER CODE: CS 031
1. Attempt any FOUR of the following (5x4)
(a) Differentiate between OLTP and PLAP with examples
(b) What is Pattern? Describe about visualizing of pattern.
(c) State 12 rules for evaluating OLAP products developed by EF.Codd.
(a) Describe about the Reporting and Managed Query tools.
(e) What is Model? Describe about Selection and Acquisition
2. Attempt any TWO of the following (10x2)
(a) Discuss most commonly used techniques in Data Mining
(b)What is a Neural Network? What are the advantages and applications of
Artificial Neural Networks?
(How decision trees are useful in Data mining? Explain
3. Attempt any TWO of the following i (10x2)
(a) Write a note on Data Visualization and overall perspective.
(b) What is Mutation? How the mutation is useful in Data Mining. Explain.
(c) Describe about the Data Visualization principles.following —
JUR parts of the
‘Attempt any FO
2) Describe about the Reporting and Managed
Query tools. .
(What are the basic guidelines for OLAP ?
(What is Pattern ? Describe about visualising
of pattern. é
What is the Model ? Describe about Selection
and Acquisition.
() What is Missing Data ? How is this problem
solved ?
(f) What is Hypothesis Testing ?
4. Attempt any TWO parts of the following :—
(a) Describe about the Data Mining Effectiveness.
(0) “How are decision trees useful in Data
Mining ? Explain.
(9 What is a Neural Netwrok ? Explain.
(@) Describe about the Back Propagation in
Neural Networks.
5. Attempt any TWO parts of the following :—
(@) Describe about the hierarchical and non-
hierarchical clustering.
(6) What is Mutation ? Explain.
(©) Describe about the Data Visualisation
principles.
(@ What do you understand by Data Quality ?
Explain.
@
»(10x2=20)
19 of the following :
picsiietlets LTP and OLAP with
3
(a) Differentiate between O
examples. ,
(b) Discuss various Report generating and Query tools.
() State 12 rules for evaluating OLAP products
developed by E.F. Codd.
4. Attempt any two of the following + (10x2=20)
(a) Discuss most commonly used techniques in Data
Mining.
Discuss the advantages and disadvantages of Data
©) ge
Mining.
() Give examples of main tasks that are solved by a data
P
mining system. .
5. Attempt any two of the following : (10x2=20)
(@) Write a note on Data visualization and overall
perspective,
(b) Big Data-Better Results. Justify the statement.
(©) Write short note on the following :
() Decision tree.
(i) Genetic Algorithm.
-000-nous sue exuiaeuvus ve use1ul H-tnen rules rom Gate
~ Pesca'on statistical significance.
ieaniteation®: | interpretation of complex
© Data visualization : The visua
relationships in multidimensional data. Graphies tools are use¢
to illustrate data relationships.
10.5.2. What Technological Infrastructure is Required?
Today, data mining applications are available on all size systems for
mainframe, client/server, and PC platforms. System prices range from
several thousand dollars for the smallest applications up to $1 million a
terabyte for the largest. Enterprise-wide applications generally range in
size from 10 gigabytes to over 11 terabytes. NCR has the capacity to
deliver applications exceeding 100 terabytes. There are two critical
technological drivers :
© Size of the database : The more data being processed and
maintained, the more powerful the system required.
© Query complexity : The more complex the queries and the
greater the number of queries being processed, the more powerful
the system required.
Relational database storage and management technology is adequate
for many data mining applications less than 50 gigabytes. However, this
infrastructure needs to be significantly enhanced to support larger
applications. Some vendors have added extensive indexing capabilities
to improve query performance. Others use new hardware architectures
such as Massively Parallel Processors (MPP) to achieve order-of-
magnitude improvements in query time. For example, MPP systems
NCR link hundreds of high-speed Pentium processors to a
performance levels exceeding those of the largest supercomputer
SUMMARY
This chapter introduces data mining, data, informati
and introduces data mining methods like neural netwo1
clustering, association rules.
1. What do you mean by data mining ?
2. Differentiate between data warehouse and
3. Write short notes on :
(a) Data (b) Information
What are relationships needed between
5. Write the five major elements of d
6. What are the different methods
explain in brief. 3Introduction to Data Warehousing C
ahction to Data Warehousing
This chapter introduces about Data warehousing, its history and
compares it with OLTP. It describes about the components of data
warehouse, advantages of data warehouse, gives the conceptual view of
a data warehouse and gives introduction to 3 tier architecture.
EXERC
What is data warehouse ? Give its architecture ?
What are the components of data warehouse ?
How data is acquired or collected in a data warehouse ?
Give the conceptual view of a data warehouse.
. Explain METADATA.
What are the advantages and disadvantages of a data warehouse ?
Differentiate between OLAP and OLTP.
What are the disadvantages of query driven approach ?@
SUMMARY
This chapter discusses the Data warehousing concepts, need for
‘veloping data warehousing. Data warehousing in decision making,
OLTP vs OLAP, Data marts etc.
EXERCISE
1, What is data warehousing ?
2. Explain the architecture of Dataware housing ?
3. What are various components of a dataware house ?
4. Why we need a separate data warehouse ?
5. What do you mean by subject-oriented, integrated, time-variant and
non-volatile collection of data in data warehousing.
6. What are the advantages of data warehouse.
7. Explain metadata and its importance.
What should be the features of administration and management tools.
9. Differentiate between OLTP and data warehouse ?
10. Differentiate between DSS and OLAP. :
11. What are the different data processing models.fen
ees)
This chapter explains about non-client server and client server
architecture. It describes 2-tier, 3-tier and n-tier in detail. It explains
the need of tier architecture in data warehousing. It also explains the
‘ests and limitation of tiered architecture.
EXERCISE
1. Explain MAINFRAME and SHARING architecture.
2 What do you mean by client-server model ?
3. Explain 2-tier and 3-tier architecture in detail ?
4. How is the data warehouse different from other systems ? a
5. Write short notes on eee
(a) Message Server
(6) Application Server
(c) ORB Architecture
(d) Client-Server Architecture.
6. What are the costs and limitation of 2-tier a
7. What is the need of client-server archit