Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
22 views17 pages

Data Warehousing and Data Mining-1

The document is an examination booklet for a B.C.A. (Fourth Semester) course on Data Warehousing and Data Mining, containing instructions for candidates and a series of questions related to the subject. It covers various topics such as data warehouse design, data mining techniques, and data analysis processes. The maximum marks for the examination are 75, and candidates are instructed on how to handle the question booklet and answer sheets.

Uploaded by

xyz010010001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0% found this document useful (0 votes)
22 views17 pages

Data Warehousing and Data Mining-1

The document is an examination booklet for a B.C.A. (Fourth Semester) course on Data Warehousing and Data Mining, containing instructions for candidates and a series of questions related to the subject. It covers various topics such as data warehouse design, data mining techniques, and data analysis processes. The maximum marks for the examination are 75, and candidates are instructed on how to handle the question booklet and answer sheets.

Uploaded by

xyz010010001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
You are on page 1/ 17
1097382 Serial No. of Booklet : Code No. : 1297 B.C. A. (Fourth Semester) Examination, 2023-24 ‘AFFIX PRESCRIBED RUBBER STAMP Roll No. Paper Third DATA WAREHOUSING AND DATA MINING Course Code—BCA-403T (Major) In Figures (iat #) : QIAlW ol2h [8]3a [s [to fofafo In Words (ret #) : ignature of Invigilator Max. Marks : 75 wa fede 3 seme Important Instructions : erage ‘Ae: 1 ‘The candidate will write his/her Roll Number 1 seme sm m aaa set tw only at the places provided for, i. ¢., on the fora oh ga fee RI oa &, aly wea cover page and on the OMR answer sheet at giro 3 gor ye ca am BS od the end and nowhere else. alestas ce ett a gt feretar 2. Immediately on receipt of the question booklet, the candidate should check up,the > 4 SPiver Pet & once at oe wx booklet and ensure that it contains all'the ghar ox dar ate yer Fy pages and that no question is rising: If he wet ake Os wear mw at ad Bak candidate finds any discrepancy ‘in the rea question booklet, he’she should report the eo nea wer gfirar Bra & 10 invigilator within 10 minutes of the issue of 4 oa tas o afta this booklet and a fresh question booklet oar ae ae fia gt & wa wear + without any discrepancy is obtained. Wa a da ae) No second question booklet shall be given to a candidate under any circumstances after 10 minutes. The candidate should be careful in handling the question booklet and in filling the OMR answer sheet given separately with this booklet, (Contd. on the last page / amex fer sift y. oR) 1. Ae What is a Data Warehouse primarily designed for? (A) Real-time transaction processing @© Historical data analysis and “reporting (C) Online gaming = (D) Social media interaction What are the components of a Data Warehouse ?. (A) Web servers, databases and applications (B) Input devices, processing units and output devices a, (C) Tables, queries and forms @ data sources, ETL processes ae What is the process of building a Data Warehouse 2 2 (A) Data’Mining (B) Data Analysis @ = I00010 (D) Data Visualization (3) What is thé difference between a P Database System and a Data Warehouse ? ° @® ‘* sotabase is used to store and (B) A database is used for data data warehouse is used to store and analytics, while a capture data. (C) A database is used for data while data warehouses are designed for analytics, transactional data storage. (D) None of the above What is a multi-dimensional data model in Data Warehousing ? (A) A model that stores data in a single dimension (B) A model that stores data in two dimensions (D) A model that stores data in a non-relational format 12971 6. 7. 8. What are data cubes, stars, snowflakes and fact constellations in Data Warehousing ? (B) Types of data storage devices (C) Data encryption techniques (D) Data visualization tools What is the first step in the Data Warehouse process and technology ? (A) Data Analysis (C) Data Mining (D) Data Visualization What ‘is the strategy behind Data Warehousing ? (A) Storing data in‘a flat file format (B) Deleting historical data (© Performing real-time data analysis a _ singe repository (4) 10. 6 What are ‘the management anq support processes involved in Data Warehousing ? (A) Hardware maintenanegg, (B) Data backup and recovery WH Network security — (D) Web development What is involved in the planning and implementation of a Data Warehouse ? (A) Setting up social media accounts (B) Creating data visualizations @ designing data models and | schemas (©) Building mobile appligations is a subject-oriented integrated, time-variant, nonvolatile collection of data in support of management decisions. (A) Data Mining PHP via Warehousing | (C) Web Mining & (D) Text Mining 12971 @ 12. What is the prithary purpose of a Data Warehouse ? (rem (B) Real-time transaction processing (C) Online gaming (D) Social média interaction 13. The data is stored, retrieved and . ~ updated in 5 (A) OLAP (C) SMTP (D) FTP 14. What is the process of selecting only relevant data from a larger dataset called : My (A) Sampling Pricing (J Data reduction (D) Data integration s (8) d- 15. Which of the following is not a commonly - used data warehouse architecture ? “\ (A) Inmon (B) Kimball (©) Snowflake — 16. Which of the following is not a > common data mining application? (&) Fraud detection (B) Customer segmentation - ©) Sales forecasting WP Woventory management 17. The star schema is composed of fact table. Poe () two. (C) three (D) four 1297/0 18. The modern CASE tools belong to category. (B) development (C) coding (D) delivery 19. Data scrubbing is : (A) a process to reject data from the data warehouse. and, to create the necessary indexes. (B) a process to load the data in the data warehouse and to create the necessary indexes. (C) aprocess to upgrade the quality _ of data after it is moved into a data warehouse. 20. What is the (primary goal of data mining ? (A) Real-time data processing (B) Storing historical “data for reporting (C) Conducting online transactions ON 21, What is the main motivation behing data mining ? (A) Generating random data > (C) Data visualization (D) Network seciiity 22. What is data pre-processing in the context of data mining ? (B) Encrypting data for security purposes : (C) ‘Storing data in a database (D) Analyzing data pattems 23. How is noisy data typically addressed in data cleaning ? AB v5 osing clustering techniques. (B) Through human inspection “ie (C) By ignoring it (D) None of the above 4297/0 7 a/a does data reduction involve in . data mining ? (A) Increasing the volume of data for analysis (C) Randomly sampling data points (D), Ignoring data outliers 25. at technique is: used for data cube aggregation 2 : (A) Clustering , (B) Regression analysis (C) Dimensionality reduction —-— 26. What is dimensionality reduction in data mining ? (A) Increasing the number of dimensions in a dataset MPrecieasing the numberof | (C) Storing data in a hierarchical structure (D) Grouping similar data points together (7) be JI Wat is data compression in data - mining ? MBPReducing the size of the dataset (B) Increasing the size of the - dataset for better analysis (C) Encrypting data for -sectirity purposes (D) ‘Storing data in a compressed format 28, Which technique is used for numerosity reduction ? 0%) Clustering (B) ‘Regression analysis (D) Random sampling 29, What - is concept hierarchy + generation in data mining ? = (B) Removing irrelevant data from the dataset (©) Applying statistical techniques to analyze data patterns (D) Predicting future trends based on historical data 1297/1 . 7 30. What is one form of data cleaning 33, To represent any n-Dimension data 32. technique used for handling we need a series of inconsistent data ? Seg Dimension cubes. (9 Clustering Foy ae oa (B) Regression analysis (@) n APP Computer inspection — Oo (@) Random sampling () n+2 What is the main objective of data as Which technique is commonly used mining ? to handle missing values in data (A) Real-time data processing cleaning’? AF =2:1ng: meaningtal pases) Cusine (B), ‘Regression tay a, Poin (C) Storing historical data for reporting (D) Computer inspection (D) Conducting online transactions 35. What is data integration in data mining ? Fo ay What drives the motivation behind - : data mining ? "sources into a'single repository ee (B), Separating data into different (B) Data visualization categories (C)_ Network security (C) Encrypting data for security BB *:sivtve modeling, (D) Deleting irrelevant data (8) 12971 @ 36. ‘The selection on one dimension of the operation performs: a given cube, resulting in a subcube. (A) Pivot ; PRI (C) Roll-up (D) Drill-down 37. What is classification in the context of data mining ? PHMPoring cata into different "categories (B) Combining multiple datasets into’a single repository (C) Storing -historical data for analysis (D) Randomly sampling dat3*Paints %8,. What is data generalization ? (A) Adding noise to data for privacy protection (B) Removing irrelevant attributes from the dataset (©) Analyzing the significance of data statistical 39. oe What is analytical . characterization in data mining ? “(A) Sorting data into different 40. oN Se categories (B) Removing duplicates ‘from the dataset . (D) Adding rioise to data for privacy protection What does analysis of attribute relevance involve ? : (A) Sorting data into difffent categories (B) Removing outliers from the dataset (C) Generating association rules between attributes 1297/@ 41. Which type of algorithms use 44. statistical measures in large databases for classification ? (B) Distance-based algorithms (S) Decision tree-based algorithms () Chustering algorithms. - 42. What do distance-based algorithms 45. ely on for classification ? (A) Statistical measures (B) Decision trees (C) Association rules Moray ok sisance measures 43, Which algorithm is a decision tree- based algorithm commonly used for classification ? (#) K-means (C) CURE (D) DBSCAN s (10) 46. What is clustering in the Context ot data mining ? (A) Sorting data into differen categories rina te ep, (©) Storing historical data for analysis : (D) Randomly sampling data points What are similarity and distance measures used for in clustering ? (A) Analyzing the distribution of data values (B) Sorting data into different categories APPreriving the sinitarty between data points () Adding noise to data for ‘privacy protection Which algorithm: is an, example of a hierarchical coaiebagetien ? (A) K-means (B) DBSCAN (©) optics e 12971 - wo 47. Which clustering method is density: 50. What is association rules mining ? ge T based? (A) Sorting data into different eerscan categories (B) Hierarchical clustering PPivcig interesting (C) Partitional clustering _ relationships between variables ©) sTING + in large datasets . 48. What is the goal. of grid-based (©) Analyzing the distribution of a clustering methiods ? oe lees PM rcuring data points based on (D) Randomly sampling data points their density S1.. What do large item sets represent in (B) Separating data into different association rules mining ? Ny categories : (A) Groups of similar data points (C) Analyzing the statistical " (B) Outliers in the data significance of data (C) Missing values (D) Summarizing data at a higKer GH eg0201 pattems in the dataset level of abstraction 52. Which algorithm is. commonly 49. What is the model-based method for” used. for generating association (iS coetting bed on? miles? (A) Similarity measures (Af K-means (B) Decision trees Pris PH s2iistical approach ©) CURE () Distance measures Ne, (D) DBSCAN 8 . (1) 1297/6 53. What is the objective of parallel and distributed algorithms in association rules mining ? (A) Analyzing ne si tribution of data values (B) Sorting data into diffSient categories speeding up’ the mining Process for large datasets (D) Randomly sampling data points What approach does the neural network approach take — in ‘association rules mining ? So Finding pattems in the data based on neuron activations 55. (B) ‘Analyzing the distribution of data values (C) Sorting data into different categories (D) Randomly sampling dati’Paiges ‘What is the primary purpose of data visualization in data warehousing ? ou data in a visual format for easy understanding (B) Generating random data (C) Summarizing data at 2 bigher level of abstraction (D) Analyzing the significance of data statistical (12) 56. 37. 58. y > What does ‘aggregation involye in data visualization ? (A) Sorting data into » Airey categories ummarizing data at a higher level of abstraction (C) Removing duplicates from the dataset (D) Adding noise to data for privacy protection What type of ‘information “Shes historical information provide in data visualization? (A) Real-time data processing (B) Current trends and patterns in the data ast trends and patterns in the data (D) Future predictions based on the data . What function does OLAP serve in data warehousing ? (A) Online Transaction Processing (B) Online Data Mining (C) Online Data Backup Online Analytical Processing 1297/0 | 59. 60. 61. Which type of OLAP server stores data in a relational database ? (B) HOLAP . (©) DOLAP (D) ROLAP™ What does the acronym MOLAP stand for ? (A) Massive Processing Oriline’ Analytical (© Mixed, Online Analytical "Processing () Mobile Online Analytical Processing < = ‘What is the purpose of a data mining interface ? (A) Analyzing the distribution of data values : (B) Sorting data into ~ different categories (C) Adding noise to data fer privacy protection —_— (13) 62. What aspect of data warehousing does security address ? (B) Sorting data into different categories (C) Presenting data in a visual format {D) Adding noise to data for privacy protection What is the purpose of backup and recovery in data warehousing ? (A). Analyzing the distribution of data values we “{B) Storing data in a visual Format (D) Adding noise to data for privacy protection 64. What does tuning data warehouse involve ? ®). Sorting data into different’ categories (C) Presenting data in a visual format (D) Adding noise to data for privacy protection 1297/0 ) { 0 65. What are some types of 68, What does temporal mining focys warehousing applications? on? . gare come ee (8) Sorting data into different media interaction * categories (C) Data encryption and decryption ae (D) Statistical analysis and regression modeling (C) Presenting data in a visual format 66. What does web mining focus on? (D) Adding noise to data for (A) Sorting data into different privacy protection categories 69. What type of OLAP server stores @) Presenting data in a visual data in a multidimensional cube format. ~ ae oo” at 7 ine (A) ROLAP : sn - (D) Adding noise to data for wre privacy protection (C) DOLAP — MOLAP 67. What is the goal of spatial mining 2 ®) (A) Sorting data into different 70, What is the main purpose of data — categories warehousing ? (A) Real-time data processing (C) Presenting data in a visual wa format (D) Adding noise to data . for (C) Generating random data privacy protection (D) Conducting online transactions s (4) 1297/@ 71. Which component is essential for building a data warehouse ? (A) Operating system (B) Web browser “— (BMS) _ = ©) Programming language ey 72. What is the primary function of r OLAP servers in data warehousing ? * (B) Data encryption ice (2) Data visualization (D) Database backup 73. Which type of data processing involves summarizing data at a higher level of abstraction ? (B) Segmentation ee (©) Classification (D) Regression 74. What is the purpose of data cleaning in data warchousing ? (A) Adding noise to the dataset (B) Increasing the size of the dataset (©) Enhaneing data security 77., Data warehouse contains ...» i (18) 75. What does OLAP allow users to do ? (A) Conduct online transactions. (C) Encrypt data for security Purposes (D) Create random datasets 76. Which of the following is not a component of data warehousing ? (A) OLAP server ia (B) Database management system (D) Data warehouse data that is never found in operational environment. (A) normalized (B) informational © summary APPvevormalized 78. Witich of the following -is “ii essential process in which the intelligent methods are dpplied to extract data patterns ? (A) Warehousing (B) Text Mining (C) Data Selection PP PeaMining 1297 @ What is KDD in data ing ? (B) Knowledge Data Defi (C) Knowledge Database (D) Knowledge Data House mn Distovery 80. For what purpose, the analysis tools a pre-compute the summaries of the huge amount of data ? (A) To obtain the queries respons vA (C) For authentication (D) For data access 81. The Data Warehouse is .. (A) write only - He) read only (D) None‘of the above 82. Expansion for DSS in Data _ Warehouse is «11... sees (A) Decision’Single System (B) Data Storable System ~ Decision Support System 83. . describes the data contained in the data warehouse. (A) Relational data (C) Operational data (D) Informational data Ra : (16) bo 84... .. prédicts future trends and behaviours, allowing business managers to make — proactive, knowledge-driven decisions. (A) Data warehouse (B) Datamarts (D) ,Metadata 85. is the heart of the (B) Datamining database servers (C) Datamart database servers (D) Relational database servers 86. is the spécialized data Ft warehouse database. wey # (B) Redbrick (C) DBZ (D) Informix 87. . defines the structure of the data held in. operational databases and used by operational applicgtions. %) Oe ta (C) Data warehouse metadata (D) Data mining metadata ~ “= 1297/@ oy 88. .. is held in the catalog of the warehouse database system. (A) Application Igvel metadata (B) Departmental level metadata (D) Algorithmic level metadata 89... .. maps the —_ core warehouse metadata’ to business concepts, familiar and useful to _endusers. (B) End user level metadata (©) Application level metadata (D) Core level metadata 90. What is the difference between OLTP and OLAP ? = (A) OLTP is optimized “Stor analytical processing, while OLAP is optimized for transaction processing. ~~ analytidal processing. (©) OLTP and OLAP are théWpye thing, : () OLTP and OLAP are both optimized for —_ transaction processing. (7) 91,, What is a ditnension table ? KR (A) A table that stores transactional data. (C) A table that stores metadata. A table that stores details about @) on customers. 92. Which: of the following is not a basic data mining task ? (A): Prediction (B) Classification BHorovivs ©) Clustering 93. Which of the following is SK an issue in data mining? | (A) Overfiting (B) Outliers (C) High dimensionality PP vores ora ~ 1297/6 94. Which of the following is a subset of data warehouse focused on a specific functional area ? (A) Association rules (C) Flat files es (D) Database Be What do data warehouses support ? 7 (A) OLTP (@f OLAP and OLTP (D) Operational databases 96. Which one of the following {statements is not correct about the data cleaning ? (A) It refers to the process of data cleaning. (B) It refers to the transformation of wrong data into correct dala, (C) It refers to correcting 97. The issues, like _ efficiency, scalability of data mining algorithms comes under ; (A) Diverse data type issues (B) Performance issues (C) Mining methodology and user interaction S 98. Which one of the following can be considered as the correct application of the data mining ? (A) Fraud detection _ (B) Corporate analysis and Risk management (C) Management analysis 99. is a subject-oriented, and = Market integrated, time-variant, non-volatile collection of data in support of management decisions. (B) Data mining (C) Web mining (@) Text mining : “ig. 100. Record cannoi be updated in (A) files : (C) rdbms (D) None of the above 12971

You might also like