Department of CS/IT
TYCS – SEM 6 - DATA SCIENCE
MCQ Model Questions
Choose the correct option:
1. Which of the following would be more appropriate to be replaced with question mark in
the following figure?
a) Data Analysis b) Data Science c) Descriptive Analytics d) Commerce
2. Which of the following is the most important language for Data Science?
a) Java b) Ruby c) R d) Basic
3. Which of the following is the common goal of statistical modelling?
a) Inference b) Summarizing c) Subsetting d) script
4. -----------------------Shows all individual data points.
a) Box-plot b) Scatter Plot c) Line plot d) Pie chart
5. Xquery is a functional query language used to retrieve information stored in -----------
format.
a) HTML b) XML c) UML d) Jscript
6. Xpath specification has ------------------------ types of nodes
a) Four b) Five c) Six d) Seven
7. Data Visualization is also on element of the broader ------------------------
a) deliver presentation architecture b) data presentation architecture
c) dataset presentation architecture c) data process architecture
8. Which method shows hierarchical data in a nested format?
a) tree maps b) Scatter Plots c) Population pyramids d) Area Charts
9. Which of the following is most basic and commonly used techniques?
a) line charts b) Scatter plots c) Population pyramids d) Area charts
10. Which of the following is not a part of data science process?
a) discovery b) model planning c) communication building d) operationalize
11. In Xquery ___________ symbol preceded before the variable name.
a) @ b) $ c) # d) *
12. MongoDB support cross platform and is written in _____________ language.
a) C++ b) R c) Java d) Python
13. MongoDB is ___________ database.
a) SQL b) NoSQL c) RDBMS d) DBMS
14. Ridge Regression is when data suffers from ___________
a) Collinearity b) Multicollinearity c) Does not suffer d) Regression
15. Bayesian information Criterion (BIC) is related to _______________
a) Ridge regression b) AIC c) Cross validation d) Lasso Regression
16. Joins are used for combining _____________ product.
a) Vector b) Cartesian c) Scalar d) Euler
17. Which of the following step is performed by data scientist after acquiring the data?
a) Data Cleansing b) Data Integration c) Data Replication d) Deletion
18. Which of the following package is used for reading excel data?
a) xlsx b) xlsc c) read.sheet d)VB
19. Which of the following is another name for raw data?
a) destination data b) eggy data c) secondary d) Machine Learning
20. Arranging the customers names in ascending order is an example of
a) process b) information processing c) process d) information
21. Organisation, distribution and manipulation of information is classified as
a) data manipulating b) process selection
c) information extraction d) information processing
22. Quantitative data deals with _________________
a) numbers and things b) Characteristics c) images d) sketches
23. Qualitative data deals with _____________________
a) Characteristics b) numbers c) things d) price
24. Example for discrete data ______________________
a) The number of children b) height of children
c) weight of children d) behaviour of children
25. Primary data is __________________________________
a) Collected for the first time b) Collected for the second time
c) Not original data d) statistical operations have been performed.
26. The use of tabular data and graphs and charts makes it __________ to understand the
concept of bar charts and histograms.
a) easy b) difficult c) boring d) confusing
27. This language was developed by Dennis Ritchie of Bell Laboratories in order to
implement the operating system UNIX.
a) C b) C++ c) Java d) LISP
28. Computer programs are written in a high level programming language; however, the
human-readable version of a program is called ………….
a) cache b) Instruction set c) source code d) word size
29. Query language comes under:
a) Third generation b) Fourth generation c) Fifth generation d) First Generation
30. Bitmapped file formats can be most useful for ____________
a) Plots that may need to be resized
b) Plots that require animation or interactivity
c) Plots that are not scaled to a specific resolution
d) Scatterplots with many many points
31. The stem and leaf displaying technique is used to present data in
a) descriptive data analysis b) exploratory data analysis
c) nominal data analysis d) ordinal data analysis
32. Example for semi structured data__________________
a) XML data b) Relational data c) media logs d) word
33. Example for Unstructured data ________________
a) media logs b) XML data c) Relational data d) Oracle
34. Which of the following is not a NoSQL database?
a) SQL Server b) MongoDB c) Cassandra d) C
35. Which of the following is a NoSQL Database Type?
a) SQL b) Document Database c) JSON d) C++
36. NoSQL databases is used mainly for handling large volumes of ______________ data.
a) unstructured b) structured c) semi-structured d) images
37. The government and non government publications are considered as
a) external secondary data sources b) internal secondary data sources
c) external primary data sources d) internal primary data sources
38. Amazon web services falls into which of the following cloud-computing category?
a) Platform as a Service b) Software as a Service
c) Infrastructure as a Service d) Back-end as a Service
39. The _______ is a symbolic representation of facts or ideas from which information can
potentially be extracted.
a) knowledge b) data c) algorithm d) program
40. Data mining is used to refer ______ stage in knowledge discovery in database.
a) Selection b) retrieving c) discovery d) coding
41. A collection of interesting and useful patterns in database is called _______.
a) knowledge b) information c) data d) algorithm
42. ________analysis divides data into groups that are meaningful, useful, or both.
a) cluster b) text c) multimedia d) link
43. Data dictionary is _____________________
a) Large collection of data mostly stored in a computer system
b) The removal of noise errors and incorrect input from a database
c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.
d) image
44. Data cleaning is
a) Large collection of data mostly stored in a computer system
b) The removal of noise errors and incorrect input from a database
c) The systematic description of the syntactic structure of a specific database. It describes the
structure of the attributes the tables and foreign key relationships.
d) Decision support systems
45. E-R model uses this symbol to represent weak entity set?
a) Dotted rectangle b) Diamond c) Doubly outlined rectangle d) Square
46. Relational Algebra is
a) Data Definition Language b) Meta Language
c) Procedural query Language d) BASIC
47. What is a relationship called when it is maintained between two entities?
a) Unary b) Binary c) Ternary d)Quaternary
48. The RDBMS terminology for a row is
a) Tuple b) Relation c) Attribute d) Degree
49. CouchDB is ____________________
a) Document-oriented DBMS b) Relational DBMS
c) Compiler d) Interpreter
50. _____________ can be used for batch processing of data and aggregation operations.
a) Hive b) MapReduce c) Oozie d) PASCAL