Online_MBA_AI(OMBAIML 301)
Extract insights from data
Data Analysis
Learning from labeled data
Decision Trees
When the model performs well on training data but poorly on test data
c(1, 2, 3)
read.csv()
data.frame(column1, column2)
install.packages("package_name")
11
To clean the data
Inferential statistics
Values that are not recorded
Mean
selection
conversion
duplicates
True
False
True
2. What challenges might the company face when handling text data?
✅ Ambiguity in language
✅ High dimensionality of text data
3. Which preprocessing methods help improve the accuracy of sentiment analysis?
✅ Lemmatization
✅ Stop word removal
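(A minimal Python sketch of stop word removal, one of the two preprocessing steps chosen above. The stop word list and the sample review are hypothetical and deliberately tiny; a real project would use a curated list.)

```python
# Hypothetical, deliberately small stop word list for illustration only.
STOP_WORDS = {"the", "is", "a", "an", "and", "but", "of", "to", "it", "was"}


def remove_stop_words(text):
    """Lowercase the text, strip basic punctuation, and drop stop words."""
    cleaned = (tok.strip(".,!?") for tok in text.lower().split())
    return [tok for tok in cleaned if tok and tok not in STOP_WORDS]


review = "The delivery was late but the product is great!"
print(remove_stop_words(review))  # ['delivery', 'late', 'product', 'great']
```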
4. What types of insights can be gained from sentiment analysis of customer feedback?
✅ Identification of customer preferences
✅ Understanding product issues
5. Which of the following visualizations can be effective for displaying sentiment analysis results?
✅ Bar charts
✅ Line graphs
6. What is the primary goal of topic modeling in text analysis?
✅ To identify common themes in feedback
✅ To classify text into information
Online_MBA_ML(OMBAIML 302)
1. Which of the following machine learning algorithms can be used for imputing missing values of
both categorical and continuous variables?
✅ KNN
2. Which of the following distance measures do we use for categorical variables in KNN?
✅ Hamming distance
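(A minimal Python sketch of Hamming distance for categorical records: it counts the positions where two records disagree. The customer records are hypothetical.)

```python
def hamming_distance(a, b):
    """Count the attribute positions at which two equal-length records differ."""
    if len(a) != len(b):
        raise ValueError("records must have the same number of attributes")
    return sum(x != y for x, y in zip(a, b))


# Hypothetical categorical records: (region, gender, plan)
customer_1 = ("urban", "female", "premium")
customer_2 = ("urban", "male", "basic")
print(hamming_distance(customer_1, customer_2))  # 2
```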
3. Machine Learning is used to find what?
✅ Hidden patterns in dataset
4. What is Standard Deviation?
✅ A quantity expressing by how much the members of a group differ from the mean value for the
group
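(A minimal Python sketch of this definition using the standard library. The scores are hypothetical; pstdev treats them as a whole population, while stdev would treat them as a sample.)

```python
import statistics

# Hypothetical scores for a group of eight members.
scores = [2, 4, 4, 4, 5, 5, 7, 9]

group_mean = statistics.mean(scores)
# Population standard deviation: square root of the mean squared deviation.
spread = statistics.pstdev(scores)
print(group_mean, spread)
```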
5. Which algorithm is commonly used for classification tasks in machine learning?
✅ Decision trees
6. What is Regression?
✅ A technique to determine the statistical relationship between two or more variables
7. What is the full form of SVR?
✅ Support vector regression
8. Clustering is a technique of which type of machine learning?
✅ Unsupervised
9. If there are datapoints that belong to more than one cluster, then which clustering technique
is/are used?
✅ Fuzzy
10. In which type of clustering, each data point either belongs to a cluster completely or not?
✅ Hard clustering
11. What do you mean by support(A)?
✅ Number of transactions containing A / Total number of transactions
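(A minimal Python sketch of the support formula above. The transactions are hypothetical market baskets.)

```python
def support(itemset, transactions):
    """Fraction of transactions that contain every item in the itemset."""
    if not transactions:
        return 0.0
    wanted = set(itemset)
    hits = sum(1 for basket in transactions if wanted <= set(basket))
    return hits / len(transactions)


# Hypothetical transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]
print(support({"bread"}, transactions))          # 0.75
print(support({"bread", "milk"}, transactions))  # 0.5
```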
12. What is the full form of MDP?
✅ Markov Decision Process
13. CNN carries out which operation, which is an element-wise product and sum between two
matrices?
✅ Convolution
14. Which layer simply removes all the negative values from the convolution?
✅ ReLU
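(A minimal Python sketch of questions 13 and 14: the kernel slides over the image, each output value is an element-wise product and sum, and ReLU then replaces negative values with zero. The image and kernel values are hypothetical, and the kernel is not flipped, which matches common CNN practice.)

```python
def convolve_at(image, kernel, row, col):
    """Element-wise product and sum of the kernel with the patch at (row, col)."""
    return sum(
        image[row + i][col + j] * kernel[i][j]
        for i in range(len(kernel))
        for j in range(len(kernel[0]))
    )


def convolve2d(image, kernel):
    """Slide the kernel over every position where it fits entirely (no padding)."""
    out_rows = len(image) - len(kernel) + 1
    out_cols = len(image[0]) - len(kernel[0]) + 1
    return [
        [convolve_at(image, kernel, r, c) for c in range(out_cols)]
        for r in range(out_rows)
    ]


def relu(feature_map):
    """Replace every negative value with zero."""
    return [[max(0, value) for value in row] for row in feature_map]


# Hypothetical 3x3 image and 2x2 kernel.
image = [[1, 2, 0],
         [0, 1, 3],
         [2, 1, 0]]
kernel = [[1, -1],
          [-1, 1]]

feature_map = convolve2d(image, kernel)
print(feature_map)        # [[0, 4], [-2, -3]]
print(relu(feature_map))  # [[0, 4], [0, 0]]
```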
15. Mobile phone face ID is an example of Deep Learning
✅ True
16. PCA is dimensionality reduction technique
✅ True
17. Random Forest is an example of bagging
✅ True
18. The fundamental unit of Neural Network is ___________
✅ Neuron
19. ______ are best recognized for their ability to find patterns present in images
✅ CNN
20. _______ is applied by the agent to decide the next action based on the current state.
✅ Policy/Strategy
2. Choose applications that utilize machine learning:
✅ Weather forecasting
✅ Language translation
(These are explicitly mentioned in the paragraph as ML applications. Simple calculator apps and
alarm clocks are not.)
3. Identify the fields that are used in machine learning:
✅ Autonomous vehicles
✅ Music composition
(The paragraph discusses autonomous vehicles. Music composition is not explicitly mentioned in the
paragraph, but generative models make it a valid ML application, unlike the other options.)
4. Choose the correct applications of machine learning that are used in healthcare:
✅ Medical image analysis
✅ Disease prediction
(Both are explicitly listed in the healthcare section of the paragraph.)
5. Identify the tasks that can be performed using natural language processing (NLP):
✅ Sentiment analysis
✅ Speech recognition
(Sentiment analysis is a direct NLP task. Speech recognition is mentioned in ML use, and often
overlaps with NLP in speech-to-text systems.)
6. Select all the applications of machine learning in recommendation systems:
✅ Music recommendation
✅ Movie recommendation
Online_MBA_PAP(OMBAIML 303)
1. Identify the correct way to define a set in Python:
✅ set = {1, 2, 3}
2. Identify the correct way to access a value in a dictionary by its key in Python:
✅ dictionary[key]
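(A minimal Python sketch of questions 1 and 2. The variable names and values are hypothetical. The set is named numbers rather than set, because assigning to set would hide the built-in type.)

```python
# A set literal uses braces; duplicates are dropped automatically.
numbers = {1, 2, 3, 3}
print(numbers)  # {1, 2, 3}

# Empty braces create a dict, so an empty set needs set().
empty_set = set()
empty_dict = {}

# Access a dictionary value by key with square brackets.
prices = {"apple": 40, "banana": 10}
print(prices["apple"])       # 40

# prices["cherry"] would raise KeyError; .get() returns None instead.
print(prices.get("cherry"))  # None
```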
3. Determine what kind of statistics the df.describe() function in Pandas provides:
✅ Descriptive
4. Visual data exploration and exploratory statistical analysis are part of which step?
✅ Data Analysis
5. Which regression model is used when the dependent variable is categorical?
✅ Logistic Regression
6. Which of the following is not a method to prevent overfitting in decision trees?
✅ Regularization
7. What task is the SVM algorithm used for?
✅ Classification
8. What do ensemble methods do?
✅ Combine multiple classifiers to improve the performance of a model.
9. The majority voting process belongs to which of the following:
✅ Bagging
10. For multiple trees to be effective, how should the trees be?
✅ Independent
11. What does k-NN stand for?
✅ k-Nearest Neighbor
12. Association Rules are used to determine:
✅ Discovering interesting relationships between variables
13. The support of an itemset in Association Rules relates to:
✅ The proportion of transactions containing the itemset out of all transactions
14. What major challenges do interconnected databases and regular data updates pose?
✅ Negative impact on data quality
15. Decision Trees can handle both numerical and categorical data.
✅ True
16. A social network learner usually consists of a local model using only node-specific
characteristics.
✅ True
17. Conventional approach to analytics in social network learning doesn't consider network
connections.
✅ True
18. Growing a decision tree involves recursively splitting data based on:
✅ Information Gain
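(A minimal Python sketch of information gain, assuming the usual entropy-based definition: parent entropy minus the size-weighted entropy of the child nodes. The labels and splits are hypothetical.)

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(parent, children):
    """Parent entropy minus the size-weighted average entropy of the children."""
    total = len(parent)
    weighted = sum(len(child) / total * entropy(child) for child in children)
    return entropy(parent) - weighted


# Hypothetical labels at a node and two candidate splits.
parent = ["yes", "yes", "no", "no"]
perfect_split = [["yes", "yes"], ["no", "no"]]   # children are pure
useless_split = [["yes", "no"], ["yes", "no"]]   # children mirror the parent

print(information_gain(parent, perfect_split))
print(information_gain(parent, useless_split))
```

The tree-growing step would pick the split with the larger gain, here the perfect split.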
19. A decision tree model with too many levels or branches suffers from:
✅ Overfitting
20. In SVM, the points closest to the hyperplane are called:
✅ Support Vectors
2. Determine the libraries or frameworks in Python that facilitate event-driven programming
✅ Tkinter
✅ asyncio
(❌ Requests and NumPy are not event-driven libraries)
3. Dissect the advantages of event-driven programming in Python:
✅ Responsive and interactive applications
✅ Modular and reusable code
(❌ Sequential execution and database management are not core advantages mentioned)
4. Determine which type of applications can benefit from event-driven programming in Python?
✅ Networking servers
✅ GUI interfaces (implied from the paragraph; choose Networking servers + GUI interfaces if asked)
(Note: Command-line utilities and simulations are not event-driven in nature)
5. Determine which features are associated with event-driven programming in Python?
✅ Asynchronous execution
✅ Callback functions
(❌ Object-oriented programming and exception handling are general concepts, not specific to
event-driven)
6. Evaluate how events are typically triggered in event-driven programming:
✅ Through user actions
✅ Automatically by the operating system
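(A minimal Python sketch tying together the asyncio, callback, and asynchronous execution answers above. EventBus, on_click, and the event names are hypothetical helpers written for illustration; they are not part of asyncio.)

```python
import asyncio


class EventBus:
    """Hypothetical minimal dispatcher: callbacks are registered per event name."""

    def __init__(self):
        self._handlers = {}

    def on(self, event, callback):
        """Register an async callback for the given event name."""
        self._handlers.setdefault(event, []).append(callback)

    async def emit(self, event, payload):
        """Await every callback registered for the event, in registration order."""
        for callback in self._handlers.get(event, []):
            await callback(payload)


received = []


async def on_click(payload):
    await asyncio.sleep(0)  # yield control, as a handler doing real I/O would
    received.append(f"clicked {payload}")


async def main():
    bus = EventBus()
    bus.on("click", on_click)
    await bus.emit("click", "OK button")  # simulates a user action
    await bus.emit("hover", "ignored")    # no handler registered, nothing runs


asyncio.run(main())
print(received)  # ['clicked OK button']
```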
Online_MBA_SQT(OMBAIML 304)
1. Unit of measurement for variance of heights in cm
✅ cm²
2. Type of data for heights of the trees
✅ Continuous
3. Purpose of multiple regression
✅ To predict scores on a dependent variable from scores on multiple independent variables
4. Attribute with two classes is called
✅ Dichotomy
5. Data required for calculation of measures of association
✅ Attributes
6. Type of data for gender of students
✅ Attribute
7. Probability of selecting a king from a deck of 52 cards
✅ 0.076 (4 kings / 52 cards = 1/13 ≈ 0.0769; the listed option truncates this to 0.076)
8. Why saying 10°C is twice as hot as 5°C is incorrect
✅ For interval data, differences make sense, but ratios do not
9. Maximax criterion decision-making environment
✅ Conditions of uncertainty, the decision maker being perfectly optimistic
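(A minimal Python sketch of the maximax rule: for each alternative take its best payoff, then choose the alternative whose best payoff is largest. The alternatives and payoff table are hypothetical.)

```python
# Hypothetical payoff table: alternative -> payoff under each state of nature.
payoffs = {
    "expand": {"boom": 100, "stable": 40, "recession": -60},
    "hold": {"boom": 50, "stable": 30, "recession": 10},
    "shrink": {"boom": 20, "stable": 20, "recession": 15},
}


def maximax(table):
    """Pick the alternative with the largest best-case payoff (pure optimism)."""
    return max(table, key=lambda alternative: max(table[alternative].values()))


print(maximax(payoffs))  # expand
```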
10. Probability of getting a heart from 52 cards
✅ 0.25 (13 hearts / 52 cards = 0.25)
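(A minimal Python check of the two card probabilities in questions 7 and 10, using exact fractions.)

```python
from fractions import Fraction

DECK_SIZE = 52

p_king = Fraction(4, DECK_SIZE)    # 4 kings in the deck
p_heart = Fraction(13, DECK_SIZE)  # 13 hearts in the deck

print(p_king, round(float(p_king), 4))  # 1/13 0.0769
print(p_heart, float(p_heart))          # 1/4 0.25
```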
11. Mean and median for: 1, 7, 3, 3, 6, 4
✅ 4 and 3.5
(Sorted: 1, 3, 3, 4, 6, 7 → mean = 24/6 = 4; median = (3+4)/2 = 3.5)
12. Measure that is most frequent number in dataset
✅ Mode
13. Mean of 6 numbers is 17 → Sum = ?
✅ 102 (Mean × count = 17 × 6 = 102)
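(A minimal Python check of questions 11 to 13 with the standard library, using the data given in the questions.)

```python
import statistics

data = [1, 7, 3, 3, 6, 4]

data_mean = statistics.mean(data)      # 24 / 6
data_median = statistics.median(data)  # average of the two middle values, 3 and 4
data_mode = statistics.mode(data)      # most frequent value

# Question 13: sum = mean * count
total = 17 * 6

print(data_mean, data_median, data_mode, total)
```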
14. Arrival follows
✅ Poisson distribution
15. Programming that allows more than one objective
✅ Goal
16. Transportation problem requires equal
✅ Units supplied; units demanded
17. Standard normal distribution has
✅ Mean equal 0 and variance equal 1
18. Median divides into four equal parts
✅ False
(Median divides into two equal parts; quartiles divide into four)
19. Hungarian Method is faster and efficient
✅ True
20. Pure strategy in two-person, zero-sum game
✅ True
1- A&B
2- B&D
3- B&C
4- A&B
5- C&D
Online_MBA_BI(OMBAIML 305)
1. Where is data stored, retrieved & updated in a data warehouse?
✅ OLAP
2. Which metadata maps core warehouse metadata to business concepts?
✅ Application-level metadata
3. Expansion of OLAP:
✅ Online Analytical Processing
4. What does data mining use to convert large collections of data into useful output?
✅ Algorithms and various techniques
5. Data mining tool based on GUI and uses R stats:
✅ Rattle
6. Primary means of creating a dimension:
✅ Dimension Wizard
7. Processing of data about customers and their relationship with the enterprise:
✅ CRM analytics
8. Performance management tool:
✅ Balanced Scorecard
9. How to remove noise and inconsistency in data:
✅ Data cleaning
10. Style of network communication where client initiates request:
✅ Pull coding
11. Device used to transfer an image in visualization:
✅ Scanner
12. Visualization technique showing customer perception of brands:
✅ Perceptual mapping
13. How to design effective tables and charts for data visualization:
✅ Data-ink ratio
14. Use of BI service:
✅ Flexible self-service analysis
15. What processes data through use of nodes:
✅ Neural networks
16. A calculation in ________ is a bunch of heuristics and estimations:
✅ Data mining
17. The class under study in Data Characterization is known as:
✅ Target
18. To harness a successful BI project, companies must plan:
✅ True
19. A data source may not be the initial location where data is born:
✅ True
20. Successful information mining supports business strategy and operations:
✅ True
2. What is the requirement to use BI software?
✅ for transparent analytics
✅ reporting process
3. Which are data management solutions?
✅ Business intelligence
✅ Business Analytics
These are explicitly mentioned as data management solutions used to collect and analyze data.
4. Which kind of data could be used for analysis?
✅ organized
✅ unstructured
The paragraph says: "The data used for analysis can be organised or unstructured."
5. What is the use of statistics in BI?
✅ to analyze raw information
✅ to deliver insights for future
It says BI uses statistics "to analyze raw information, and deliver insights for making better future
decisions."
6. In which formats does business intelligence present the data?
✅ charts
✅ dashboard
Online_MBA_R programming(OMBAIML 306)
1. What does R stand for in the context of the R programming language?
✅ NA (R does not officially stand for anything; it was partly named after the first letters of its
authors Robert and Ross)
2. Which of the following is a primary use of R?
✅ Statistical analysis
3. What is the command to install a package in R?
✅ install.packages("package_name")
4. Which of the following data structures is NOT available in R?
✅ tree
5. How do you create a vector in R?
✅ c(1,2,3)
6. What function is used to view the structure of a dataset in R?
✅ str()
7. Which of the following is a way to read a CSV file into R?
✅ read.csv("file.csv")
8. In R, what does the function summary() do?
✅ It provides a summary of statistical measures for an object.
9. What is the primary function of the ggplot2 package in R?
✅ data visualization
10. Which of the following is a logical operator in R?
✅ &&
11. What is a matrix in R?
✅ A collection of elements of the same type arranged in a rectangular format
12. How do you create a matrix in R?
✅ matrix(c(1, 2, 3, 4), nrow=2, ncol=2)
13. What function is used to access a specific element in a matrix in R?
✅ [row, column] notation
14. Which function is used to transpose a matrix in R?
✅ t(matrix)
15. To create a sequence of numbers from 1 to 10 in R, you would use the function __________.
✅ seq(1,10)
16. The function __________ is used to read a CSV file into R.
✅ read.csv()
17. In R, the __________ function is used to calculate the mean of a numeric vector.
✅ mean()
18. R packages are collections of R functions, data, and documentation bundled together
✅ True
19. The function install.packages("package_name") is used to load an already installed package
into the R environment.
✅ False
(Correct function: library(package_name))
20. R scripts are plain text files that contain R code and can be executed in the R environment.
✅ True
2. What is the purpose of the JOIN clause in SQL?
✅ To combine rows from two or more tables based on a related column
(None of the other listed options are correct; only one correct option applies here.)
3. Which of the following SQL statements correctly groups data to calculate the total quantity sold
for each product?
✅ SELECT Product, SUM(Quantity) AS TotalQuantity FROM Orders GROUP BY Product;
✅ SELECT Product, SUM(Quantity) AS TotalQuantity FROM Orders ORDER BY TotalQuantity;
(This statement has no GROUP BY clause, so it does not total quantities per product and is invalid
in standard SQL. Pick it only if the question demands a second choice; the first statement is the
correct one.)
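(A minimal Python sketch that runs the GROUP BY statement from question 3 against an in-memory SQLite database. The table name and columns come from the question; the rows are hypothetical, and ORDER BY Product is added only to make the output order predictable.)

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Orders (Product TEXT, Quantity INTEGER)")

# Hypothetical order rows.
conn.executemany(
    "INSERT INTO Orders VALUES (?, ?)",
    [("Pen", 10), ("Book", 3), ("Pen", 5), ("Book", 4)],
)

# GROUP BY collapses the rows to one per product; SUM totals each group.
rows = conn.execute(
    "SELECT Product, SUM(Quantity) AS TotalQuantity "
    "FROM Orders GROUP BY Product ORDER BY Product"
).fetchall()
conn.close()

print(rows)  # [('Book', 7), ('Pen', 15)]
```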
4. Which R function is used to read CSV data into a data frame?
✅ read.csv()
✅ read.csv2() (Another variant that uses semicolon separators common in European data formats)
5. What is the output of the following SQL query?
✅ Total revenue generated per city
(Only one correct answer; the others do not match the query's intent.)
6. Which dplyr functions did Mr. Sinha use in R to summarize data?
✅ group_by()
✅ summarise()