Assignment 3 Solution
Question 1:
Write Python code to read a large dataset and map its textual features to
numerical values. Then apply PCA to reduce the dimensionality of the dataset to 3
principal components and plot the result in 3-D.
Using the Iris dataset
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from mpl_toolkits.mplot3d import Axes3D  # registers the 3-D projection

# Load the Iris dataset
iris = load_iris()
data = iris.data
target = iris.target
feature_names = iris.feature_names

# Convert the features to a DataFrame
df = pd.DataFrame(data, columns=feature_names)

# Apply PCA to reduce dimensionality to 3 components
pca = PCA(n_components=3)
principal_components = pca.fit_transform(df)

# Create a DataFrame with the principal components
df_pca = pd.DataFrame(data=principal_components, columns=['PC1', 'PC2', 'PC3'])

# Plot the 3D graph
fig = plt.figure(figsize=(8, 8))
ax = fig.add_subplot(111, projection='3d')

# Scatter plot, coloured by class
scatter = ax.scatter(df_pca['PC1'], df_pca['PC2'], df_pca['PC3'], c=target, cmap='viridis')

# Legend
legend_labels = [f'Class {i}' for i in range(3)]
ax.legend(handles=scatter.legend_elements()[0], labels=legend_labels)

# Axes labels
ax.set_xlabel('Principal Component 1')
ax.set_ylabel('Principal Component 2')
ax.set_zlabel('Principal Component 3')

# Title
ax.set_title('3D PCA of Iris Dataset')

plt.show()
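A quick follow-up worth adding to the script above (not part of the original listing): pca.explained_variance_ratio_ reports how much of the original variance each principal component retains, which tells you how faithful the 3-D plot is to the full data.

print(pca.explained_variance_ratio_)        # variance share of PC1, PC2, PC3
print(pca.explained_variance_ratio_.sum())  # total retained; a value near 1 means little information was lost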
Using the Wine dataset
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA
from sklearn.datasets import load_wine
from sklearn.preprocessing import StandardScaler
from mpl_toolkits.mplot3d import Axes3D  # registers the 3-D projection

# Load the Wine dataset from scikit-learn
wine = load_wine()
X = pd.DataFrame(data=wine.data, columns=wine.feature_names)

# Standardize the data so each feature has zero mean and unit variance
scaler = StandardScaler()
X_std = scaler.fit_transform(X)

# Apply PCA to reduce dimensionality to 3 components
pca = PCA(n_components=3)
X_pca = pca.fit_transform(X_std)

# Create a 3D scatter plot of the three principal components
fig = plt.figure(figsize=(8, 8))
ax = fig.add_subplot(111, projection='3d')

ax.scatter(X_pca[:, 0], X_pca[:, 1], X_pca[:, 2], c=wine.target, cmap='viridis')

ax.set_xlabel('Principal Component 1')
ax.set_ylabel('Principal Component 2')
ax.set_zlabel('Principal Component 3')
plt.title('PCA of Wine Dataset (3 Components)')

plt.show()
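Note that both Iris and Wine already come with numerical features and integer-coded class labels, so the two scripts above need no text-to-number mapping. For a dataset that does contain textual features, a minimal sketch of that step might look like this (the DataFrame and column names here are hypothetical):

import pandas as pd
from sklearn.preprocessing import LabelEncoder

# Hypothetical raw data with one textual column
df = pd.DataFrame({
    'length': [5.1, 4.9, 6.3, 5.8],
    'color':  ['red', 'blue', 'red', 'green'],  # textual feature to encode
})

# Map each distinct string to an integer code
df['color'] = LabelEncoder().fit_transform(df['color'])
print(df)

# For nominal categories with no natural order, one-hot encoding
# (pd.get_dummies(df, columns=['color'])) avoids implying one.

After this step the DataFrame is fully numerical and can be passed to StandardScaler and PCA exactly as above.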
Question 2:
Apply Bayes' theorem to Covid testing with a total of 10,000 tests. Also calculate
the specificity, sensitivity, prior and posterior probabilities, and the false
positive and false negative rates.
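Before running the numbers in code, it helps to state the theorem itself. With prevalence P(Covid) = 0.01, sensitivity P(Positive | Covid) = 0.95, and false positive rate P(Positive | No Covid) = 0.05, the posterior probability of having Covid given a positive test is:

P(Covid | Positive) = P(Positive | Covid) * P(Covid) / P(Positive)
                    = (0.95 * 0.01) / (0.95 * 0.01 + 0.05 * 0.99)
                    = 0.0095 / 0.0590
                    ≈ 0.1610

The script below scales these (hypothetical) probabilities by the 10,000 tests to obtain the confusion-matrix counts, then derives the requested metrics from them.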
# Hypothetical values
P_A = 0.01              # Prevalence (1%)
P_B_given_A = 0.95      # Sensitivity (95%)
P_B_given_not_A = 0.05  # False positive rate (5%)

# Total number of tests
total_tests = 10000

# Calculate probabilities
P_not_A = 1 - P_A  # Complement of P(A)

# Calculate true positives, false positives, true negatives, and false negatives
true_positives = P_B_given_A * P_A * total_tests
false_positives = P_B_given_not_A * P_not_A * total_tests
true_negatives = (1 - P_B_given_not_A) * P_not_A * total_tests
false_negatives = (1 - P_B_given_A) * P_A * total_tests

# Calculate performance measures
accuracy = (true_positives + true_negatives) / total_tests
precision = true_positives / (true_positives + false_positives)
recall = true_positives / (true_positives + false_negatives)
f1_score = 2 * (precision * recall) / (precision + recall)

# Calculate additional metrics
specificity = true_negatives / (true_negatives + false_positives)
sensitivity = recall
prior_probability = P_A
# Posterior P(Covid | Positive) by Bayes' theorem; this equals the precision (PPV)
posterior_probability = true_positives / (true_positives + false_positives)
false_positive_rate = false_positives / (false_positives + true_negatives)
false_negative_rate = false_negatives / (false_negatives + true_positives)

# Print the results
print(f"True Positives: {true_positives:.0f}")
print(f"False Positives: {false_positives:.0f}")
print(f"True Negatives: {true_negatives:.0f}")
print(f"False Negatives: {false_negatives:.0f}\n")

print(f"Accuracy: {accuracy:.4f}")
print(f"Precision: {precision:.4f}")
print(f"Recall (Sensitivity): {recall:.4f}")
print(f"F1 Score: {f1_score:.4f}")
print(f"Specificity: {specificity:.4f}")
print(f"Sensitivity: {sensitivity:.4f}")
print(f"Prior Probability: {prior_probability:.4f}")
print(f"Posterior Probability: {posterior_probability:.4f}")
print(f"False Positive Rate: {false_positive_rate:.4f}")
print(f"False Negative Rate: {false_negative_rate:.4f}")
True Positives: 95
False Positives: 495
True Negatives: 9405
False Negatives: 5
Accuracy: 0.9500
Precision: 0.1610
Recall (Sensitivity): 0.9500
F1 Score: 0.2754
Specificity: 0.9500
Sensitivity: 0.9500
Prior Probability: 0.0100
Posterior Probability: 0.1610
False Positive Rate: 0.0500
False Negative Rate: 0.0500
Question 3:
Solution 1: Generate a random dataset of true and predicted labels, then calculate,
interpret, and print the performance measures.
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score, confusion_matrix

# Generate a random dataset of true and predicted labels
np.random.seed(42)

# Number of samples
num_samples = 1000

# True labels (0: Negative, 1: Positive)
true_labels = np.random.randint(2, size=num_samples)

# Simulate a classifier's predicted labels with some errors:
# force a random 20% of predictions to 1 (introducing false positives)
# and a random 10% to 0 (introducing false negatives)
predicted_labels = true_labels.copy()
predicted_labels[np.random.choice(num_samples, size=int(0.2 * num_samples))] = 1
predicted_labels[np.random.choice(num_samples, size=int(0.1 * num_samples))] = 0

# Calculate performance measures
accuracy = accuracy_score(true_labels, predicted_labels)
precision = precision_score(true_labels, predicted_labels)
recall = recall_score(true_labels, predicted_labels)
f1 = f1_score(true_labels, predicted_labels)
conf_matrix = confusion_matrix(true_labels, predicted_labels)

# Print the results
# print(f"True Labels: {true_labels}")
# print(f"Predicted Labels: {predicted_labels}\n")

print(f"Confusion Matrix:\n{conf_matrix}\n")

print(f"Accuracy: {accuracy:.4f}")
print(f"Precision: {precision:.4f}")
print(f"Recall: {recall:.4f}")
print(f"F1 Score: {f1:.4f}")
Confusion Matrix:
[[398  92]
 [ 54 456]]
Accuracy: 0.8540
Precision: 0.8321
Recall: 0.8941
F1 Score: 0.8620
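Interpretation: of the 1000 samples, 398 true negatives and 456 true positives are classified correctly, giving the accuracy of 0.8540. The precision of 0.8321 says that about 83% of the samples predicted positive really are positive (the 92 false positives pull it down), while the recall of 0.8941 says that about 89% of the actual positives were found (54 were missed). The F1 score of 0.8620 is the harmonic mean of precision and recall, summarizing the trade-off between the two error types in a single number.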
Solution 2: Take a machine-learning classification example and generate all the
performance measures to show their significance and how they relate to one another.
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix, classification_report)
from sklearn.datasets import load_iris

# Load the Iris dataset
iris = load_iris()
X = iris.data[:, :2]  # Using only sepal length and width for simplicity
y = iris.target

# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Initialize the SVM classifier
svm_classifier = SVC(kernel='linear', C=1)

# Train the classifier
svm_classifier.fit(X_train, y_train)

# Make predictions on the test set
y_pred = svm_classifier.predict(X_test)

# Calculate performance measures (weighted averages account for class support)
accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred, average='weighted')
recall = recall_score(y_test, y_pred, average='weighted')
f1 = f1_score(y_test, y_pred, average='weighted')
conf_matrix = confusion_matrix(y_test, y_pred)

# Print the results
print(f"Accuracy: {accuracy:.4f}")
print(f"Precision: {precision:.4f}")
print(f"Recall: {recall:.4f}")
print(f"F1 Score: {f1:.4f}")
print(f"Confusion Matrix:\n{conf_matrix}")
print("\nClassification Report:")
print(classification_report(y_test, y_pred, target_names=iris.target_names))
Output:
Accuracy: 0.9000
Precision: 0.9014
Recall: 0.9000
F1 Score: 0.8992
Confusion Matrix:
[[10  0  0]
 [ 0  7  2]
 [ 0  1 10]]
Classification Report:
              precision    recall  f1-score   support

      setosa       1.00      1.00      1.00        10
  versicolor       0.88      0.78      0.82         9
   virginica       0.83      0.91      0.87        11

    accuracy                           0.90        30
   macro avg       0.90      0.90      0.90        30
weighted avg       0.90      0.90      0.90        30
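The report makes the relationships between the measures explicit: each class's f1-score is the harmonic mean of its precision and recall (for versicolor, 2 * 0.88 * 0.78 / (0.88 + 0.78) ≈ 0.82); the macro avg is the plain mean over the three classes; and the weighted avg weights each class by its support, e.g. weighted precision = (1.00*10 + 0.88*9 + 0.83*11) / 30 ≈ 0.90. Weighted recall always equals the accuracy (both count the fraction of correctly classified samples), which is why both read 0.90 here.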