0% found this document useful (0 votes)

20 views2 pages

ML Assignment 9

The document outlines a process for applying Principal Component Analysis (PCA) to the heart_disease dataset for binary classification using logistic regression. It includes steps for data loading, preprocessing, PCA transformation, and model training and evaluation, achieving an accuracy of approximately 85.25%. The code snippets provided guide the user through each stage of the analysis.

Uploaded by

anuj rawat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views2 pages

ML Assignment 9

Uploaded by

anuj rawat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

9w3itlede

January 3, 2025

0.1 Apply PCA on heart_disease.csv for implementing binary classification.

Please refer to the meta data of heart_disease data before implementation.
[1]: import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

# Load the dataset

url = "https://itv-contentbucket.s3.ap-south-1.amazonaws.com/Exams/ML/PCA/
↪heart_disease.csv"

data = pd.read_csv(url)

# Display the first few rows

print(data.head())

age sex cp trestbps chol fbs restecg thalach exang oldpeak slope \
0 63 1 3 145 233 1 0 150 0 2.3 0
1 37 1 2 130 250 0 1 187 0 3.5 0
2 41 0 1 130 204 0 0 172 0 1.4 2
3 56 1 1 120 236 0 1 178 0 0.8 2
4 57 0 0 120 354 0 1 163 1 0.6 2

ca thal target
0 0 1 1
1 0 2 1
2 0 2 1
3 0 2 1
4 0 2 1

[2]: # Check for missing values

print(data.isnull().sum())

# Drop or fill missing values as required

data = data.dropna() # Example of dropping missing values

1
age 0
sex 0
cp 0
trestbps 0
chol 0
fbs 0
restecg 0
thalach 0
exang 0
oldpeak 0
slope 0
ca 0
thal 0
target 0
dtype: int64

[3]: # Example assuming 'target' is the target column based on typical naming
X = data.drop('target', axis=1) # Replace 'target' with the actual target␣
↪column name

y = data['target'] # Replace 'target' with the actual target column name

[4]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,␣

↪random_state=42)

[5]: scaler = StandardScaler()

X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

[6]: # Choose the number of principal components to keep (e.g., 2 components)

pca = PCA(n_components=2)
X_train_pca = pca.fit_transform(X_train)
X_test_pca = pca.transform(X_test)

print(f'Explained Variance Ratio: {pca.explained_variance_ratio_}')

Explained Variance Ratio: [0.2072575 0.12434085]

[7]: model = LogisticRegression()

model.fit(X_train_pca, y_train)

[7]: LogisticRegression()

[8]: y_pred = model.predict(X_test_pca)

accuracy = accuracy_score(y_test, y_pred)
print(f'Accuracy: {accuracy}')

Accuracy: 0.8524590163934426

Application of Statistics in The Business Research PDF
87% (23)
Application of Statistics in The Business Research PDF
5 pages
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
No ratings yet
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
15 pages
COMP5318
No ratings yet
COMP5318
42 pages
AI Mini Project
No ratings yet
AI Mini Project
6 pages
AIML Practical 05 22105A2021
No ratings yet
AIML Practical 05 22105A2021
9 pages
Heart Attack Prediction
No ratings yet
Heart Attack Prediction
6 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
Ide To 6 Classification Algorithms
No ratings yet
Ide To 6 Classification Algorithms
34 pages
Heart - Disease - 1.ipynb - Colaboratory
No ratings yet
Heart - Disease - 1.ipynb - Colaboratory
9 pages
Heart Disease Classification ML Assignment - Jupyter Notebook
No ratings yet
Heart Disease Classification ML Assignment - Jupyter Notebook
7 pages
Bayesian Network Notes
No ratings yet
Bayesian Network Notes
4 pages
Lab Report Content - 15marks
No ratings yet
Lab Report Content - 15marks
10 pages
ML LAB - Principal Component Analysis
No ratings yet
ML LAB - Principal Component Analysis
3 pages
ML Practicals
No ratings yet
ML Practicals
21 pages
Ex 12
No ratings yet
Ex 12
4 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
Heart Disease Prediction Using ML
No ratings yet
Heart Disease Prediction Using ML
16 pages
Ex No 4
No ratings yet
Ex No 4
3 pages
HEART
No ratings yet
HEART
15 pages
Anoosha ML Lab02
No ratings yet
Anoosha ML Lab02
5 pages
Prediction - Ipynb - Colab
No ratings yet
Prediction - Ipynb - Colab
7 pages
Samplecode (HDPS)
No ratings yet
Samplecode (HDPS)
29 pages
MLT Lab 07
No ratings yet
MLT Lab 07
4 pages
Heart Disease Report With Comments and Code
No ratings yet
Heart Disease Report With Comments and Code
9 pages
Heart Disease Prediction Guide
100% (1)
Heart Disease Prediction Guide
73 pages
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
No ratings yet
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
71 pages
Heart Disease Report
No ratings yet
Heart Disease Report
8 pages
Ex 8
No ratings yet
Ex 8
2 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Heart - Cleveland - Ipynb - Colab
No ratings yet
Heart - Cleveland - Ipynb - Colab
5 pages
C2 W4 Lab 02 Tree Ensemble
No ratings yet
C2 W4 Lab 02 Tree Ensemble
16 pages
Heart Disease Prediction Analysis
No ratings yet
Heart Disease Prediction Analysis
10 pages
A.I Lab Report
No ratings yet
A.I Lab Report
24 pages
Heart Attack
No ratings yet
Heart Attack
18 pages
Exp Number 13 LM
No ratings yet
Exp Number 13 LM
1 page
Implementing PCA in Python With Scikit
No ratings yet
Implementing PCA in Python With Scikit
6 pages
IR Final LabManual
No ratings yet
IR Final LabManual
18 pages
Heart Failure Prediction EDA & Modeling
No ratings yet
Heart Failure Prediction EDA & Modeling
38 pages
C2 W4 Lab 02 Tree Ensemble
No ratings yet
C2 W4 Lab 02 Tree Ensemble
10 pages
Bayesian Networks for Python Users
No ratings yet
Bayesian Networks for Python Users
2 pages
Python Cod1
No ratings yet
Python Cod1
3 pages
Data Set Preperation
No ratings yet
Data Set Preperation
7 pages
ML Lab Program - VTU
No ratings yet
ML Lab Program - VTU
5 pages
Lab 2
No ratings yet
Lab 2
8 pages
AI 28-01-2025 - Classification
No ratings yet
AI 28-01-2025 - Classification
4 pages
Program 9-Bayesian Network Inference
No ratings yet
Program 9-Bayesian Network Inference
1 page
Lab Task - 13 - 224g1a0528
No ratings yet
Lab Task - 13 - 224g1a0528
3 pages
Medical Bayesian Network Analysis
No ratings yet
Medical Bayesian Network Analysis
8 pages
Naive Bayes Ve SVM Alqoritimleri
No ratings yet
Naive Bayes Ve SVM Alqoritimleri
2 pages
4-10 Aiml
No ratings yet
4-10 Aiml
25 pages
Heart Disease Classification Using Ann Hands-On
No ratings yet
Heart Disease Classification Using Ann Hands-On
7 pages
Heart Disease Classification Project
No ratings yet
Heart Disease Classification Project
3 pages
IR 3 Bayesian Network Heart
No ratings yet
IR 3 Bayesian Network Heart
2 pages
Aih Exp 2
No ratings yet
Aih Exp 2
8 pages
Program 7
100% (1)
Program 7
4 pages
Inbound 3085046103164618170
No ratings yet
Inbound 3085046103164618170
2 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Web Application
No ratings yet
Web Application
13 pages
ML Exp 4,5
No ratings yet
ML Exp 4,5
7 pages
Discrete Random
No ratings yet
Discrete Random
57 pages
Jaipur Jewellery Consumer Insights
100% (1)
Jaipur Jewellery Consumer Insights
27 pages
Chapter 4 Market Analysis
No ratings yet
Chapter 4 Market Analysis
41 pages
100 MCQs For Research Methodology
No ratings yet
100 MCQs For Research Methodology
10 pages
Digital Escape Rooms Boost Student Motivation
No ratings yet
Digital Escape Rooms Boost Student Motivation
14 pages
The Welfare State As Piggy Bank Information Risk Uncertainty and The Role of The State 1st Edition Barr Download
100% (3)
The Welfare State As Piggy Bank Information Risk Uncertainty and The Role of The State 1st Edition Barr Download
127 pages
Final Exams Schedule F2024
No ratings yet
Final Exams Schedule F2024
3 pages
Recovery Road Christine Feehan Instant Download
No ratings yet
Recovery Road Christine Feehan Instant Download
150 pages
Smith's Patient Centered Interviewing: An Evidence-Based Method 4th Edition Edition Auguste H. Fortin PDF Download
100% (1)
Smith's Patient Centered Interviewing: An Evidence-Based Method 4th Edition Edition Auguste H. Fortin PDF Download
104 pages
Sequential Analysis Hypothesis Testing and Changepoint Detection (Etc.) (Z-Library)
No ratings yet
Sequential Analysis Hypothesis Testing and Changepoint Detection (Etc.) (Z-Library)
600 pages
NCR Final Shs Pr2 q1 m1 1
No ratings yet
NCR Final Shs Pr2 q1 m1 1
30 pages
Bank Employee Turnover Factors
100% (1)
Bank Employee Turnover Factors
5 pages
The Ruble: A Political History Ekaterina Pravilova Instant Download
No ratings yet
The Ruble: A Political History Ekaterina Pravilova Instant Download
152 pages
Activity 2 - Sampling and Sources of Data
No ratings yet
Activity 2 - Sampling and Sources of Data
1 page
Prelim Exam 2nd Sem For Students
No ratings yet
Prelim Exam 2nd Sem For Students
4 pages
CHAPTER 1 Cute 5
No ratings yet
CHAPTER 1 Cute 5
19 pages
Statistics & Probability Assignment
No ratings yet
Statistics & Probability Assignment
5 pages
A Pragmatic Analysis of Illocutionary Act in Home Alone 3 Movie
No ratings yet
A Pragmatic Analysis of Illocutionary Act in Home Alone 3 Movie
9 pages
Literature Review of Tuition Impact On Learning of Students
50% (4)
Literature Review of Tuition Impact On Learning of Students
33 pages
Chapter 4 (Hypothesis Testing)
No ratings yet
Chapter 4 (Hypothesis Testing)
20 pages
Desi Wahyuni
No ratings yet
Desi Wahyuni
3 pages
Multilingualism's Impact on EFL Success
No ratings yet
Multilingualism's Impact on EFL Success
21 pages
SC - MATH 3132 Course Outline
No ratings yet
SC - MATH 3132 Course Outline
3 pages
Sample Global Smart Luggage System Market Research Report 2024-2031
No ratings yet
Sample Global Smart Luggage System Market Research Report 2024-2031
51 pages
Multinomial Likelihood & PCA Analysis
No ratings yet
Multinomial Likelihood & PCA Analysis
2 pages
Martín Albo, J., Núñez, J., Navarro, J., & Grijalvo, F. (2007)
No ratings yet
Martín Albo, J., Núñez, J., Navarro, J., & Grijalvo, F. (2007)
11 pages
Audit Sampling Essentials
No ratings yet
Audit Sampling Essentials
3 pages
Chapter13 Slides
No ratings yet
Chapter13 Slides
24 pages
Polymer Color Optimization Study
No ratings yet
Polymer Color Optimization Study
7 pages

ML Assignment 9

Uploaded by

ML Assignment 9

Uploaded by

9w3itlede

0.1 Apply PCA on heart_disease.csv for implementing binary classification.

# Load the dataset

# Display the first few rows

[2]: # Check for missing values

# Drop or fill missing values as required

y = data['target'] # Replace 'target' with the actual target column name

[4]: X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,␣

[5]: scaler = StandardScaler()

[6]: # Choose the number of principal components to keep (e.g., 2 components)

print(f'Explained Variance Ratio: {pca.explained_variance_ratio_}')

Explained Variance Ratio: [0.2072575 0.12434085]

[7]: model = LogisticRegression()

[8]: y_pred = model.predict(X_test_pca)

You might also like