0% found this document useful (0 votes)

70 views5 pages

CSE AIML Flood Prediction Guide

Uploaded by

Arijeet ros

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views5 pages

CSE AIML Flood Prediction Guide

Uploaded by

Arijeet ros

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

GIET UNIVERSITY, GUNUPUR

SCHOOL OF ENGINEERING AND TECHNOLOGY

DEPARTMENT OF CSE (AIML)

Step 1: Import Python Libraries:-

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn import model_selection
from sklearn.linear_model import LogisticRegression

from sklearn.model_selection import train_test_split

Step 2: Read the Dataset:-

df= pd.read_csv('kerala.csv')
df.head(5)

Step 3: Explore the Dataset:-

1)df.info()

2)df.shape

3)df.describe()

4)df.corr()

Replace:- In order to train this Python model, we need the values of our target
output to be 0 & 1. So, we'll replace values in the Floods column (YES, NO)
with (1, 0) respectively
df['FLOODS'].replace(['YES', 'NO'], [1,0], inplace=True)
df.head(5)

null values:- To find the null values In the dataset

df.isnull().mean().sort_values(ascending=False) * 100

corr:- To identifying the correlation between the data points using heat map

NAME OF THE STUDENT: Arijeet Mishra ROLL NO – 21CSEAIML008

PAGE NO: 01
GIET UNIVERSITY, GUNUPUR
SCHOOL OF ENGINEERING AND TECHNOLOGY
DEPARTMENT OF CSE (AIML)

corr df.corr()

sns.heatmap(corr, xticklabels corr.columns, yticklabels

corr.columns)

Step 3: Feature Selection:-

Start by importing the Select Best library:

from sklearn.feature_selection import SelectKBest

from sklearn.feature_selection import chi2

After, define X & Y:-

X= df.iloc[:,1:14] //for all features

Y= df.iloc[:,-1] //for target output (floods)

Select the top 3 features:-

best_features= SelectKBest(score_func=chi2, k=3)

fit= best_features.fit(X,Y)

Now we create data frames for the features and the score of each
feature:

df_scores= pd.DataFrame(fit.scores_)
df_columns= pd.DataFrame(X.columns)

Finally, we’ll combine all the features and their corresponding scores in
one data frame:

features_scores= pd.concat([df_columns, df_scores], axis=1)

features_scores.columns= ['Features', 'Score']
features_scores.sort_values(by = 'Score')

Step 4: Build the Model:-

X= df[['SEP', 'JUN', 'JUL']] the top 3 features
Y= df[['FLOODS']] the target output

Splitting the dataset into train and test:-

X_train,X_test,y_train,y_test=train_test_split(X,Y,test_siz
e=0.4,random_state=100)

NAME OF THE STUDENT: Arijeet Mishra ROLL NO – 21CSEAIML008

PAGE NO: 02
GIET UNIVERSITY, GUNUPUR
SCHOOL OF ENGINEERING AND TECHNOLOGY
DEPARTMENT OF CSE (AIML)

Create a logistic regression body:-

logreg= LogisticRegression()
logreg.fit(X_train,y_train)

we predict the likelihood of a flood using the logistic regression body we

created:-
y_pred=logreg.predict(X_test)
print (X_test) #test dataset
print (y_pred) #predicted values

Step 5: Evaluate the Model’s Performance:-

• 5.1:- Mean Absolute Error(MAE):- MAE is a straightforward metric that calculates

the absolute difference between actual and predicted values. The degree of errors for
predictions and observations is measured using the average absolute errors for the
entire group.

from sklearn.metrics import mean absolute_error

print("MAE", mean_absolute_error(y_test,y_pred)

• 5.2:- Mean Squared Error(MSE)

MSE is a popular and straightforward statistic with a bit of variation in mean

absolute error. The squared difference between the actual and anticipated values
is calculated using mean squared error.

from sklearn.metrics import mean_squared_error

print("MSE", mean_squared_error(y_test,y_pred)

• 5.3:-Root Mean Squared Error(RMSE)

As the term, RMSE implies that it is a straightforward square root of mean

squared error.

NAME OF THE STUDENT: Arijeet Mishra ROLL NO – 21cseaiml008

PAGE NO: 03
GIET UNIVERSITY, GUNUPUR
SCHOOL OF ENGINEERING AND TECHNOLOGY
DEPARTMENT OF CSE (AIML)

• R Squared (R2)

The R2 score, also called the coefficient of determination, is one of the

performance evaluation measures for the regression-based machine learning
model. Simply put, it measures how close the target data points are to the fitted
line. As we have shown, MAE and MSE are context-dependent, but the R2 score
is context neutral. So, with the help of R squared, we have a baseline model to
compare to a model that none of the other metrics give

from sklearn.metrics import r2_score

r2 = r2_score(y_test, y_pred)

print(r2)

Classification Report:-
A classification report is a performance evaluation report that is used
to evaluate the performance of machine learning models by the
following 5 criteria:

• Accuracy is a score used to evaluate the model’s performance. The

higher it is, the better.
• Recall measures the model’s ability to correctly predict the true
positive values.
• Precision is the ratio of true positives to the sum of both true and
false positives.
• F-score combines precision and recall into one metric. Ideally, its
value should be closest to 1, the better.
• Support is the number of actual occurrences of each class in the
dataset.

NAME OF THE STUDENT: Arijeet Mishra Roll no – 21cseaiml008

PAGE NO: 04
GIET UNIVERSITY, GUNUPUR
SCHOOL OF ENGINEERING AND TECHNOLOGY
DEPARTMENT OF CSE (AIML)

from sklearn import metrics

from sklearn.metrics import classification_report
print(‘Accuracy: ‘,metrics.accuracy_score(y_test, y_pred))
print(‘Recall: ‘,metrics.recall_score(y_test, y_pred,
zero_division=1))
print(“Precision:”,metrics.precision_score(y_test, y_pred,
zero_division=1))
print(“CL Report:”,metrics.classification_report(y_test,
y_pred, zero_division=1))

ROC Curve:-

The receiver operating characteristic (ROC) curve is used to display the

sensitivity and specificity of the logistic regression model by calculating the true
positive and false positive rates.

From the ROC curve, we can calculate the area under the curve (AUC) whose
value ranges from 0 to 1. You’ll remember that the closer to 1, the better it is for
our predictive modeling.

• To determine the ROC curve, first define the metrics:-

y_pred_proba= logreg.predict_proba(X_test) [::,1]

• Then, calculate the true positive and false positive rates:-

false_positive_rate, true_positive_rate, _ =
metrics.roc_curve(y_test, y_pred_proba)

• Next, calculate the AUC to see the model's performance:-

auc= metrics.roc_auc_score(y_test, y_pred_proba)

• Finally, plot the ROC curve:-

plt.plot(false_positive_rate,
true_positive_rate,label="AUC="+str(auc))
plt.title('ROC Curve')
plt.ylabel('True Positive Rate')
plt.xlabel('false Positive Rate')
plt.legend(loc=4)

NAME OF THE STUDENT: Arijeet Mishra roll no – 21AIML008

PAGE NO: 05

Chapter 10 Logistic Reg - Week 07 - 01
No ratings yet
Chapter 10 Logistic Reg - Week 07 - 01
31 pages
Admission Prediction Guide
No ratings yet
Admission Prediction Guide
13 pages
Parth ML
No ratings yet
Parth ML
24 pages
Da - Week 9
No ratings yet
Da - Week 9
20 pages
(Feature Engineering) (Extended-Cheatsheet)
100% (1)
(Feature Engineering) (Extended-Cheatsheet)
9 pages
DR T V V Pavan Kumar - Assign - 2
No ratings yet
DR T V V Pavan Kumar - Assign - 2
5 pages
Kartik MLP 4-9prg
No ratings yet
Kartik MLP 4-9prg
10 pages
Regression Analysis - Cheatsheet
No ratings yet
Regression Analysis - Cheatsheet
9 pages
Unit 2 Supervised Learning
No ratings yet
Unit 2 Supervised Learning
20 pages
MLA Manual
No ratings yet
MLA Manual
25 pages
Cheat Sheet Linear and Logistic Regression
No ratings yet
Cheat Sheet Linear and Logistic Regression
2 pages
23BCE7199 ML Lab Assignment
No ratings yet
23BCE7199 ML Lab Assignment
15 pages
ML Combined
No ratings yet
ML Combined
254 pages
Supervised Learning
100% (1)
Supervised Learning
15 pages
Machine Learning Assignment-2
No ratings yet
Machine Learning Assignment-2
7 pages
23BCE7092 ML Lab Assignment
No ratings yet
23BCE7092 ML Lab Assignment
14 pages
22K61A0654 2 Sasi Auto
No ratings yet
22K61A0654 2 Sasi Auto
24 pages
Assignment 1:: Intro To Machine Learning
No ratings yet
Assignment 1:: Intro To Machine Learning
6 pages
Machine Learning Strategies
No ratings yet
Machine Learning Strategies
59 pages
Data Analytics Program
No ratings yet
Data Analytics Program
11 pages
Michal Kosinski - Private Traits and Attributes Are Predictable From Digital Records of Human Behavior PDF
No ratings yet
Michal Kosinski - Private Traits and Attributes Are Predictable From Digital Records of Human Behavior PDF
4 pages
TP - Ipynb - Colab
No ratings yet
TP - Ipynb - Colab
6 pages
Ritesh Mangla ML PracticalFile
No ratings yet
Ritesh Mangla ML PracticalFile
55 pages
ML Lab
No ratings yet
ML Lab
29 pages
Lecture Material 11
No ratings yet
Lecture Material 11
14 pages
B-56 Sanket Jambhulkar MLA-3
No ratings yet
B-56 Sanket Jambhulkar MLA-3
7 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Sla4a 21im30005
No ratings yet
Sla4a 21im30005
11 pages
Aychew Chernet
No ratings yet
Aychew Chernet
8 pages
Weather Data ML Model Guide
No ratings yet
Weather Data ML Model Guide
4 pages
Data-Analytics-Manual Lab G.anill Kumar
No ratings yet
Data-Analytics-Manual Lab G.anill Kumar
23 pages
Da 012307
No ratings yet
Da 012307
8 pages
MC4301 - ML Unit 2 (Model Evaluation and Feature Engineering)
No ratings yet
MC4301 - ML Unit 2 (Model Evaluation and Feature Engineering)
40 pages
Train
No ratings yet
Train
17 pages
Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
Data Mining with Python Lab Guide
No ratings yet
Data Mining with Python Lab Guide
39 pages
St. John College of Engineering and Management, Palghar - Maharashtra
No ratings yet
St. John College of Engineering and Management, Palghar - Maharashtra
11 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
122 pages
Rain in Australia Logistic Regression Classifier
No ratings yet
Rain in Australia Logistic Regression Classifier
10 pages
21CSC305P ML - Lab Programs 1 - 9
No ratings yet
21CSC305P ML - Lab Programs 1 - 9
36 pages
DA Programs
No ratings yet
DA Programs
44 pages
CH 05 PPTaccessible
No ratings yet
CH 05 PPTaccessible
60 pages
Binary Classifier Evaluation Guide
No ratings yet
Binary Classifier Evaluation Guide
12 pages
ML in Python Part-2
No ratings yet
ML in Python Part-2
21 pages
DataAnalytics Lab Manual
No ratings yet
DataAnalytics Lab Manual
35 pages
Machine Learning Basics 1683717543
No ratings yet
Machine Learning Basics 1683717543
15 pages
Project Paarth
No ratings yet
Project Paarth
21 pages
DSBDA Practicals
No ratings yet
DSBDA Practicals
16 pages
ML Manual Final
No ratings yet
ML Manual Final
35 pages
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
No ratings yet
Import Pandas As PD DF PD - Read - CSV ("Titanic - Train - CSV") DF - Head
20 pages
TYCS Practical
No ratings yet
TYCS Practical
26 pages
AIML Project
No ratings yet
AIML Project
4 pages
Machine Learning Lab Assignment 1
No ratings yet
Machine Learning Lab Assignment 1
23 pages
ML Lab Manual
No ratings yet
ML Lab Manual
36 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
Data Analysis for Beginners
No ratings yet
Data Analysis for Beginners
8 pages
The Biometric Computing Recognition and Registration 1st Edition Karm Veer Arya (Editor) Instant Download
100% (3)
The Biometric Computing Recognition and Registration 1st Edition Karm Veer Arya (Editor) Instant Download
66 pages
Rainfall Prediction Using Machine Learning
No ratings yet
Rainfall Prediction Using Machine Learning
9 pages
Good Practices in Visual Inspection - Drury
No ratings yet
Good Practices in Visual Inspection - Drury
85 pages
Logistic Regression Tutorial Python
No ratings yet
Logistic Regression Tutorial Python
30 pages
Unsupervised Anomaly-Based Malware Detection Using Hardware Features
No ratings yet
Unsupervised Anomaly-Based Malware Detection Using Hardware Features
12 pages
Fundamentals of Machine Learning With QA
No ratings yet
Fundamentals of Machine Learning With QA
41 pages
Capstone Project - Credit Risk Analysis
67% (6)
Capstone Project - Credit Risk Analysis
50 pages
Detection and Prediction of Rice Leaf Disease Using A Hybrid CNN-SVM Model
No ratings yet
Detection and Prediction of Rice Leaf Disease Using A Hybrid CNN-SVM Model
19 pages
Injury Severity Scores
No ratings yet
Injury Severity Scores
32 pages
Pandas: Reference Sheet
No ratings yet
Pandas: Reference Sheet
9 pages
FakeAVCeleb A Novel Audio-Video Multimodal DeepFake Dataset
No ratings yet
FakeAVCeleb A Novel Audio-Video Multimodal DeepFake Dataset
22 pages
Hybrid Deep Learning for DDoS in SDN
No ratings yet
Hybrid Deep Learning for DDoS in SDN
17 pages
Dyslexia and WISC-III
No ratings yet
Dyslexia and WISC-III
34 pages
Zerox Ready
No ratings yet
Zerox Ready
21 pages
Heart Rate Analysis (R)
No ratings yet
Heart Rate Analysis (R)
72 pages
Journal of Clinical and Experimental Neuropsychology: Merve Çebi, Gülsen Babacan, Öget Öktem Tanör & Hakan Gürvit
No ratings yet
Journal of Clinical and Experimental Neuropsychology: Merve Çebi, Gülsen Babacan, Öget Öktem Tanör & Hakan Gürvit
10 pages
Final Assignment - Research Proposal by Roquia Salam
No ratings yet
Final Assignment - Research Proposal by Roquia Salam
8 pages
A Deep Learning Ensemble Approach For Diabetic Ret
No ratings yet
A Deep Learning Ensemble Approach For Diabetic Ret
10 pages
Name:Fedrick Samuel W Reg No: 19MIS1112 Course: Machine Learning (SWE4012) Slot: L11 + L12 Faculty: Dr.M. Premalatha
No ratings yet
Name:Fedrick Samuel W Reg No: 19MIS1112 Course: Machine Learning (SWE4012) Slot: L11 + L12 Faculty: Dr.M. Premalatha
30 pages
6-Human-Related Anomaly Detection in Surveillance Videos
No ratings yet
6-Human-Related Anomaly Detection in Surveillance Videos
10 pages
Enhancing Brain Tumor Detection in MRI Images Through Explainable AI Using Grad-CAM With Resnet 50
No ratings yet
Enhancing Brain Tumor Detection in MRI Images Through Explainable AI Using Grad-CAM With Resnet 50
19 pages
VGG 16
No ratings yet
VGG 16
18 pages
Assessing The Prognostic Performance of The Child-Pugh, Model For End-Stage Liver Disease and ALBI Score in Npatients With DeCi
No ratings yet
Assessing The Prognostic Performance of The Child-Pugh, Model For End-Stage Liver Disease and ALBI Score in Npatients With DeCi
9 pages
Enhancing Pseudarthrosis Diagnosis Dynamic Radiographs After Cervical Fusion With Stand Alone Intervertebral Cage
No ratings yet
Enhancing Pseudarthrosis Diagnosis Dynamic Radiographs After Cervical Fusion With Stand Alone Intervertebral Cage
12 pages
Detecting Bitcoin Ponzi Schemes
No ratings yet
Detecting Bitcoin Ponzi Schemes
10 pages
Title Auhtor Journal Publisher Problem Argument Method Findings & Interpretation Year
No ratings yet
Title Auhtor Journal Publisher Problem Argument Method Findings & Interpretation Year
2 pages
Urkund Report - 15 - Priyanshu - Roul - SIRP - PDF (D76571759)
No ratings yet
Urkund Report - 15 - Priyanshu - Roul - SIRP - PDF (D76571759)
11 pages
Is The New Injury Severity Score (NISS) A
No ratings yet
Is The New Injury Severity Score (NISS) A
7 pages
Anomalous Motion Detection On Highway Using Deep Learning: Harpreet Singh Emily M. Hand Kostas Alexis
No ratings yet
Anomalous Motion Detection On Highway Using Deep Learning: Harpreet Singh Emily M. Hand Kostas Alexis
5 pages
Development and Validation of Two Artificial Intelligence Models For Diagnosing Benign, Pigmented Facial Skin Lesions
No ratings yet
Development and Validation of Two Artificial Intelligence Models For Diagnosing Benign, Pigmented Facial Skin Lesions
6 pages

CSE AIML Flood Prediction Guide

Uploaded by

CSE AIML Flood Prediction Guide

Uploaded by

GIET UNIVERSITY, GUNUPUR

SCHOOL OF ENGINEERING AND TECHNOLOGY

Step 1: Import Python Libraries:-

from sklearn.model_selection import train_test_split

Step 2: Read the Dataset:-

Step 3: Explore the Dataset:-

null values:- To find the null values In the dataset

NAME OF THE STUDENT: Arijeet Mishra ROLL NO – 21CSEAIML008

sns.heatmap(corr, xticklabels corr.columns, yticklabels

Step 3: Feature Selection:-

Start by importing the Select Best library:

from sklearn.feature_selection import SelectKBest

After, define X & Y:-

X= df.iloc[:,1:14] //for all features

Select the top 3 features:-

best_features= SelectKBest(score_func=chi2, k=3)

features_scores= pd.concat([df_columns, df_scores], axis=1)

Step 4: Build the Model:-

Splitting the dataset into train and test:-

NAME OF THE STUDENT: Arijeet Mishra ROLL NO – 21CSEAIML008

Create a logistic regression body:-

we predict the likelihood of a flood using the logistic regression body we

Step 5: Evaluate the Model’s Performance:-

• 5.1:- Mean Absolute Error(MAE):- MAE is a straightforward metric that calculates

from sklearn.metrics import mean absolute_error

• 5.2:- Mean Squared Error(MSE)

MSE is a popular and straightforward statistic with a bit of variation in mean

from sklearn.metrics import mean_squared_error

• 5.3:-Root Mean Squared Error(RMSE)

As the term, RMSE implies that it is a straightforward square root of mean

NAME OF THE STUDENT: Arijeet Mishra ROLL NO – 21cseaiml008

The R2 score, also called the coefficient of determination, is one of the

from sklearn.metrics import r2_score

• Accuracy is a score used to evaluate the model’s performance. The

NAME OF THE STUDENT: Arijeet Mishra Roll no – 21cseaiml008

from sklearn import metrics

The receiver operating characteristic (ROC) curve is used to display the

• To determine the ROC curve, first define the metrics:-

• Then, calculate the true positive and false positive rates:-

• Next, calculate the AUC to see the model's performance:-

auc= metrics.roc_auc_score(y_test, y_pred_proba)

• Finally, plot the ROC curve:-

NAME OF THE STUDENT: Arijeet Mishra roll no – 21AIML008

You might also like