Code

The document details a lab task exploring the Iris dataset, including data loading, statistical analysis, and visualization. It implements Logistic Regression and Random Forest classifiers to predict species, reporting their accuracy and cross-validation results. Key findings include the best separating feature and the species with the largest average petal length.

Uploaded by

wetechhub1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views3 pages

Code

Uploaded by

wetechhub1

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

import numpy as np

import pandas as pd
import matplotlib.pyplot as plt
from sklearn import datasets
from sklearn.model_selection import train_test_split, cross_val_score
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import f_classif

student_name = "Maitha Al Shamsi"

std_id = "202200129"
deadline = "11/Sept/25"

print("Lab Task_01: Exploring the Iris Dataset")

print(f"Student Name: {student_name}")
print(f"STD ID: {std_id}")
print(f"Deadline: {deadline}")
print("-" * 60)

iris = datasets.load_iris()
df = pd.DataFrame(data=iris.data, columns=iris.feature_names)
df['species'] = pd.Categorical.from_codes(iris.target, iris.target_names)

print("First 10 rows of the Iris dataset:")

display(df.head(10))

features = iris.feature_names
means = df[features].mean()
medians = df[features].median()
modes = df[features].mode().iloc[0]
stats_df = pd.DataFrame({'mean': means, 'median': medians, 'mode': modes})
display(stats_df)

petal_length_col = 'petal length (cm)'

petal_width_col = 'petal width (cm)'
print("\nPetal length/width min and max:")
print("Petal length min:", df[petal_length_col].min())
print("Petal length max:", df[petal_length_col].max())
print("Petal width min:", df[petal_width_col].min())
print("Petal width max:", df[petal_width_col].max())

for feature in features:

plt.figure(figsize=(6,4))
plt.hist(df[feature], bins=10)
plt.title(f'Histogram of {feature}')
plt.xlabel(feature)
plt.ylabel('Frequency')
plt.show()

plt.figure(figsize=(6,5))
species_codes = df['species'].cat.codes
plt.scatter(df[petal_length_col], df[petal_width_col], c=species_codes)
plt.title('Petal length vs Petal width (colored by species)')
plt.xlabel(petal_length_col)
plt.ylabel(petal_width_col)
for i, name in enumerate(iris.target_names):
plt.scatter([], [], label=name)
plt.legend()
plt.show()

plt.figure(figsize=(6,5))
grouped = [group['sepal length (cm)'].values for name, group in df.groupby('species')]
plt.boxplot(grouped, labels=df['species'].cat.categories)
plt.title('Sepal length distribution across species')
plt.xlabel('Species')
plt.ylabel('Sepal length (cm)')
plt.show()

F, p = f_classif(df[features], df['species'].cat.codes)
separability = pd.DataFrame({'feature': features, 'F_value': F, 'p_value':
p}).sort_values(by='F_value', ascending=False)
display(separability)
print("Best separating feature:", separability.iloc[0]['feature'])

mean_petal_by_species = df.groupby('species')[petal_length_col].mean()
display(mean_petal_by_species)
print("Species with largest average petal length:", mean_petal_by_species.idxmax())

X = df[features].values
y = df['species'].cat.codes.values
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42,
stratify=y)

lr = LogisticRegression(max_iter=200)
lr.fit(X_train, y_train)
y_pred_lr = lr.predict(X_test)
print("\nLogistic Regression accuracy:", accuracy_score(y_test, y_pred_lr))
rf = RandomForestClassifier(n_estimators=100, random_state=42)
rf.fit(X_train, y_train)
y_pred_rf = rf.predict(X_test)
print("Random Forest accuracy:", accuracy_score(y_test, y_pred_rf))

cv_scores = cross_val_score(LogisticRegression(max_iter=200), X, y, cv=5)

print("5-fold CV (Logistic Regression) mean accuracy:", cv_scores.mean())

Fds Slips
No ratings yet
Fds Slips
6 pages
1 Assignment 3 - Classification
No ratings yet
1 Assignment 3 - Classification
16 pages
19mid0034 (Chandru) - ML Lab Fat - Jupyter Notebook
No ratings yet
19mid0034 (Chandru) - ML Lab Fat - Jupyter Notebook
4 pages
ML Lab Programs
No ratings yet
ML Lab Programs
23 pages
AI & ML Lab Journal for MCA Students
No ratings yet
AI & ML Lab Journal for MCA Students
77 pages
Data Visualization and Matplot
No ratings yet
Data Visualization and Matplot
11 pages
Iris - Ipynb - Colaboratory
No ratings yet
Iris - Ipynb - Colaboratory
8 pages
EXP 07 (ML) - Sarthak
No ratings yet
EXP 07 (ML) - Sarthak
4 pages
EXP 07 (ML) - Darshu
No ratings yet
EXP 07 (ML) - Darshu
4 pages
EXP 07 (ML) - Ashu
No ratings yet
EXP 07 (ML) - Ashu
4 pages
Exp 07 (ML)
No ratings yet
Exp 07 (ML)
4 pages
Assignment No - 10
No ratings yet
Assignment No - 10
3 pages
Part A Assignment 10
No ratings yet
Part A Assignment 10
3 pages
TranMinhTu1 bt2 2
No ratings yet
TranMinhTu1 bt2 2
5 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Lab Manual
No ratings yet
Lab Manual
32 pages
Data Visualization With Maplotlib
No ratings yet
Data Visualization With Maplotlib
8 pages
K Means On IRIS Dataset
No ratings yet
K Means On IRIS Dataset
4 pages
b21 DSBDA Assignment No 10
No ratings yet
b21 DSBDA Assignment No 10
1 page
SC Assignment Q2
No ratings yet
SC Assignment Q2
7 pages
Implementing Logistic Regression For Iris Using Sklearn and Checking The Accuracy Using Confusion Matrix
No ratings yet
Implementing Logistic Regression For Iris Using Sklearn and Checking The Accuracy Using Confusion Matrix
7 pages
Import As Import As Import As From Import Import As Import
No ratings yet
Import As Import As Import As From Import Import As Import
7 pages
Iris - Copy1 - Jupyter Notebook
No ratings yet
Iris - Copy1 - Jupyter Notebook
8 pages
Prac 10
No ratings yet
Prac 10
6 pages
25 - Assignment10.ipynb - Colaboratory
No ratings yet
25 - Assignment10.ipynb - Colaboratory
13 pages
137 Vsec 6
No ratings yet
137 Vsec 6
2 pages
LAB # 07 KNN - Iris Dataset - Ipynb - Colab
No ratings yet
LAB # 07 KNN - Iris Dataset - Ipynb - Colab
8 pages
Oracle Generative AI (1Z0-1127-25) Mock Test - Set - 3
No ratings yet
Oracle Generative AI (1Z0-1127-25) Mock Test - Set - 3
5 pages
Eda Lab Record Anna University
No ratings yet
Eda Lab Record Anna University
5 pages
Dsbda Ouput 1-10
No ratings yet
Dsbda Ouput 1-10
89 pages
L3 - Classification - RandomForest - Jupyter Notebook
No ratings yet
L3 - Classification - RandomForest - Jupyter Notebook
6 pages
Experiment 11 PML
No ratings yet
Experiment 11 PML
3 pages
Jaswinder Pal Singh 2024-04-05: Library Data Print Unique
No ratings yet
Jaswinder Pal Singh 2024-04-05: Library Data Print Unique
4 pages
Dsbda La 10
No ratings yet
Dsbda La 10
4 pages
Ass - 10.ipynb - Colab
No ratings yet
Ass - 10.ipynb - Colab
8 pages
Dsbda Assig 6 Data Analytcs 3
No ratings yet
Dsbda Assig 6 Data Analytcs 3
6 pages
Cota12 6
No ratings yet
Cota12 6
4 pages
Sample
No ratings yet
Sample
1 page
Nandini Matplotlib Ws
No ratings yet
Nandini Matplotlib Ws
10 pages
Trần Mạnh Hùng 20192643.Ipynb - Colab
No ratings yet
Trần Mạnh Hùng 20192643.Ipynb - Colab
6 pages
Data Visualization 3
No ratings yet
Data Visualization 3
3 pages
Practical 10 Code
No ratings yet
Practical 10 Code
5 pages
10 (3146)
No ratings yet
10 (3146)
2 pages
Pra 8
No ratings yet
Pra 8
4 pages
1 10
No ratings yet
1 10
4 pages
KNN - Jupyter Notebook
No ratings yet
KNN - Jupyter Notebook
8 pages
Iris Flower Classification Project
No ratings yet
Iris Flower Classification Project
9 pages
Machine Learning - Lab Record
No ratings yet
Machine Learning - Lab Record
43 pages
Assignment 3
No ratings yet
Assignment 3
7 pages
SudaBERT A Pre-Trained Encoder Representation
No ratings yet
SudaBERT A Pre-Trained Encoder Representation
4 pages
Dsfasdflalksdflkasdjfasf
No ratings yet
Dsfasdflalksdflkasdjfasf
4 pages
FDS Exp 4
No ratings yet
FDS Exp 4
2 pages
MCQ's of Data Mining CIT-661 Part 1 - Prepared by GCUF Guiders
No ratings yet
MCQ's of Data Mining CIT-661 Part 1 - Prepared by GCUF Guiders
9 pages
Exp 3
No ratings yet
Exp 3
3 pages
Welcome To The Basics Guide To Generative AI and Prompt Engineering!
No ratings yet
Welcome To The Basics Guide To Generative AI and Prompt Engineering!
8 pages
Lec 28 Variations in BPNN
100% (1)
Lec 28 Variations in BPNN
20 pages
Pattern Recognition SPPU Notes
No ratings yet
Pattern Recognition SPPU Notes
4 pages
Intelligent Systems and Applications: Proceedings of The 2020 Intelligent Systems Conference (IntelliSys) Volume 2 Kohei Arai Download
No ratings yet
Intelligent Systems and Applications: Proceedings of The 2020 Intelligent Systems Conference (IntelliSys) Volume 2 Kohei Arai Download
101 pages
AD3511 - Deep Learning Lab Manual
No ratings yet
AD3511 - Deep Learning Lab Manual
38 pages
MMC102 - Module 4 - Notes
No ratings yet
MMC102 - Module 4 - Notes
39 pages
Manuscript Anonymous r0
No ratings yet
Manuscript Anonymous r0
44 pages
0 Image Processing 1753170901
No ratings yet
0 Image Processing 1753170901
27 pages
Tensorflow Lab Manual
No ratings yet
Tensorflow Lab Manual
64 pages
IJISAE Bhagyashree Pathak
No ratings yet
IJISAE Bhagyashree Pathak
23 pages
AWS Certified AI Practitioner (AIF-C01)
No ratings yet
AWS Certified AI Practitioner (AIF-C01)
24 pages
Innorootsmaincourse
No ratings yet
Innorootsmaincourse
11 pages
Iris Flower
No ratings yet
Iris Flower
2 pages
Machine Learning Foundation
No ratings yet
Machine Learning Foundation
13 pages
EDADet EncoderDecoder Domain Augmented Alignment Detector For Tiny Objects in Remote Sensing Images-5
No ratings yet
EDADet EncoderDecoder Domain Augmented Alignment Detector For Tiny Objects in Remote Sensing Images-5
15 pages
Introduction To N-Grams and Evaluation
No ratings yet
Introduction To N-Grams and Evaluation
7 pages
Ds Lab Assignment 5
No ratings yet
Ds Lab Assignment 5
4 pages
DWDM Lab Report
No ratings yet
DWDM Lab Report
12 pages
Counterfeit Currency Detection Using Machine Learn
No ratings yet
Counterfeit Currency Detection Using Machine Learn
7 pages
Ai Notes Unit2 - X
No ratings yet
Ai Notes Unit2 - X
19 pages
IJAS 25 069 Galley Proof
No ratings yet
IJAS 25 069 Galley Proof
6 pages
Takeoff Edu Group Matlab Title List
No ratings yet
Takeoff Edu Group Matlab Title List
4 pages
Action Recognition From Egocentric Videos Using RGB Depth Modalities
No ratings yet
Action Recognition From Egocentric Videos Using RGB Depth Modalities
6 pages
End of Semester Exam - Image - Processing and Computer Vision
No ratings yet
End of Semester Exam - Image - Processing and Computer Vision
3 pages
ML Lab Assessment2.Ipynb - Colab
No ratings yet
ML Lab Assessment2.Ipynb - Colab
3 pages
TWP - Group2 - The Development of Models in Semantic
No ratings yet
TWP - Group2 - The Development of Models in Semantic
5 pages
Hyper-Specific Topic Selection & Research Paper Generation Active Learning - Few-Shot Medical Image Segmentation With Graph-Augmented Contrastive Learning
No ratings yet
Hyper-Specific Topic Selection & Research Paper Generation Active Learning - Few-Shot Medical Image Segmentation With Graph-Augmented Contrastive Learning
10 pages
Po and Co
No ratings yet
Po and Co
5 pages
Memoona Basharat: Career Objective
No ratings yet
Memoona Basharat: Career Objective
2 pages
Ex No4
No ratings yet
Ex No4
3 pages
DSE 6 - Colab
No ratings yet
DSE 6 - Colab
5 pages
Assignment1ML Prem - Ipynb - Colab
No ratings yet
Assignment1ML Prem - Ipynb - Colab
4 pages

Code

Uploaded by

Code

Uploaded by

import numpy as np

student_name = "Maitha Al Shamsi"

print("Lab Task_01: Exploring the Iris Dataset")

print("First 10 rows of the Iris dataset:")

petal_length_col = 'petal length (cm)'

for feature in features:

cv_scores = cross_val_score(LogisticRegression(max_iter=200), X, y, cv=5)

You might also like