List of Experiments - Data Analysis Lab

The document lists 9 experiments related to data analysis and visualization to be performed by students. The experiments cover topics like data wrangling, preprocessing, summarization, outlier detection, transformations, visualizations using libraries like Pandas, Seaborn and scikit-learn. Open datasets mentioned include Iris, Titanic and stock price data.

Uploaded by

Atharva Digambar


Department of AI & DS Engineering

List of Experiments

SUBJECT: Data Analysis Lab

CLASS: SY (A & B) SEMESTER: 4th

Academic Year : 2023-24

Sr. No.   Title of Experiment   CO   PO   PSO
1. Study and implementation of Pandas Profiling, Sweetviz, and Autoviz.
   CO: CO4
2. Data Wrangling I
   Perform the following operations using Python on any open source dataset (e.g., data.csv):
   1. Import all the required Python libraries.
   2. Locate open source data from the web (e.g., https://www.kaggle.com). Provide a clear description of the data and its source (i.e., the URL of the website).
   3. Load the dataset into a pandas DataFrame.
   4. Data preprocessing: check for missing values in the data using the pandas isnull() function, and use the describe() function to get some initial statistics. Provide variable descriptions and the types of the variables. Check the dimensions of the DataFrame.
   CO: CO4, CO5
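The loading and inspection steps of Experiment 2 could be sketched roughly as follows. This is a minimal illustration, not a model answer: the inline CSV text and its column names are made-up stand-ins for data.csv so that the snippet runs without a download.

```python
import io

import pandas as pd

# Hypothetical stand-in for data.csv (made-up rows); in the lab, replace this
# with pd.read_csv("data.csv") on the dataset you located.
CSV_TEXT = """name,age,city,score
Asha,21,Pune,88.5
Ravi,,Mumbai,72.0
Meera,23,,91.0
Kiran,22,Nashik,
"""

df = pd.read_csv(io.StringIO(CSV_TEXT))

# Dimensions of the DataFrame as (rows, columns).
print("Shape:", df.shape)

# Variable types as inferred by pandas.
print(df.dtypes)

# Number of missing values in each column.
missing_per_column = df.isnull().sum()
print(missing_per_column)

# Initial statistics for the numeric columns.
print(df.describe())
```

Note that describe() only covers numeric columns by default; passing include="all" also summarizes the text columns.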
3. Create an “Academic performance” dataset of students and perform the following operations using Python:
   1. Scan all variables for missing values and inconsistencies. If there are missing values and/or inconsistencies, use any suitable technique to deal with them.
   2. Scan all numeric variables for outliers. If there are outliers, use any suitable technique to deal with them.
   3. Apply data transformations to at least one of the variables. The purpose of this transformation should be one of the following: to change the scale for better understanding of the variable, to convert a non-linear relation into a linear one, or to decrease the skewness and convert the distribution into a normal distribution.
   Reason and document your approach properly.
   CO: CO4, CO5
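One possible way to approach the three steps of Experiment 3 is sketched below. The records, column names, and the choice of techniques (median imputation, the 1.5 × IQR rule with clipping, and a log transform) are assumptions made for illustration; the experiment allows any suitable technique.

```python
import numpy as np
import pandas as pd

# Made-up "Academic performance" records, for illustration only.
df = pd.DataFrame({
    "student": ["S1", "S2", "S3", "S4", "S5", "S6", "S7", "S8"],
    "math_score": [62, 71, np.nan, 68, 75, 66, 70, 250],  # 250 mimics an entry error
    "study_hours": [2.0, 3.5, 4.0, np.nan, 5.0, 2.5, 3.0, 40.0],
})
numeric_cols = ["math_score", "study_hours"]

# 1. Missing values: fill each numeric gap with that column's median.
df[numeric_cols] = df[numeric_cols].fillna(df[numeric_cols].median())


def iqr_bounds(series):
    """Return the lower and upper fences of the 1.5 * IQR rule."""
    q1, q3 = series.quantile(0.25), series.quantile(0.75)
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


# 2. Outliers: flag values outside the fences, then clip them to the fences.
for col in numeric_cols:
    low, high = iqr_bounds(df[col])
    df[col + "_outlier"] = ~df[col].between(low, high)
    df[col] = df[col].clip(lower=low, upper=high)

# 3. Transformation: log(1 + x) compresses the upper tail of a right-skewed variable.
df["study_hours_log"] = np.log1p(df["study_hours"])

# Compare skewness before and after the transformation.
print("skew before:", df["study_hours"].skew())
print("skew after: ", df["study_hours_log"].skew())
print(df)
```

Clipping keeps every row, which suits a small class dataset; dropping the flagged rows is an equally valid choice as long as the reasoning is documented.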
4. Perform the following operations on any open source dataset (e.g., data.csv):
   1. Provide summary statistics (mean, median, minimum, maximum, standard deviation) for a dataset (age, income, etc.) with numeric variables grouped by one of the qualitative (categorical) variables. For example, if your categorical variable is age group and your quantitative variable is income, then provide summary statistics of income grouped by age group. Create a list that contains a numeric value for each response to the categorical variable.
   2. Write a Python program to display some basic statistical details such as percentile, mean, and standard deviation of the species ‘Iris-setosa’, ‘Iris-versicolor’ and ‘Iris-virginica’ of the iris.csv dataset.
   Provide the code with outputs and explain everything that you do in this step.
   CO: CO4, CO5
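The grouped summary in Experiment 4 might look something like the sketch below. The survey records and the column names age_group and income are made up for illustration, and the species column name mentioned in the final comment is an assumption about how iris.csv is laid out.

```python
import pandas as pd

# Made-up survey records, for illustration only.
df = pd.DataFrame({
    "age_group": ["18-25", "18-25", "26-40", "26-40", "41-60", "41-60"],
    "income": [18000, 22000, 40000, 52000, 61000, 75000],
})

# Summary statistics of the numeric variable grouped by the categorical one.
summary = df.groupby("age_group")["income"].agg(
    ["mean", "median", "min", "max", "std"]
)
print(summary)

# A list holding one numeric code per response to the categorical variable.
codes, categories = pd.factorize(df["age_group"])
age_group_codes = codes.tolist()
print(age_group_codes, list(categories))

# describe() per group adds the count and the 25th/50th/75th percentiles.
# The same pattern would apply to iris.csv, assuming its label column is
# named "species":  iris.groupby("species").describe()
print(df.groupby("age_group")["income"].describe())
```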
5. 1. Use the inbuilt dataset 'titanic'. The dataset contains 891 rows with information about the passengers who boarded the ill-fated Titanic. Use the Seaborn library to look for patterns in the data.
   2. Write code to check how the ticket price (column name: 'fare') of each passenger is distributed by plotting a histogram.
   CO: CO4, CO5
6. Use the inbuilt dataset 'titanic' as used in the above problem. Plot a box plot of the distribution of age with respect to each gender, along with the information about whether they survived or not (column names: 'sex' and 'age').
   Write observations on the inference from the above statistics.
   CO: CO4, CO5
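The box plot in Experiment 6 could be produced roughly as shown here. As before, the DataFrame is a synthetic stand-in (an assumption made so the snippet runs offline) that only mirrors the 'sex', 'age', and 'survived' column names of the Seaborn titanic dataset.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the script also runs headless
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# In the lab: titanic = sns.load_dataset("titanic")
# Synthetic stand-in (random values, not real passenger records):
rng = np.random.default_rng(1)
n = 200
titanic = pd.DataFrame({
    "sex": rng.choice(["male", "female"], size=n),
    "age": rng.normal(loc=30, scale=12, size=n).clip(1, 80),
    "survived": rng.integers(0, 2, size=n),
})

# Age distribution for each gender, split by survival status.
fig, ax = plt.subplots(figsize=(6, 4))
sns.boxplot(data=titanic, x="sex", y="age", hue="survived", ax=ax)
ax.set_title("Age by sex and survival")

# Group medians give numbers to cite in the written observations.
medians = titanic.groupby(["sex", "survived"])["age"].median()
print(medians)
```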
7. Data Visualization III
   Download the Iris flower dataset or any other dataset into a DataFrame (e.g., https://archive.ics.uci.edu/ml/datasets/Iris). Scan the dataset and give the inference as:
   a. List down the features and their types (e.g., numeric, nominal) available in the dataset.
   b. Create a histogram for each feature in the dataset to illustrate the feature distributions.
   c. Create a boxplot for each feature in the dataset.
   Compare distributions and identify outliers.
   CO: CO4, CO5
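Steps a to c of Experiment 7 can be sketched as below. This version assumes scikit-learn is installed and loads its bundled copy of Iris rather than downloading from the UCI URL; the 1.5 × IQR rule used to count outliers is one common convention, not something the experiment prescribes.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the script also runs headless
import matplotlib.pyplot as plt
import pandas as pd
from sklearn.datasets import load_iris

# Load the copy of Iris bundled with scikit-learn (no download needed).
bunch = load_iris(as_frame=True)
iris = bunch.frame.copy()
iris["species"] = iris["target"].map(dict(enumerate(bunch.target_names)))
iris = iris.drop(columns="target")

# a. Features and their types: four numeric features plus a nominal label.
print(iris.dtypes)
feature_cols = iris.select_dtypes("number").columns

# b. One histogram per numeric feature.
hist_axes = iris[feature_cols].hist(bins=15, figsize=(8, 6))

# c. One boxplot per numeric feature on a shared axis.
fig, ax = plt.subplots(figsize=(8, 4))
iris[feature_cols].boxplot(ax=ax)

# Count the points outside the 1.5 * IQR fences for each feature.
q1 = iris[feature_cols].quantile(0.25)
q3 = iris[feature_cols].quantile(0.75)
iqr = q3 - q1
outlier_mask = (iris[feature_cols] < q1 - 1.5 * iqr) | (iris[feature_cols] > q3 + 1.5 * iqr)
print(outlier_mask.sum())
```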
8. Implement a mini project on predicting stock prices using Pandas and scikit-learn.
   CO: CO4, CO5
9. Implement color detection using Pandas, Autoviz, and Sweetviz.
   CO: CO4, CO5

Mr. B. B. Kondbhar

Subject In-charge
