Data Science Fundamentals

The document discusses various data science concepts and algorithms including working with packages in Python, NumPy arrays, Pandas data frames, and univariate analysis on diabetes data. Example code is provided to work with NumPy, Pandas, and perform frequency, mean, median, mode, variance, standard deviation, skewness and kurtosis on a diabetes dataset.

Uploaded by

thilakraj.a0321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views22 pages

Data Science Fundamentals

Uploaded by

thilakraj.a0321

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 22

DATA SCIENCE FUNDAMENTALS

LAB EXERCISE
PROGRAMS AND OUTPUTS
EX:1 PACKAGES FOR DATA SCINCE IN PYTHON

AIM : TO DOWNLOAD,INSTALL AND EXPLORE THE FEATURES

OF PYTHON FOR DATA ANALYTICS.
ALGORITHM :
STEP 1 :
STEP 2 :
STEP 3 :
STEP 4 :
STEP 5 :
PROGRAM :

OUTPUT :
AIM : WORKING WITH NUMPY ARRAYS

ALGORITHM :
STEP 1 :
STEP 2 :
STEP 3 :
STEP 4 :
STEP 5 :

PROGRAM :
import numpy as np
list_1 = [1, 2, 3, 4]
list_2 = [5, 6, 7, 8]
list_3 = [9, 10, 11, 12]
sample_array = np.array([list_1,
list_2,
list_3])
print("Numpy multi dimensional array in python\n",
sample_array)
OUTPUT :
Numpy multi dimensional array in python
[[ 1 2 3 4]
[ 5 6 7 8]
[ 9 10 11 12]]
AIM : WORKING WITH PANDAS DATA FRAMES
ALGORTIHM :
STEP 1 :
STEP 2:
STEP 3 :
STEP 4 :
STEP 5 :
PROGRAM :
import pandas as pd
data = {'Name':['Jai', 'Princi', 'Gaurav', 'Anuj'],
'Age':[27, 24, 22, 32],
'Address':['Delhi', 'Kanpur', 'Allahabad', 'Kannauj'],
'Qualification':['Msc', 'MA', 'MCA', 'Phd']}
df = pd.DataFrame(data)
print(df[['Name', 'Qualification']])
OUTPUT :
EX.NO 5. USE THE DIABETES DATA SET FROM UCI
AND PIMA INDIANS DATE:DIABETES DATA SET FOR
PERFORMING THE FOLLOWING:

A) UNIVARIATE ANALYSIS: FREQUENCY, MEAN, MEDIAN,

MODE, VARIANCE, STANDARD DEVIATION, SKEWNESS
AND KURTOSIS.

AIM:
To explore various commands for doing Univariate analytics on
the UCI AND PIMA
INDIANS DIABETES data set.
ALGORITHM:
STEP 1: Start the program
STEP 2: To download the UCI AND PIMA INDIANS DIABETES
data set using Kaggle.
STEP 3: To read data from UCI AND PIMA INDIANS DIABETES
data set.
STEP 4: To find the mean, median, mode, variance,
standard deviation, skewness and kurtosis in the
given excel data set package.
STEP 5: Display the output.
STEP 6: Stop the program.
PROGRAM:
import
pandas as pd
import numpy
as np
import matplotlib.pyplot
as plt import seaborn
as sns
sns.set_style('darkgrid')
%matplotlib inline
from matplotlib.ticker import
FormatStrFormatter import warnings
warnings.filterwarnings('ignore')

df =
pd.read_csv('C:/Users/kirub/Documents/Learning/Untitled
Folder/diabetes.csv') df.head()
df.shape
df.dtypes
df['Outcome']=df['Outcome'].astype('bool')
df.dtypes['Outcome']
df.info()
df.describ
e().T

# Frequency# finding the

unique count df1 =
df['Outcome'].value_count
s()

#
displaying
df1
print(df1)
#mean
df.mean()
#median
df.median(
)
#mode df.mode()
#Variance
df.var()
#standard
deviation
df.std()
#
#kurtosis
df.kurtosis(axis=0,skipn
a=True)
df['Outcome'].kurtosis(axis=0,s
kipna=True) #skewness
# skewness along the
index axis df.skew(axis
= 0, skipna = True)

# skip the na values

# find skewness in
each row df.skew(axis
= 1, skipna = True)

#Pregnancy variable
preg_proportion =
np.array(df['Pregnancies'].value_counts())
preg_month =
np.array(df['Pregnancies'].value_counts().index)
preg_proportion_perc =
np.array(np.round(preg_proportion/
sum(preg_proportion),3)*100,dtype=int)

preg =
pd.DataFrame({'month':preg_month,'count_of_preg_prop':preg_p
roportion,'percentage_pro portion':preg_proportion_perc})
preg.set_index(['month'],inplac
e=True) preg.head(10)
sns.countplot(data=df['Outcome'])

sns.distplot(df['Pregnancies'])

sns.boxplot(data=df['Pregnancies'])
OUTPUT:
AIM :
ALGORITHM :
STEP : 1
STEP : 2
STEP : 3
STEP : 4
STEP : 5
OUTPUT :
AIM :
ALGORITHM :
STEP : 1
STEP : 2
STEP : 3
STEP : 4
STEP : 5
OUTPUT :
AIM :
ALGORITHM :
STEP : 1
STEP : 2
STEP : 3
STEP : 4
STEP : 5
OUTPUT :
AIM :
ALGORITHM :
STEP : 1
STEP : 2
STEP : 3
STEP : 4
STEP : 5
OUTPUT :
AIM :
ALGORITHM :
STEP : 1
STEP : 2
STEP : 3
STEP : 4
STEP : 5
OUTPUT :
AIM :
ALGORITHM :
STEP : 1
STEP : 2
STEP : 3
STEP : 4
STEP : 5
OUTPUT :

Manual Yamaha Stagea
100% (1)
Manual Yamaha Stagea
188 pages
cs3362 Foundations of Data Science Lab Manual
67% (9)
cs3362 Foundations of Data Science Lab Manual
53 pages
Soal Dan Kunci Jawaban PTS SMT 2
100% (1)
Soal Dan Kunci Jawaban PTS SMT 2
7 pages
DAV Practical
No ratings yet
DAV Practical
12 pages
Practical 4
No ratings yet
Practical 4
3 pages
Anexa 1 - 412
No ratings yet
Anexa 1 - 412
4 pages
Data Perparation Penting
No ratings yet
Data Perparation Penting
12 pages
Dsa Lab Record (Ai&Ds)
No ratings yet
Dsa Lab Record (Ai&Ds)
34 pages
Fds
No ratings yet
Fds
30 pages
Ad3411 - Dsa Lab Manual
No ratings yet
Ad3411 - Dsa Lab Manual
34 pages
FDSA Lab Manual 1
No ratings yet
FDSA Lab Manual 1
34 pages
Sanyam Data Science
No ratings yet
Sanyam Data Science
33 pages
Py 10
No ratings yet
Py 10
5 pages
Open-Circuit Time Constant Analysis: Asas As Hs K Bsbs Bs
No ratings yet
Open-Circuit Time Constant Analysis: Asas As Hs K Bsbs Bs
24 pages
Practical 1
No ratings yet
Practical 1
5 pages
Fdsa Record Ai&Ds
No ratings yet
Fdsa Record Ai&Ds
26 pages
Dsa Lab
No ratings yet
Dsa Lab
28 pages
Purple Modern Course Z-Fold Brochure
No ratings yet
Purple Modern Course Z-Fold Brochure
2 pages
Data Science Programs
No ratings yet
Data Science Programs
11 pages
FDSA Lab Manual Aim Algorithm
No ratings yet
FDSA Lab Manual Aim Algorithm
32 pages
Practical Assignment4 1
No ratings yet
Practical Assignment4 1
6 pages
She - Elvis Costello - Strings Arrangement
No ratings yet
She - Elvis Costello - Strings Arrangement
8 pages
CS3362 Data Science Laboratory Manual 2022-23
No ratings yet
CS3362 Data Science Laboratory Manual 2022-23
54 pages
NumPy and Pandas Step
No ratings yet
NumPy and Pandas Step
9 pages
PP DWDM 4 5
No ratings yet
PP DWDM 4 5
26 pages
Pipe Sizing
No ratings yet
Pipe Sizing
11 pages
FDSA Lab Manual
No ratings yet
FDSA Lab Manual
31 pages
Statistics IMP Questions and Answers
No ratings yet
Statistics IMP Questions and Answers
23 pages
DA Manual - Part B
No ratings yet
DA Manual - Part B
13 pages
ML Lab Manual-Iso
No ratings yet
ML Lab Manual-Iso
40 pages
CS3361 Set2
No ratings yet
CS3361 Set2
6 pages
Python Data Exploration Guide
100% (1)
Python Data Exploration Guide
12 pages
Index
No ratings yet
Index
4 pages
Data Science Practical Problems
No ratings yet
Data Science Practical Problems
40 pages
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
No ratings yet
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
71 pages
FDS LAB Record Print
No ratings yet
FDS LAB Record Print
45 pages
Python Data Analysis Guide
No ratings yet
Python Data Analysis Guide
1 page
Data Pre-Processing
No ratings yet
Data Pre-Processing
22 pages
Data Science and Analtics Laboratory
No ratings yet
Data Science and Analtics Laboratory
21 pages
Data Science and Analtics Laboratory
No ratings yet
Data Science and Analtics Laboratory
21 pages
ML Programs
No ratings yet
ML Programs
41 pages
DataAnalytics Lab Manual
No ratings yet
DataAnalytics Lab Manual
35 pages
DWDM Lab Manual
No ratings yet
DWDM Lab Manual
32 pages
FDS Final Manual
No ratings yet
FDS Final Manual
41 pages
FDS Slot 1
No ratings yet
FDS Slot 1
19 pages
Python Basics for Beginners
No ratings yet
Python Basics for Beginners
4 pages
Informatics Practices Record Class 12
No ratings yet
Informatics Practices Record Class 12
60 pages
ModuleAr Merged
No ratings yet
ModuleAr Merged
42 pages
Data Science Laboratory
No ratings yet
Data Science Laboratory
40 pages
AD3411
No ratings yet
AD3411
28 pages
Model Military International 07.2019
100% (7)
Model Military International 07.2019
68 pages
2A - Python+Data Analysis For Pyhton2 v2
No ratings yet
2A - Python+Data Analysis For Pyhton2 v2
38 pages
ML Data Preprocessing in Python
No ratings yet
ML Data Preprocessing in Python
9 pages
K-Nearest Neighbors For Diabetes Prediction: Malik Yousaf (F2020019038) Ahsan Rauf (F2020019057)
No ratings yet
K-Nearest Neighbors For Diabetes Prediction: Malik Yousaf (F2020019038) Ahsan Rauf (F2020019057)
15 pages
Data Science Practical Book - Ipynb
No ratings yet
Data Science Practical Book - Ipynb
21 pages
Python Data Analysis Package Guide
No ratings yet
Python Data Analysis Package Guide
18 pages
Industrial Screw Pumps Guide
100% (1)
Industrial Screw Pumps Guide
4 pages
ML Proj Diabetes
No ratings yet
ML Proj Diabetes
51 pages
Word 2010 Practice Exercise Guide
No ratings yet
Word 2010 Practice Exercise Guide
2 pages
Inventions of The American Industrial Revolution
No ratings yet
Inventions of The American Industrial Revolution
14 pages
Python For Machine Learning
No ratings yet
Python For Machine Learning
66 pages
Hrithik Saini Class 12th c1, Roll No 1033
No ratings yet
Hrithik Saini Class 12th c1, Roll No 1033
25 pages
FAQ299
No ratings yet
FAQ299
2 pages
AC Disconnects
No ratings yet
AC Disconnects
2 pages
DAV Guidelines
No ratings yet
DAV Guidelines
4 pages
PCM 1867
No ratings yet
PCM 1867
141 pages
Durascale Manual
No ratings yet
Durascale Manual
9 pages
Data Science Lab Manual: Pandas & Analysis
No ratings yet
Data Science Lab Manual: Pandas & Analysis
53 pages
Fundamentals of Data Science Students
No ratings yet
Fundamentals of Data Science Students
52 pages
е з Guizhou Tyre Co.,Ltd
No ratings yet
е з Guizhou Tyre Co.,Ltd
76 pages
Control of Sulphur Oxides
No ratings yet
Control of Sulphur Oxides
10 pages
CS 3362 FDS
No ratings yet
CS 3362 FDS
53 pages
Mechanics of Fluid (UCE03B03) Total Credit: 03 Contact Periods: 03 (2L+1T+0P)
No ratings yet
Mechanics of Fluid (UCE03B03) Total Credit: 03 Contact Periods: 03 (2L+1T+0P)
5 pages
The Agahozo-Shalom Youth Village in Rwanda: Year-Long Village Fellows Program
No ratings yet
The Agahozo-Shalom Youth Village in Rwanda: Year-Long Village Fellows Program
6 pages
Cryptography Cipher Case Study
No ratings yet
Cryptography Cipher Case Study
6 pages
Simrit Seal Profile
No ratings yet
Simrit Seal Profile
5 pages
Final Report From The Examination of The Aviation Accident No 192/2010/11 Involving The Tu-154M Airplane, Tail Number 101, Which Occurred On April 10th, 2010 in The Area of The SMOLENSK NORTH Airfield
No ratings yet
Final Report From The Examination of The Aviation Accident No 192/2010/11 Involving The Tu-154M Airplane, Tail Number 101, Which Occurred On April 10th, 2010 in The Area of The SMOLENSK NORTH Airfield
328 pages
The Galactic Union of Advanced Lifeforms
No ratings yet
The Galactic Union of Advanced Lifeforms
2 pages
TMC Manifesto
No ratings yet
TMC Manifesto
72 pages
IT Professional Resume
No ratings yet
IT Professional Resume
3 pages
Ad3411 - Student
No ratings yet
Ad3411 - Student
27 pages
1000 KW Mitsubishi Diesel Generator Set - Non EPA - TP-M1000-T1-60 PDF
No ratings yet
1000 KW Mitsubishi Diesel Generator Set - Non EPA - TP-M1000-T1-60 PDF
5 pages
101 Ways To Promote Your Real Estate Web Site
100% (1)
101 Ways To Promote Your Real Estate Web Site
391 pages
The Use of Link Motion On Mechanical Presses
No ratings yet
The Use of Link Motion On Mechanical Presses
6 pages
Daylighting for School Design
50% (2)
Daylighting for School Design
48 pages