0% found this document useful (0 votes)

87 views3 pages

Python Code

This Python code document shows how to load and explore CSV data files using the Pandas library. It demonstrates how to load data, rename columns, display rows of data, concatenate columns, and calculate descriptive statistics. The code loads CSV files on cereal and housing data, explores the data dimensions and types, and calculates summary values like mean, standard deviation, minimum, maximum and median for numeric columns.

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

87 views3 pages

Python Code

Uploaded by

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Pg 24 Table 2.

# Import required packages

import pandas as pd

# Load data

housing_df = pd.read_csv ('WestRoxbury.csv')

housing_df.shape # find the dimension of data frame

housing_df.head () # show the first five rows

print (housing_df) # show all the data

# Rename columns: replace spaces with '_' to allow dot notation

housing_df = housing_df.rename (columns={'Total Value' : 'Total_Value'}) # explicit

housing_df.columns = [s.strip().replace(' ', '_') for s in housing_df.columns] # all columns

# Practice showing the first four rows of the data

housing_df.loc[0:3] # loc[a:b] gives rows a to b, inclusive

housing_df.iloc[0:4] # iloc[a:b] gives rows a to b-1

# Different ways of showing the first 10 values in column Total_Value

housing_df ['Total_Value'] .iloc[0:10]

housing_df.iloc [4, 0:10]

housing_df.iloc [4:5, 0:10] # use a slice to return a data frame

# Use pd.concat to combine non-consecutive columns into a new data frame

# The axis argument specifies the dimension along which the

# concatenation happens, 0=rows, 1=columns

pd.concat([housing_df.iloc[4:6,0:2], housing_df.iloc[4:6,4:6]], axis=1)

# To specify a full column, use:

housing.iloc[:,0:1]

housing.Total_Value

housing_df['Total_Value'] [0:10] # show the first 10 rows of the first column

# Descriptive statistics

print ('Number of rows ', len(housing_df['Total_Value'])) # show length of first column

print ('Mean of Total_Value ', housing_df['Total_Value'] .mean()) # show mean of column

housing_df.describe() # show summary statistics for each column

Table 4.3

Cereals_df = pd.read_csv(‘Cereals.csv’)

Cereals_df =cereals_df.rename(columns={‘CAT, MEDV’ ; ‘CAT_MEDV’})

Cereals_df.head(9)

Cereals_df .describe()

# Compute mean, standard deviation, min, max, median

# cereals

Print (‘Mean : ‘, Cereals_df.cereals.mean())

Print (‘Std. dev : ‘, Cereals_df. cereals.std())

Print (‘Min : ‘, Cereals_df. cereals.min())

Print (‘Max : ‘, Cereals_df. cereals.max())

Print (‘Median : ‘, Cereals_df. cereals.median())

# Compute mean, standard dev., min, max, median

Pd.DataFrame({‘mean’ : cereals_df.mean() ,

‘sd’ : cereals_df.std() ,

‘min’ : cereals_df.min() ,

‘max’ : cereals_df.max() ,
‘median’ : cereals_df.median})

Python code in practice

import pandas as pd

df = pd.read_csv("Cereals.csv")

df.head()

# import pandas

import pandas as pd

# import matplotlib

import matplotlib.pyplot as plt

# import seaborn

import seaborn as sns

%matplotlib inline

UI/UX Design Essentials Guide
100% (2)
UI/UX Design Essentials Guide
20 pages
9050 User's Manual PDF
100% (1)
9050 User's Manual PDF
147 pages
Os Lab 3
No ratings yet
Os Lab 3
13 pages
Complexity Criteria for Reports
No ratings yet
Complexity Criteria for Reports
2 pages
GitHub - Genymobile - Scrcpy - Display and Control Your Android Device
No ratings yet
GitHub - Genymobile - Scrcpy - Display and Control Your Android Device
9 pages
MySQL Setup & Workbench Guide
No ratings yet
MySQL Setup & Workbench Guide
13 pages
Vdocuments - MX - Student Manual Abt CCP tsm143 Rslogix 5000 Level 3 Project Developmentpdf
No ratings yet
Vdocuments - MX - Student Manual Abt CCP tsm143 Rslogix 5000 Level 3 Project Developmentpdf
375 pages
TLE-TE 9 - Q1 - W5 - Mod5 - ICT CSS
100% (4)
TLE-TE 9 - Q1 - W5 - Mod5 - ICT CSS
31 pages
2G Integration Steps and MML Updates
No ratings yet
2G Integration Steps and MML Updates
4 pages
Ntro OM: Duction Puting
No ratings yet
Ntro OM: Duction Puting
49 pages
Ledger Mapping Restrictions Guide
No ratings yet
Ledger Mapping Restrictions Guide
2 pages
Computer Science Before College
No ratings yet
Computer Science Before College
11 pages
DEVOPS Parça4
No ratings yet
DEVOPS Parça4
15 pages
How To Install Java
No ratings yet
How To Install Java
17 pages
B Entry Point Specification v2 1 March2011 20110406011840641
No ratings yet
B Entry Point Specification v2 1 March2011 20110406011840641
50 pages
PM Debug Info
No ratings yet
PM Debug Info
275 pages
Sharepoint Online and Office 365 Administration
No ratings yet
Sharepoint Online and Office 365 Administration
238 pages
What Is Google App Engine? - Definition From TechTarget
No ratings yet
What Is Google App Engine? - Definition From TechTarget
8 pages
8086 Microprocessor Guide
No ratings yet
8086 Microprocessor Guide
26 pages
Aerohive PPSK User Management Guide
No ratings yet
Aerohive PPSK User Management Guide
29 pages
Wolf Crypt
No ratings yet
Wolf Crypt
1 page
Kunal 2
No ratings yet
Kunal 2
14 pages
Secure PA Systems for Large Projects
No ratings yet
Secure PA Systems for Large Projects
8 pages
CV Data Engineer English
No ratings yet
CV Data Engineer English
2 pages
AWS SAA Cheat Sheet
No ratings yet
AWS SAA Cheat Sheet
3 pages
Mike Pietielin Senior Software Engineer
No ratings yet
Mike Pietielin Senior Software Engineer
1 page
12 IT Sample Question Papper 01
No ratings yet
12 IT Sample Question Papper 01
3 pages
Object Oriented Programming20230316130823
No ratings yet
Object Oriented Programming20230316130823
33 pages
3HAC081964 PM IRB 1010-En
No ratings yet
3HAC081964 PM IRB 1010-En
340 pages
Unit 5 1 Basicblocks
No ratings yet
Unit 5 1 Basicblocks
39 pages