0% found this document useful (0 votes)

28 views5 pages

L-3 (Data Frame Part 2) .Ipynb - Colab

The document provides a tutorial on using Python's Pandas library for data manipulation, focusing on creating and modifying DataFrames. It covers creating DataFrames from dictionaries, describing data, and various methods for selecting, updating, and dropping data. Additionally, it demonstrates how to use indexing and renaming for better data management.

Uploaded by

ashishpal2804

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views5 pages

L-3 (Data Frame Part 2) .Ipynb - Colab

Uploaded by

ashishpal2804

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

3/14/25, 4:36 PM L-3 (Data Frame Part 2).

ipynb - Colab

keyboard_arrow_down Python Data Frames Part 2

# import libraries
import numpy as np
import pandas as pd

#Create a Dictionary of series

d1 = {'Name':pd.Series(['Tina','Jatin','Ritu','Vinish','Sita','Ritu','Sita','Ritu',
'Darvid','Gurpreet','Beena','Adesh']),
'Age':pd.Series([25,26,25,23,31,29,30,34,40,30,51,46]),
'Rating':pd.Series([4.23,3.24,3.98,2.56,3.20,4.6,3.8,3.78,2.98,4.80,4.10,3.65])
}
#Create a DataFrame
df = pd.DataFrame(d1)

print(df)

print(df.describe(include=['object']))

Name Age Rating

0 Tina 25 4.23
1 Jatin 26 3.24
2 Ritu 25 3.98
3 Vinish 23 2.56
4 Sita 31 3.20
5 Ritu 29 4.60
6 Sita 30 3.80
7 Ritu 34 3.78
8 Darvid 40 2.98
9 Gurpreet 30 4.80
10 Beena 51 4.10
11 Adesh 46 3.65
Name
count 12
unique 9
top Ritu
freq 3

print(df.describe(include=['number']))

Age Rating
count 12.000000 12.000000
mean 32.500000 3.743333
std 8.816307 0.661628
min 23.000000 2.560000
25% 25.750000 3.230000
50% 30.000000 3.790000
75% 35.500000 4.132500
max 51.000000 4.800000

print(df.describe(include='all'))

Name Age Rating

count 12 12.000000 12.000000
unique 9 NaN NaN
top Ritu NaN NaN
freq 3 NaN NaN
mean NaN 32.500000 3.743333
std NaN 8.816307 0.661628
min NaN 23.000000 2.560000
25% NaN 25.750000 3.230000
50% NaN 30.000000 3.790000
75% NaN 35.500000 4.132500
max NaN 51.000000 4.800000

#Pandas Dataframe.pop() : Deletes a COLUMN from the dataframe

print(marks)
marks.pop("Maths")
print("\n Changed Frame\n")
print(marks)

RollNo Name Eco Maths

0 1 Arnab 18 57
1 2 Kritika 23 45
2 3 Divyam 51 37
3 4 Vivaan 40 60
4 5 Aaaroosh 18 27

Changed Frame

https://colab.research.google.com/drive/1r50IfA2defohJdwiqmLmTdqZ_JIP0wf2#printMode=true 1/5
3/14/25, 4:36 PM L-3 (Data Frame Part 2).ipynb - Colab
RollNo Name Eco
0 1 Arnab 18
1 2 Kritika 23
2 3 Divyam 51
3 4 Vivaan 40
4 5 Aaaroosh 18

# Dataframe.drop() - Does not change the dataframe unless inplace = True

# A list of index labels is passed and the rows corresponding to those labels are dropped using drop()
marks.drop([1,4], inplace = True)
print(marks)

# Dataframe.drop() for dropping a column

marks.drop("Eco", axis=1)
print(marks)

RollNo Name Eco

0 1 Arnab 18
2 3 Divyam 51
3 4 Vivaan 40

marks.drop("Eco", axis=1, inplace = True)

print(marks)

RollNo Name
0 1 Arnab
2 3 Divyam
3 4 Vivaan

# Extracting rows using Pandas .loc[] .iloc[]

# Create a sample student dataset consisting of 5 columns – age, section, city, gender, and favorite color.
# This dataset will contain both numerical as well as categorical variables:

# create a sample dataframe

data = pd.DataFrame({
'age' : [ 10, 22, 13, 21, 12, 11, 17],
'section' : [ 'A', 'B', 'C', 'B', 'B', 'A', 'A'],
'city' : [ 'Gurgaon', 'Delhi', 'Mumbai', 'Delhi', 'Mumbai', 'Delhi', 'Mumbai'],
'gender' : [ 'M', 'F', 'F', 'M', 'M', 'M', 'F'],
'favourite_color' : [ 'red', np.NAN, 'yellow', np.NAN, 'black', 'green', 'red'] })
print(data)

age section city gender favourite_color

0 10 A Gurgaon M red
1 22 B Delhi F NaN
2 13 C Mumbai F yellow
3 21 B Delhi M NaN
4 12 B Mumbai M black
5 11 A Delhi M green
6 17 A Mumbai F red

print(data.loc[1:3]) # Observe that the entire range of row labels is displayed

age section city gender favourite_color

1 22 B Delhi F NaN
2 13 C Mumbai F yellow
3 21 B Delhi M NaN

print(data.loc[[1,4,5]]) # list of row labels

age section city gender favourite_color

1 22 B Delhi F NaN
4 12 B Mumbai M black
5 11 A Delhi M green

# select all rows with a condition in a column

print("\nData with age greater than 15\n")
print(data.loc[data.age >= 15])

Data with age greater than 15

age section city gender favourite_color

1 22 B Delhi F NaN
3 21 B Delhi M NaN
6 17 A Mumbai F red

https://colab.research.google.com/drive/1r50IfA2defohJdwiqmLmTdqZ_JIP0wf2#printMode=true 2/5
3/14/25, 4:36 PM L-3 (Data Frame Part 2).ipynb - Colab
# select rows with multiple conditions
print(data.loc[(data.age >= 12) & (data.gender == 'M')])

age section city gender favourite_color

3 21 B Delhi M NaN
4 12 B Mumbai M black

# select few columns with a condition

data.loc[(data.age >= 12), ['city', 'gender']]

city gender

1 Delhi F

2 Mumbai F

3 Delhi M

4 Mumbai M

6 Mumbai F
 

# Update the values of a particular column on selected rows

print(data)
data.loc[(data.age >= 12), ['section']] = 'M'
print(data)

age section city gender favourite_color

0 10 A Gurgaon M red
1 22 B Delhi F NaN
2 13 C Mumbai F yellow
3 21 B Delhi M NaN
4 12 B Mumbai M black
5 11 A Delhi M green
6 17 A Mumbai F red
age section city gender favourite_color
0 10 A Gurgaon M red
1 22 M Delhi F NaN
2 13 M Mumbai F yellow
3 21 M Delhi M NaN
4 12 M Mumbai M black
5 11 A Delhi M green
6 17 M Mumbai F red

# update multiple columns with condition

data.loc[(data.age >= 20), ['section', 'city']] = ['S','Pune']
print(data)

age section city gender favourite_color

0 10 A Gurgaon M red
1 22 S Pune F NaN
2 13 M Mumbai F yellow
3 21 S Pune M NaN
4 12 M Mumbai M black
5 11 A Delhi M green
6 17 M Mumbai F red

data.index=['a','b','c','d','e','f','g']
print(data)

age section city gender favourite_color

a 10 A Gurgaon M red
b 22 S Pune F NaN
c 13 M Mumbai F yellow
d 21 S Pune M NaN
e 12 M Mumbai M black
f 11 A Delhi M green
g 17 M Mumbai F red

data.loc[0:2] # error as labels are 'a' to 'g'

https://colab.research.google.com/drive/1r50IfA2defohJdwiqmLmTdqZ_JIP0wf2#printMode=true 3/5
3/14/25, 4:36 PM L-3 (Data Frame Part 2).ipynb - Colab

---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
<ipython-input-46-b83772436b9b> in <cell line: 1>()
----> 1 data.loc[0:2] # error as labels are 'a' to 'g'

6 frames
/usr/local/lib/python3.10/dist-packages/pandas/core/indexes/base.py in _maybe_cast_slice_bound(self, label, side, kind)
6621 # reject them, if index does not contain label
6622 if (is_float(label) or is_integer(label)) and label not in self:
-> 6623 raise self._invalid_indexer("slice", label)
6624
6625 return label

TypeError: cannot do slice indexing on Index with these indexers [0] of type int

 

#iloc()
# select rows with indexes
data.iloc[[0,2]]

age section city gender favourite_color

a 10 A Gurgaon M red

c 13 M Mumbai F yellow
 

# select rows with particular indexes and particular columns

data.iloc[[0,2],[1,3]]

section gender

a A M

c M F

# select a range of rows

data.iloc[1:3]

age section city gender favourite_color

b 22 S Pune F NaN

c 13 M Mumbai F yellow

# select a range of rows and columns

data.iloc[1:3,2:4]

city gender

b Pune F

c Mumbai F

# changing index labels and column labels with rename()

print(marks)
marks.rename(index={0:'R1', 1:'R2',2:'R3',3:'R4',4:'R5'} , inplace = True)
print(marks)

RollNo Name
0 1 Arnab
2 3 Divyam
3 4 Vivaan
RollNo Name
R1 1 Arnab
R3 3 Divyam
R4 4 Vivaan

marks.rename(columns={'RollNo':'Roll Num', 'Name':'Student Name','Eco':'Economics','Maths':'Mathematics'} ,

inplace = True)
print(marks)

Roll Num Student Name

R1 1 Arnab
R3 3 Divyam
R4 4 Vivaan

https://colab.research.google.com/drive/1r50IfA2defohJdwiqmLmTdqZ_JIP0wf2#printMode=true 4/5
3/14/25, 4:36 PM L-3 (Data Frame Part 2).ipynb - Colab

https://colab.research.google.com/drive/1r50IfA2defohJdwiqmLmTdqZ_JIP0wf2#printMode=true 5/5

Python Cheat Sheet 2.0
100% (1)
Python Cheat Sheet 2.0
10 pages
Column Base Plate Calculation Report
No ratings yet
Column Base Plate Calculation Report
13 pages
Python & Pandas Cheat Sheet Guide
100% (2)
Python & Pandas Cheat Sheet Guide
5 pages
Python & Data Science Cheat Sheet
100% (4)
Python & Data Science Cheat Sheet
11 pages
Subject: Computer Organization Sub Code: 21Cs34 Semester: 3
No ratings yet
Subject: Computer Organization Sub Code: 21Cs34 Semester: 3
43 pages
Series 1
No ratings yet
Series 1
408 pages
Pandas
No ratings yet
Pandas
27 pages
Geotechnical Study for Baghdad Site
No ratings yet
Geotechnical Study for Baghdad Site
20 pages
Grade 8 August Holiday Revision Booklet
No ratings yet
Grade 8 August Holiday Revision Booklet
154 pages
Pandas Practicals - Term-1
100% (1)
Pandas Practicals - Term-1
18 pages
Python Project File
No ratings yet
Python Project File
31 pages
Revision Notes DataFrame XII IP
No ratings yet
Revision Notes DataFrame XII IP
8 pages
Solenoid Valve 2/2 Way N.O. Direct Acting - Dampness-Proof IP 67
No ratings yet
Solenoid Valve 2/2 Way N.O. Direct Acting - Dampness-Proof IP 67
2 pages
100 Pandas Puzzles
No ratings yet
100 Pandas Puzzles
20 pages
Assignment 1 (Set A)
No ratings yet
Assignment 1 (Set A)
4 pages
2023-Wireless Communications-CEP-Project
No ratings yet
2023-Wireless Communications-CEP-Project
4 pages
Dataframe Cheat Sheet
No ratings yet
Dataframe Cheat Sheet
2 pages
Unit 4
No ratings yet
Unit 4
27 pages
Even Students
No ratings yet
Even Students
36 pages
Oddstudents
No ratings yet
Oddstudents
35 pages
British Standard: A Single Copy of This British Standard Is Licensed To Giorgio Cavalieri On March 15, 2001
No ratings yet
British Standard: A Single Copy of This British Standard Is Licensed To Giorgio Cavalieri On March 15, 2001
21 pages
Numpy Boolean Indexing: Filter
No ratings yet
Numpy Boolean Indexing: Filter
39 pages
12 IP CBSE Practical File (PART-1)
No ratings yet
12 IP CBSE Practical File (PART-1)
27 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
Pandas Series and DataFrame Guide
No ratings yet
Pandas Series and DataFrame Guide
98 pages
Pandas & Vis 2
No ratings yet
Pandas & Vis 2
11 pages
Pandas 2 Complete Notes Class XII
No ratings yet
Pandas 2 Complete Notes Class XII
18 pages
Pandas Cheat Sheet
No ratings yet
Pandas Cheat Sheet
2 pages
PDF&Rendition 1
No ratings yet
PDF&Rendition 1
47 pages
IP Practical File
No ratings yet
IP Practical File
27 pages
Create A DataFrame
No ratings yet
Create A DataFrame
24 pages
Ip Study
No ratings yet
Ip Study
18 pages
Acknowledgement
No ratings yet
Acknowledgement
25 pages
12 Pandas
No ratings yet
12 Pandas
14 pages
Practical File ANKIT RAJ CLASS 12-F
No ratings yet
Practical File ANKIT RAJ CLASS 12-F
48 pages
Python Unit 4&5 Que
No ratings yet
Python Unit 4&5 Que
33 pages
Marking Scheme Practical Paper
No ratings yet
Marking Scheme Practical Paper
5 pages
Transformer Test Report
No ratings yet
Transformer Test Report
17 pages
Ayush IP
No ratings yet
Ayush IP
24 pages
Ip Practical
No ratings yet
Ip Practical
23 pages
Data Aggregation and Group Operations
No ratings yet
Data Aggregation and Group Operations
34 pages
Practical File 2024
No ratings yet
Practical File 2024
25 pages
Practical File 12th
No ratings yet
Practical File 12th
19 pages
F2014L
No ratings yet
F2014L
4 pages
Ip Project
No ratings yet
Ip Project
16 pages
Data Sci
No ratings yet
Data Sci
29 pages
Pandas Practice for Students
No ratings yet
Pandas Practice for Students
12 pages
Data Wrangling - Jupyter Notebook
No ratings yet
Data Wrangling - Jupyter Notebook
5 pages
Fortnightly Test Series 2023 24 - RM (P1) Test 01A
No ratings yet
Fortnightly Test Series 2023 24 - RM (P1) Test 01A
20 pages
Day08-Pandas-Tutorial: Pandas - by Punith V T
No ratings yet
Day08-Pandas-Tutorial: Pandas - by Punith V T
8 pages
Notebook PYTHON DATA SCIENCE
No ratings yet
Notebook PYTHON DATA SCIENCE
16 pages
Pandas
No ratings yet
Pandas
13 pages
Pandas Plots
No ratings yet
Pandas Plots
14 pages
Wa0012.
No ratings yet
Wa0012.
30 pages
Topology Concepts for Math Students
No ratings yet
Topology Concepts for Math Students
45 pages
Pragya File
No ratings yet
Pragya File
31 pages
Pandas
No ratings yet
Pandas
5 pages
Informatics Practices Project
No ratings yet
Informatics Practices Project
20 pages
DHP Journal
No ratings yet
DHP Journal
29 pages
Suryadatta National School Class 12 CBSE Informatics Practices Practicals List
No ratings yet
Suryadatta National School Class 12 CBSE Informatics Practices Practicals List
19 pages
Python Codes
No ratings yet
Python Codes
17 pages
Xii Record (Dataframe & CSV)
No ratings yet
Xii Record (Dataframe & CSV)
11 pages
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
No ratings yet
WEBINTEL GUIDED LAB ACTIVITY Introduction To Pandas
1 page
Journal 12
No ratings yet
Journal 12
54 pages
Functionapplicationp PDF
No ratings yet
Functionapplicationp PDF
6 pages
Delta ASDA A A+ User Manual
No ratings yet
Delta ASDA A A+ User Manual
383 pages
ST Joseph'S Convent Senior Secondary School: Name:-Shatakshi Gaur Class:-Xii Sec:-A Board Roll No.
No ratings yet
ST Joseph'S Convent Senior Secondary School: Name:-Shatakshi Gaur Class:-Xii Sec:-A Board Roll No.
65 pages
Pseudocode and Flow Charts
100% (1)
Pseudocode and Flow Charts
42 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
Time Series Analysis Group 9
No ratings yet
Time Series Analysis Group 9
16 pages
Big Data Computing: Week 8 Quiz
No ratings yet
Big Data Computing: Week 8 Quiz
3 pages
ET - W2021 (2131905) (GTURanker - Com)
No ratings yet
ET - W2021 (2131905) (GTURanker - Com)
2 pages
UV Lab Report - BE
No ratings yet
UV Lab Report - BE
15 pages
Python Pandas Assignment Guide
No ratings yet
Python Pandas Assignment Guide
9 pages
Aging Performance and Moisture Solubility of Veg. Oils For Power Trfs.
No ratings yet
Aging Performance and Moisture Solubility of Veg. Oils For Power Trfs.
6 pages
Electronic Cheat Sheet
No ratings yet
Electronic Cheat Sheet
1 page
Week 4
No ratings yet
Week 4
35 pages
Unit 4th DAQ and Amplifier
No ratings yet
Unit 4th DAQ and Amplifier
88 pages
PM-0.5 MK: - Reference Manual
No ratings yet
PM-0.5 MK: - Reference Manual
7 pages
Fire Protection System
No ratings yet
Fire Protection System
60 pages
JavaScript Global Object and Promise Polyfills
No ratings yet
JavaScript Global Object and Promise Polyfills
88 pages
B11 Building Enviro Systems and Control Exam Questions
No ratings yet
B11 Building Enviro Systems and Control Exam Questions
20 pages
Wave Properties of Light
No ratings yet
Wave Properties of Light
36 pages
Hydroponic Gardening Guide
No ratings yet
Hydroponic Gardening Guide
11 pages
SPPS M1507 D Datasheet
No ratings yet
SPPS M1507 D Datasheet
2 pages
Kathrein 80010430 PDF
No ratings yet
Kathrein 80010430 PDF
1 page
Grade 7 Science: Heat & Energy
No ratings yet
Grade 7 Science: Heat & Energy
9 pages
Cellulose Polymorphy, Crystallite Size, and The Segal
No ratings yet
Cellulose Polymorphy, Crystallite Size, and The Segal
6 pages

L-3 (Data Frame Part 2) .Ipynb - Colab

Uploaded by

L-3 (Data Frame Part 2) .Ipynb - Colab

Uploaded by

3/14/25, 4:36 PM L-3 (Data Frame Part 2).

keyboard_arrow_down Python Data Frames Part 2

#Create a Dictionary of series

Name Age Rating

Name Age Rating

#Pandas Dataframe.pop() : Deletes a COLUMN from the dataframe

RollNo Name Eco Maths

# Dataframe.drop() - Does not change the dataframe unless inplace = True

# Dataframe.drop() for dropping a column

RollNo Name Eco

marks.drop("Eco", axis=1, inplace = True)

# Extracting rows using Pandas .loc[] .iloc[]

# create a sample dataframe

age section city gender favourite_color

print(data.loc[1:3]) # Observe that the entire range of row labels is displayed

age section city gender favourite_color

print(data.loc[[1,4,5]]) # list of row labels

age section city gender favourite_color

# select all rows with a condition in a column

Data with age greater than 15

age section city gender favourite_color

age section city gender favourite_color

# select few columns with a condition

# Update the values of a particular column on selected rows

age section city gender favourite_color

# update multiple columns with condition

age section city gender favourite_color

age section city gender favourite_color

data.loc[0:2] # error as labels are 'a' to 'g'

age section city gender favourite_color

# select rows with particular indexes and particular columns

# select a range of rows

age section city gender favourite_color

# select a range of rows and columns

# changing index labels and column labels with rename()

marks.rename(columns={'RollNo':'Roll Num', 'Name':'Student Name','Eco':'Economics','Maths':'Mathematics'} ,

Roll Num Student Name

You might also like