Pandas DataFrame Notes - 12pages-Pages-4

The document provides various methods for selecting, modifying, and managing rows in a DataFrame using pandas. It covers techniques such as slicing by label/index, appending rows, dropping rows, boolean selection, and sorting. Additionally, it includes traps and considerations for handling row indices and duplicates.

Uploaded by

Sàazón Kasula

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views1 page

Pandas DataFrame Notes - 12pages-Pages-4

Uploaded by

Sàazón Kasula

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

Select a slice of rows by label/index

Working with rows [inclusive-from : inclusive–to [ : step]]

df = df['a':'c'] # rows 'a' through 'c'
Get the row index and labels Trap: cannot work for integer labelled rows – see
idx = df.index # get row index previous code snippet on integer position slicing.
label = df.index[0] # first row label
label = df.index[-1] # last row label Append a row of column totals to a DataFrame
l = df.index.tolist() # get as a list # Option 1: use dictionary comprehension
a = df.index.values # get as an array sums = {col: df[col].sum() for col in df}
sums_df = DataFrame(sums,index=['Total'])
Change the (row) index df = df.append(sums_df)
df.index = idx # new ad hoc index
df = df.set_index('A') # col A new index # Option 2: All done with pandas
df = df.set_index(['A', 'B']) # MultiIndex df = df.append(DataFrame(df.sum(),
df = df.reset_index() # replace old w new columns=['Total']).T)
# note: old index stored as a col in df
df.index = range(len(df)) # set with list Iterating over DataFrame rows
df = df.reindex(index=range(len(df))) for (index, row) in df.iterrows(): # pass
df = df.set_index(keys=['r1','r2','etc']) Trap: row data type may be coerced.
df.rename(index={'old':'new'}, inplace=True)
Sorting DataFrame rows values
Adding rows df = df.sort(df.columns[0],
df = original_df.append(more_rows_in_df) ascending=False)
Hint: convert row to a DataFrame and then append. df.sort(['col1', 'col2'], inplace=True)
Both DataFrames should have same column labels.
Sort DataFrame by its row index
Dropping rows (by name) df.sort_index(inplace=True) # sort by row
df = df.drop('row_label') df = df.sort_index(ascending=False)
df = df.drop(['row1','row2']) # multi-row
Random selection of rows
Boolean row selection by values in a column import random as r
df = df[df['col2'] >= 0.0] k = 20 # pick a number
df = df[(df['col3']>=1.0) | (df['col1']<0.0)] selection = r.sample(range(len(df)), k)
df = df[df['col'].isin([1,2,5,7,11])] df_sample = df.iloc[selection, :] # get copy
df = df[~df['col'].isin([1,2,5,7,11])] Note: this randomly selected sample is not sorted
df = df[df['col'].str.contains('hello')]
Trap: bitwise "or", "and" “not; (ie. | & ~) co-opted to be Drop duplicates in the row index
Boolean operators on a Series of Boolean df['index'] = df.index # 1 create new col
Trap: need parentheses around comparisons. df = df.drop_duplicates(cols='index',
take_last=True)# 2 use new col
Selecting rows using isin over multiple columns del df['index'] # 3 del the col
# fake up some data df.sort_index(inplace=True)# 4 tidy up
data = {1:[1,2,3], 2:[1,4,9], 3:[1,8,27]}
df = DataFrame(data) Test if two DataFrames have same row index
len(a)==len(b) and all(a.index==b.index)
# multi-column isin
lf = {1:[1, 3], 3:[8, 27]} # look for Get the integer position of a row or col index label
f = df[df[list(lf)].isin(lf).all(axis=1)] i = df.index.get_loc('row_label')
Trap: index.get_loc() returns an integer for a unique
Selecting rows using an index match. If not a unique match, may return a slice/mask.
idx = df[df['col'] >= 2].index
print(df.ix[idx]) Get integer position of rows that meet condition
a = np.where(df['col'] >= 2) #numpy array
Select a slice of rows by integer position
[inclusive-from : exclusive-to [: step]] Test if the row index values are unique/monotonic
start is 0; end is len(df)
if df.index.is_unique: pass # ...
df = df[:] # copy entire DataFrame b = df.index.is_monotonic_increasing
df = df[0:2] # rows 0 and 1 b = df.index.is_monotonic_decreasing
df = df[2:3] # row 2 (the third row)
df = df[-1:] # the last row
Find row index duplicates
df = df[:-1] # all but the last row
if df.index.has_duplicates:
df = df[::2] # every 2nd row (0 2 ..)
print(df.index.duplicated())
Trap: a single integer without a colon is a column label
Note: also similar for column label duplicates.
for integer numbered columns.
Version 30 April 2017 - [Draft – Mark Graph – mark dot the dot graph at gmail dot com – @Mark_Graph on twitter]
4

Python Cheat Sheet 2.0
100% (1)
Python Cheat Sheet 2.0
10 pages
Become A Ninja With Vue (Ninja Squad) (Z-Library)
No ratings yet
Become A Ninja With Vue (Ninja Squad) (Z-Library)
399 pages
Python & Data Science Cheat Sheet
100% (4)
Python & Data Science Cheat Sheet
11 pages
Day7 PandasCoreFeatures
No ratings yet
Day7 PandasCoreFeatures
4 pages
Python For Data Science 1662157639
No ratings yet
Python For Data Science 1662157639
6 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
13 pages
Pandas Part-2
No ratings yet
Pandas Part-2
9 pages
Pandas
No ratings yet
Pandas
5 pages
Python Interviews
No ratings yet
Python Interviews
154 pages
100 Pandas Puzzles
No ratings yet
100 Pandas Puzzles
20 pages
Pandas Cheat Sheet
100% (1)
Pandas Cheat Sheet
2 pages
Exp 3
No ratings yet
Exp 3
10 pages
Python & Pandas Cheat Sheet Guide
100% (2)
Python & Pandas Cheat Sheet Guide
5 pages
12 Pandas
No ratings yet
12 Pandas
9 pages
Pandas Merged
No ratings yet
Pandas Merged
2 pages
Unit3 - 3) Pandas - Ipynb - Colab
No ratings yet
Unit3 - 3) Pandas - Ipynb - Colab
11 pages
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
100% (1)
Cheat Sheet: The Pandas Dataframe Object: Preliminaries Get Your Data Into A Dataframe
12 pages
Revision Notes DataFrame XII IP
No ratings yet
Revision Notes DataFrame XII IP
8 pages
Data Analysis Tools
No ratings yet
Data Analysis Tools
26 pages
Pandas
No ratings yet
Pandas
44 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Pandas DataFrame Cheat Sheet
100% (1)
Pandas DataFrame Cheat Sheet
10 pages
Pandas DataFrame Cheat Sheet
No ratings yet
Pandas DataFrame Cheat Sheet
4 pages
Pandas Data Wrangling Cheat Sheet
100% (2)
Pandas Data Wrangling Cheat Sheet
6 pages
Python Cheat Sheets
97% (33)
Python Cheat Sheets
11 pages
Commands SQL, Python (BASICS)
No ratings yet
Commands SQL, Python (BASICS)
7 pages
Data Analysis With Python
No ratings yet
Data Analysis With Python
60 pages
Pandas
No ratings yet
Pandas
13 pages
Cheat Sheet Template
No ratings yet
Cheat Sheet Template
3 pages
Pandas DataFrame Notes
No ratings yet
Pandas DataFrame Notes
10 pages
Fundamental - Python
No ratings yet
Fundamental - Python
3 pages
Pandas DataFrame Notes
100% (1)
Pandas DataFrame Notes
10 pages
Data Handling for Data Scientists
No ratings yet
Data Handling for Data Scientists
163 pages
Cheat Sheet
No ratings yet
Cheat Sheet
10 pages
Pandas & PyNumS Essentials
No ratings yet
Pandas & PyNumS Essentials
10 pages
Unit IV
No ratings yet
Unit IV
49 pages
05getting Started With Pandas
No ratings yet
05getting Started With Pandas
44 pages
Pandas
No ratings yet
Pandas
27 pages
Content Pandas Cheat Sheet
No ratings yet
Content Pandas Cheat Sheet
9 pages
Unit 2 notes-II
No ratings yet
Unit 2 notes-II
47 pages
Lab 1 ML Lab
No ratings yet
Lab 1 ML Lab
15 pages
Dataframe Ip
No ratings yet
Dataframe Ip
75 pages
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
No ratings yet
Cheat Sheet: The Pandas Dataframe Object I: Preliminaries Get Your Data Into A Dataframe
12 pages
Cheat Python
No ratings yet
Cheat Python
8 pages
Python Pandas-Data Frames
No ratings yet
Python Pandas-Data Frames
41 pages
Ip Study
No ratings yet
Ip Study
18 pages
Pandas: Import
100% (1)
Pandas: Import
13 pages
Pandas
No ratings yet
Pandas
1 page
Pandas Cheat Sheet for Data Manipulation
No ratings yet
Pandas Cheat Sheet for Data Manipulation
1 page
DataFrames Continued
No ratings yet
DataFrames Continued
9 pages
Python Pandas and DataFrame Basics
No ratings yet
Python Pandas and DataFrame Basics
20 pages
Cheat Sheet
No ratings yet
Cheat Sheet
12 pages
Add and Modifying Rows Renaming
No ratings yet
Add and Modifying Rows Renaming
4 pages
Numpy Boolean Indexing: Filter
No ratings yet
Numpy Boolean Indexing: Filter
39 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
AI & Data Science Lab Record
No ratings yet
AI & Data Science Lab Record
28 pages
Seismic Behaviors and Resilient Capacity of CFRP-confined Concrete Columns
No ratings yet
Seismic Behaviors and Resilient Capacity of CFRP-confined Concrete Columns
12 pages
Seismic Performance Assessment of A
No ratings yet
Seismic Performance Assessment of A
19 pages
Marconite - Earthing Compounds - Granular Marconite Compound Earthing
No ratings yet
Marconite - Earthing Compounds - Granular Marconite Compound Earthing
8 pages
An Economic Evaluation System For Building Construction Projects in The Conceputal Phase
No ratings yet
An Economic Evaluation System For Building Construction Projects in The Conceputal Phase
6 pages
Current Transformer Basics - Understanding Ratio, Polarity, and Class
No ratings yet
Current Transformer Basics - Understanding Ratio, Polarity, and Class
25 pages
The Z-Transform and Discrete Functions: Z KT X KT X T X Z X
No ratings yet
The Z-Transform and Discrete Functions: Z KT X KT X T X Z X
5 pages
TJ Bodies Place Demands Before Extending Term: Kathmandu
No ratings yet
TJ Bodies Place Demands Before Extending Term: Kathmandu
12 pages
How To Play The Back
No ratings yet
How To Play The Back
7 pages
Relative Reference: Month Total Income Total Expense Net Income
No ratings yet
Relative Reference: Month Total Income Total Expense Net Income
13 pages
Lesson 3 Control Structures C++ For Students
No ratings yet
Lesson 3 Control Structures C++ For Students
20 pages
DATA STRUCTURES DESIGN LAB Manual
No ratings yet
DATA STRUCTURES DESIGN LAB Manual
56 pages
Algol Compiler Message
No ratings yet
Algol Compiler Message
126 pages
ADS (CSS) Final Question Bank Format For (III-I)
No ratings yet
ADS (CSS) Final Question Bank Format For (III-I)
3 pages
AI ML and Data Science PDF
No ratings yet
AI ML and Data Science PDF
11 pages
Lab 03
No ratings yet
Lab 03
9 pages
SE T01 - Pseudo Code I
No ratings yet
SE T01 - Pseudo Code I
10 pages
DowReplayEA Review - mq5
No ratings yet
DowReplayEA Review - mq5
3 pages
Lab - 02 - Nhóm 7
No ratings yet
Lab - 02 - Nhóm 7
6 pages
ARM C Language Extensions - Al Grant
No ratings yet
ARM C Language Extensions - Al Grant
81 pages
Fruit Quality and Defect Image Classification With Conditional GAN Data Augmentation
No ratings yet
Fruit Quality and Defect Image Classification With Conditional GAN Data Augmentation
11 pages
Compiler Design Lab Manual
No ratings yet
Compiler Design Lab Manual
37 pages
Introduction To Algorithm TK13020 - Kelas A
No ratings yet
Introduction To Algorithm TK13020 - Kelas A
12 pages
Strategies and Algorithms For Clustering Large Datasets: A Review
No ratings yet
Strategies and Algorithms For Clustering Large Datasets: A Review
20 pages
The Cache Memory Book (Jim Handy) (Z-Library)
No ratings yet
The Cache Memory Book (Jim Handy) (Z-Library)
331 pages
Atik
No ratings yet
Atik
8 pages
Bca Sep 2024
0% (1)
Bca Sep 2024
24 pages
DLD Mid Term Exam
No ratings yet
DLD Mid Term Exam
2 pages
Pentium - Salient Features
No ratings yet
Pentium - Salient Features
16 pages
Class 12 Computer Science Exam
No ratings yet
Class 12 Computer Science Exam
2 pages
Module 1 - 2021 Scheme
No ratings yet
Module 1 - 2021 Scheme
110 pages
Assignment 4
No ratings yet
Assignment 4
6 pages
PP QB (LR24) Sem 2 Cie1
No ratings yet
PP QB (LR24) Sem 2 Cie1
64 pages
Quine Mclusky Method
No ratings yet
Quine Mclusky Method
12 pages
Short Error Question
No ratings yet
Short Error Question
4 pages
Array Programs For Interviews 1727455838
No ratings yet
Array Programs For Interviews 1727455838
192 pages
Addition of Sparse Matrices
No ratings yet
Addition of Sparse Matrices
24 pages
Open NN
No ratings yet
Open NN
2 pages

Pandas DataFrame Notes - 12pages-Pages-4

Uploaded by

Pandas DataFrame Notes - 12pages-Pages-4

Uploaded by

Select a slice of rows by label/index

Working with rows [inclusive-from : inclusive–to [ : step]]

You might also like