0% found this document useful (0 votes)

11 views4 pages

2.data Frame Selection and Indexing

Uploaded by

Md. Mizanur Rahman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views4 pages

2.data Frame Selection and Indexing

Uploaded by

Md. Mizanur Rahman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Data Frame Selection and Indexing

We've seen how to call built-in data frames and how to create them using data.frame() along with vectors. Let's revisit our weather data
frame and learn how to select elements from within the dataframe using bracket notation:

In [1]:

# Some made up weather data

days <- c('mon','tue','wed','thu','fri')
temp <- c(22.2,21,23,24.3,25)
rain <- c(TRUE, TRUE, FALSE, FALSE, TRUE)

# Pass in the vectors:

df <- data.frame(days,temp,rain)

In [2]:
df

Out[2]:

days temp rain

1 mon 22.2 TRUE

2 tue 21 TRUE

3 wed 23 FALSE

4 thu 24.3 FALSE

5 fri 25 TRUE

We can use the same bracket notation we used for matrices:

df[rows,columns]

In [4]:

# Everything from first row

df[1,]
Out[4]:

days temp rain

1 mon 22.2 TRUE

In [5]:

#Everything from first column

df[,1]

Out[5]:

mon tue wed thu fri

In [6]:

# Grab Friday data

df[5,]

Out[6]:

days temp rain

5 fri 25 TRUE

Selecting using column names

Here is where data frames become very powerful, we can use column names to select data for the columns instead of having to
remember numbers. So for example:
In [8]:

# All rain values

df[,'rain']

Out[8]:

TRUE TRUE FALSE FALSE TRUE

In [11]:

# First 5 rows for days and temps

df[1:5,c('days','temp')]

Out[11]:

days temp

1 mon 22.2

2 tue 21

3 wed 23

4 thu 24.3

5 fri 25

If you want all the values of a particular column you can use the dollar sign directly after the dataframe as follows:

df.name$column.name

In [12]:

df$rain

Out[12]:

TRUE TRUE FALSE FALSE TRUE

In [15]:

df$days

Out[15]:

mon tue wed thu fri

You can also use bracket notation to return a data frame format of the same information:

In [14]:

df['rain']

Out[14]:

rain

1 TRUE

2 TRUE

3 FALSE

4 FALSE

5 TRUE

In [18]:

df['days']
Out[18]:

days

1 mon

2 tue

3 wed

4 thu

5 fri

Filtering with a subset condition

We can use the subset() function to grab a subset of values from our data frame based off some condition. So for example, imagin we
wanted to grab the days where it rained (rain=True), we can use the subset() function as follows:

In [19]:

subset(df,subset=rain==TRUE)

Out[19]:

days temp rain

1 mon 22.2 TRUE

2 tue 21 TRUE

5 fri 25 TRUE

Notice how the condition uses some sort of comparison operator, in the above case ==. Let's grab days where the temperature was
greater than 23:

In [20]:

subset(df,subset= temp>23)

Out[20]:

days temp rain

4 thu 24.3 FALSE

5 fri 25 TRUE

Another thing to note is that we didn't pass in the column name as a character string, subset knows that you are referring to a column in
that data frame.

Odering a Data Frame

We can sort the order of our data frame by using the order function. You pass in the column you want to sort by into the order()
function, then you use that vector to select from the dataframe. Let's see an example of sorting by the temperature:

In [28]:
sorted.temp <- order(df['temp'])

In [29]:

df[sorted.temp,]
Out[29]:

days temp rain

2 tue 21 TRUE

1 mon 22.2 TRUE

3 wed 23 FALSE

4 thu 24.3 FALSE

5 fri 25 TRUE
Let's take a look at what sorted.temp actually is:

In [30]:
sorted.temp

Out[30]:

2 1 3 4 5

Ok, so we are just asking for those index elements in that order (by default ascending, we can pass a negative sign to do descending
order):

In [31]:

desc.temp <- order(-df['temp'])

In [32]:
df[desc.temp,]

Out[32]:

days temp rain

5 fri 25 TRUE

4 thu 24.3 FALSE

3 wed 23 FALSE

1 mon 22.2 TRUE

2 tue 21 TRUE

We could have also used the other column selection methods we learned:

In [34]:

sort.temp <- order(df$temp)

df[sort.temp,]
Out[34]:

days temp rain

2 tue 21 TRUE

1 mon 22.2 TRUE

3 wed 23 FALSE

4 thu 24.3 FALSE

5 fri 25 TRUE

That's it for data frames! We will definitely revisit this and explore data frames A LOT more, but we should test you understanding first! Up
next an exercise!

Factors
No ratings yet
Factors
23 pages
MATH115F19Outline PDF
No ratings yet
MATH115F19Outline PDF
2 pages
R Data Subsetting & Manipulation Guide
No ratings yet
R Data Subsetting & Manipulation Guide
44 pages
DSCI 100 Cheat Sheet
No ratings yet
DSCI 100 Cheat Sheet
3 pages
Statistics With R Week 3
No ratings yet
Statistics With R Week 3
3 pages
Dplyr Grammar for Data Wrangling
No ratings yet
Dplyr Grammar for Data Wrangling
21 pages
R Programming Cont..
No ratings yet
R Programming Cont..
24 pages
MIT 302 - Statistical Computing II - Tutorial 02
No ratings yet
MIT 302 - Statistical Computing II - Tutorial 02
5 pages
Refer Slide Time: 00:12
No ratings yet
Refer Slide Time: 00:12
11 pages
Module 8
No ratings yet
Module 8
59 pages
R Cheat Sheets for ECON1267
No ratings yet
R Cheat Sheets for ECON1267
13 pages
DV Lab
No ratings yet
DV Lab
52 pages
Data
No ratings yet
Data
40 pages
Tutorial-Introduction To Dplyr
No ratings yet
Tutorial-Introduction To Dplyr
54 pages
Pandas Row/Column Selection Guide
No ratings yet
Pandas Row/Column Selection Guide
7 pages
Intro To Data Science Lecture 4
No ratings yet
Intro To Data Science Lecture 4
13 pages
Important R Codes and Notes
No ratings yet
Important R Codes and Notes
13 pages
R Data Frame - Javatpoint
No ratings yet
R Data Frame - Javatpoint
14 pages
R Functions
No ratings yet
R Functions
8 pages
10 Minutes To Pandas - Pandas 2.1.1 Documentation
No ratings yet
10 Minutes To Pandas - Pandas 2.1.1 Documentation
24 pages
R Data Handling Guide
No ratings yet
R Data Handling Guide
16 pages
Unit 2 Reading and Writing Files
No ratings yet
Unit 2 Reading and Writing Files
33 pages
Reshape2 - R - Flexibly Reshape Data - A Reboot of The Reshape Package
No ratings yet
Reshape2 - R - Flexibly Reshape Data - A Reboot of The Reshape Package
14 pages
6 Working With Data Frames in R
No ratings yet
6 Working With Data Frames in R
8 pages
MLStack Cafe 2
No ratings yet
MLStack Cafe 2
11 pages
fancyDPLYR Funcs
No ratings yet
fancyDPLYR Funcs
31 pages
Phan Project2 Report
No ratings yet
Phan Project2 Report
10 pages
05getting Started With Pandas
No ratings yet
05getting Started With Pandas
44 pages
Chapter 1 - Part 2 - DataFrame
No ratings yet
Chapter 1 - Part 2 - DataFrame
48 pages
Iloc and Loc Uses PDF
No ratings yet
Iloc and Loc Uses PDF
16 pages
Comparison With R / R Libraries
No ratings yet
Comparison With R / R Libraries
12 pages
Data Manipulation With Pandas
No ratings yet
Data Manipulation With Pandas
39 pages
R Language PDF
100% (1)
R Language PDF
619 pages
Expt. No. Basic Math Date
No ratings yet
Expt. No. Basic Math Date
24 pages
What Is A Data Frame in R?
No ratings yet
What Is A Data Frame in R?
5 pages
Pandas Notes
No ratings yet
Pandas Notes
4 pages
Cas13 R ch2 3.R
No ratings yet
Cas13 R ch2 3.R
7 pages
Module IV
No ratings yet
Module IV
43 pages
DataFrame Ac Win Final
No ratings yet
DataFrame Ac Win Final
30 pages
Study Guide Data Manipulation With R
No ratings yet
Study Guide Data Manipulation With R
4 pages
Pandas Quick Start Guide
No ratings yet
Pandas Quick Start Guide
23 pages
Gries Stefan Thomas (2013) - Statistics For Linguistics With R - 2
No ratings yet
Gries Stefan Thomas (2013) - Statistics For Linguistics With R - 2
100 pages
R Basic and Advanced
No ratings yet
R Basic and Advanced
9 pages
Dataframes-I (Create - Selection)
No ratings yet
Dataframes-I (Create - Selection)
12 pages
Class 05-Case Study
No ratings yet
Class 05-Case Study
6 pages
BigData - BCom Unit 4
No ratings yet
BigData - BCom Unit 4
9 pages
Lecture 5 (Managing and Understanding Data)
No ratings yet
Lecture 5 (Managing and Understanding Data)
9 pages
Dev Record Aids
No ratings yet
Dev Record Aids
24 pages
Python For Data Science 1662157639
No ratings yet
Python For Data Science 1662157639
6 pages
Introduction To Dplyr
No ratings yet
Introduction To Dplyr
9 pages
Pandas Dataframe
No ratings yet
Pandas Dataframe
8 pages
Dev Lab Manual Org
No ratings yet
Dev Lab Manual Org
28 pages
Cleaning Data in R
No ratings yet
Cleaning Data in R
9 pages
Pandas Cheat Sheet for Data Manipulation
No ratings yet
Pandas Cheat Sheet for Data Manipulation
1 page
Cleaning Data
No ratings yet
Cleaning Data
17 pages
Fonction Dplyr
No ratings yet
Fonction Dplyr
5 pages
Basic Data Objects in R
No ratings yet
Basic Data Objects in R
18 pages
Code Basics & Data Manipulation With R: Literature: Wickham & Grolemund R For Data Science Ch. 3, 16
No ratings yet
Code Basics & Data Manipulation With R: Literature: Wickham & Grolemund R For Data Science Ch. 3, 16
31 pages
Pure VS Applied
No ratings yet
Pure VS Applied
1 page
Objective Based
No ratings yet
Objective Based
1 page
Play - No Play - Data
No ratings yet
Play - No Play - Data
1 page
PMASDS - Payment Notice
No ratings yet
PMASDS - Payment Notice
1 page
Makeup Mid Exam
No ratings yet
Makeup Mid Exam
1 page
Bus Schedule Friday
No ratings yet
Bus Schedule Friday
1 page
CDA Assignment4
No ratings yet
CDA Assignment4
12 pages
Lecture On Q - Control
No ratings yet
Lecture On Q - Control
8 pages
Stages ResearchProcess ImranKhan
No ratings yet
Stages ResearchProcess ImranKhan
8 pages
BIA vs. RA: Key Differences Explained
No ratings yet
BIA vs. RA: Key Differences Explained
1 page
Statistics Basics for Beginners
No ratings yet
Statistics Basics for Beginners
53 pages
CIA Triad
No ratings yet
CIA Triad
1 page
RAW Data
No ratings yet
RAW Data
22 pages
Lecture 4. Dispersion
No ratings yet
Lecture 4. Dispersion
6 pages
Monetary and Financial System (MAFS)
No ratings yet
Monetary and Financial System (MAFS)
183 pages
Neural Networks Part-1
No ratings yet
Neural Networks Part-1
88 pages
Module Mathematics
No ratings yet
Module Mathematics
5 pages
Grade12preboardexamination Markingscheme Maths Seta
No ratings yet
Grade12preboardexamination Markingscheme Maths Seta
20 pages
Matrix Exponent Explained
No ratings yet
Matrix Exponent Explained
29 pages
Mortenson, Michael E. - Mathematics For Computer Graphics Applications PDF
100% (2)
Mortenson, Michael E. - Mathematics For Computer Graphics Applications PDF
368 pages
Laplace Expansion Theorem
No ratings yet
Laplace Expansion Theorem
9 pages
Module 3 Vector Spaces and Linear Transformations
No ratings yet
Module 3 Vector Spaces and Linear Transformations
37 pages
QR Factorization Chapter4
0% (1)
QR Factorization Chapter4
12 pages
Generalized Inverses: How To Invert A Non-Invertible Matrix
No ratings yet
Generalized Inverses: How To Invert A Non-Invertible Matrix
9 pages
MATLAB Programming Tutorial - Version 05-: Electromagnetic Fields Theory (BEE3113)
100% (1)
MATLAB Programming Tutorial - Version 05-: Electromagnetic Fields Theory (BEE3113)
32 pages
Unit 3 2-D CST Element
No ratings yet
Unit 3 2-D CST Element
68 pages
KTU Kannur
No ratings yet
KTU Kannur
62 pages
Novel Coordinate Transformations For Antenna Applications: IEEE Transactions On Antennas and Propagation January 1985
No ratings yet
Novel Coordinate Transformations For Antenna Applications: IEEE Transactions On Antennas and Propagation January 1985
7 pages
Mathematics - A Course in Fluid Mechanics With Vector Field
No ratings yet
Mathematics - A Course in Fluid Mechanics With Vector Field
198 pages
Assignment Linear Algebra 1
No ratings yet
Assignment Linear Algebra 1
2 pages
Module 4.1 Introduction To Signal Space
No ratings yet
Module 4.1 Introduction To Signal Space
52 pages
Traffic Flow Matrix
No ratings yet
Traffic Flow Matrix
10 pages
Dama50 Unit3n
No ratings yet
Dama50 Unit3n
36 pages
Recursive Matrix Formulas for B-Splines
No ratings yet
Recursive Matrix Formulas for B-Splines
7 pages
Jacobian Algorithm
No ratings yet
Jacobian Algorithm
6 pages
Compensation Methods For Network Solutions by Optimally Ordered Triangular Factorization
No ratings yet
Compensation Methods For Network Solutions by Optimally Ordered Triangular Factorization
5 pages
Advanced Matrix Concepts
No ratings yet
Advanced Matrix Concepts
5 pages
Control Systems Lab: Introduction To MATLAB
No ratings yet
Control Systems Lab: Introduction To MATLAB
50 pages
Class 12 Maths Notes Chapter - 3 Matrices
No ratings yet
Class 12 Maths Notes Chapter - 3 Matrices
60 pages
CH 3 Matrices Multiple Choice Questions With Answers PDF
100% (1)
CH 3 Matrices Multiple Choice Questions With Answers PDF
3 pages
Mat Dipsy LL
No ratings yet
Mat Dipsy LL
2 pages
Linear Algebra Basics for Computing
No ratings yet
Linear Algebra Basics for Computing
148 pages
L2 - Simultaneous Equation Solution
No ratings yet
L2 - Simultaneous Equation Solution
10 pages
Continuity, Differentiability & Matrices MCQs
No ratings yet
Continuity, Differentiability & Matrices MCQs
11 pages
Eigenvalues Eigenvectors and Differential Equations
No ratings yet
Eigenvalues Eigenvectors and Differential Equations
56 pages

2.data Frame Selection and Indexing

Uploaded by

2.data Frame Selection and Indexing

Uploaded by

Data Frame Selection and Indexing

# Some made up weather data

# Pass in the vectors:

days temp rain

1 mon 22.2 TRUE

4 thu 24.3 FALSE

We can use the same bracket notation we used for matrices:

# Everything from first row

days temp rain

1 mon 22.2 TRUE

#Everything from first column

mon tue wed thu fri

# Grab Friday data

days temp rain

Selecting using column names

# All rain values

TRUE TRUE FALSE FALSE TRUE

# First 5 rows for days and temps

TRUE TRUE FALSE FALSE TRUE

mon tue wed thu fri

Filtering with a subset condition

days temp rain

1 mon 22.2 TRUE

days temp rain

4 thu 24.3 FALSE

Odering a Data Frame

days temp rain

1 mon 22.2 TRUE

4 thu 24.3 FALSE

desc.temp <- order(-df['temp'])

days temp rain

4 thu 24.3 FALSE

1 mon 22.2 TRUE

sort.temp <- order(df$temp)

days temp rain

1 mon 22.2 TRUE

4 thu 24.3 FALSE

You might also like