0% found this document useful (0 votes)

2K views6 pages

Week 2 Lab: Data Analysis Insights

This document summarizes the key points of a data analysis lab on flight data: - The lab involves analyzing flight data from February to SFO and calculating summary statistics and measures of central tendency for arrival delays. - When grouping the data by carrier, Frontier Airlines had the highest interquartile range of arrival delays. - Questions were answered about average and median departure delays from NYC airports by month, with January having the highest average and March the highest median. - Based on on-time departure percentage, LGA would be the best NYC airport to fly out of.

Uploaded by

Sampada Desai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2K views6 pages

Week 2 Lab: Data Analysis Insights

Uploaded by

Sampada Desai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6



Week 2 Lab: Introduction to Data

 10/10 points earned (100%)

Quiz passed!

Back to Week 2

1/1
 points

1.
Create a new data frame that includes ights headed to SFO in February,
and save this data frame assfo_feb_ ights. How many ights meet these
criteria?

32735

Correct Response

1345

3563

2286

1/1
 points

Make a histogram and calculate appropriate summary statistics for arrival

Make a histogram and calculate appropriate summary statistics for arrival
delays of sfo_feb_ ights. Which of the following is false?

No ight is delayed more than 2 hours.

Correct Response

The distribution has several extreme values on the right side.

The distribution is right skewed.

The distribution is unimodal.

More than 50% of ights arrive on time or earlier than scheduled.

1/1
 points

3.
Calculate the median and interquartile range for arr_delays of ights in the
sfo_feb_ ights data frame, grouped by carrier. Which carrier has the highest
IQR of arrival delays?

JetBlue Airways

Frontier Airlines

American Airlines

Virgin America

Delta and United Airlines

Correct Response

1/1
 points

Which month has the highest average departure delay from an NYC airport?
Which month has the highest average departure delay from an NYC airport?

July

Correct Response

January

March

October

December

1/1
 points

5.
Which month has the highest median departure delay from an NYC airport?

October

January

July

December

Correct Response

March

1/1
 points

6.
Is the mean or the median a more reliable measure for deciding which
month(s) to avoid ying if you really dislike delayed ights, and why?

Mean would be more reliable as the distribution of delays is

Mean would be more reliable as the distribution of delays is
symmetric.

Median would be more reliable as the distribution of delays is

symmetric.

Median would be more reliable as the distribution of delays is

skewed.

Correct Response

Mean would be more reliable as it gives us the true average.

Both give us useful information.

1/1
 points

7.
If you were selecting an airport simply based on on time departure
percentage, which NYC airport would you choose to y out of?

LGA

Correct Response

JFK

EWR

1/1
 points

Mutate the data frame so that it includes a new variable that contains the
Mutate the data frame so that it includes a new variable that contains the
average speed, avg_speed traveled by the plane for each journey (in mph).
What is the tail number of the plane with the fastest avg_speed? Hint:
Average speed can be calculated as distance divided by number of hours of
travel, and note that air_time is given in minutes. If you just want to show the
avg_speed and tailnum and none of the other variables, use the select
function at the end of your pipe to select just these two variables with
select(avg_speed, tailnum). You can google this tail number to nd out more
about the aircraft.

N779JB

N959UW

N755US

N666DN

Correct Response

N947UW

1/1
 points

9.
Make a scatterplot of avg_speed vs. distance. Which of the following is true
about the relationship between average speed and distance.

The relationship is linear.

As distance increases the average speed of ights decreases.

The distribution of distances are uniform over 0 to 5000 miles.

There is an overall positive association between distance and

average speed.

Correct Response

There are no outliers.

1/1
 points

10.
Suppose you de ne a ight to be “on time” if it gets to the destination on
time or earlier than expected, regardless of any departure delays. Mutate
the data frame to create a new variable called arr_type with levels "on
time"and "delayed" based on this de nition. Then, determine the on time
arrival percentage based on whether the ight departed on time or not.
What proportion of ights that were "delayed" departing arrive "on time"?
(answer should be in the form 0.## where ## is between 2 and 7 decimal
places, inclusive)

0.1833639

Correct Response

  

Statistics Assignment
No ratings yet
Statistics Assignment
17 pages
MCD2080 Business Statistics Group Assignment-Final
No ratings yet
MCD2080 Business Statistics Group Assignment-Final
5 pages
Business Statistics Final Exam Solutions PDF
No ratings yet
Business Statistics Final Exam Solutions PDF
10 pages
Tutoring Session 2023 - Statistics For Business
No ratings yet
Tutoring Session 2023 - Statistics For Business
65 pages
USTH Exercise B1 Chapter1
No ratings yet
USTH Exercise B1 Chapter1
5 pages
HW Ses3 Solutions
No ratings yet
HW Ses3 Solutions
5 pages
Chapter - Two - Data Types and Data Collection v2
No ratings yet
Chapter - Two - Data Types and Data Collection v2
51 pages
Predicting Flight Delays
No ratings yet
Predicting Flight Delays
6 pages
Intro To Data Coursera
No ratings yet
Intro To Data Coursera
9 pages
SNU Assignment 1
No ratings yet
SNU Assignment 1
3 pages
Main Summary
No ratings yet
Main Summary
19 pages
Tutorial 9
No ratings yet
Tutorial 9
1 page
Data Presentation Final
No ratings yet
Data Presentation Final
14 pages
18BCE10291 - Outliers Assignment
No ratings yet
18BCE10291 - Outliers Assignment
10 pages
EEE356 MidtermExam 2024 Questions
No ratings yet
EEE356 MidtermExam 2024 Questions
6 pages
Introduction To Data-2
No ratings yet
Introduction To Data-2
13 pages
TLCH
No ratings yet
TLCH
15 pages
Flight Price Prediction Project Presentation
No ratings yet
Flight Price Prediction Project Presentation
15 pages
AA Flight Delay Factors Analysis
No ratings yet
AA Flight Delay Factors Analysis
11 pages
620 Case Study2
No ratings yet
620 Case Study2
2 pages
Exercises 01
No ratings yet
Exercises 01
2 pages
GNR 652 Assignment 2
No ratings yet
GNR 652 Assignment 2
4 pages
Report
No ratings yet
Report
25 pages
Fair Airline Price
No ratings yet
Fair Airline Price
21 pages
Ormulate The Data Science Problem
No ratings yet
Ormulate The Data Science Problem
5 pages
Airline Data Analysis
No ratings yet
Airline Data Analysis
20 pages
DMcase 2
No ratings yet
DMcase 2
5 pages
KrutikaKolhe 862467252 HW4
No ratings yet
KrutikaKolhe 862467252 HW4
16 pages
MachineLearningBigR Tutorial
No ratings yet
MachineLearningBigR Tutorial
5 pages
HW1 - Predicting Airfares On New Routes - JJ - (2024 Spring)
No ratings yet
HW1 - Predicting Airfares On New Routes - JJ - (2024 Spring)
3 pages
Descriptive Statistics, Hypothesis Testing, and Basic
No ratings yet
Descriptive Statistics, Hypothesis Testing, and Basic
62 pages
cl1 Aer
No ratings yet
cl1 Aer
4 pages
Presentation Mvda-1
No ratings yet
Presentation Mvda-1
18 pages
Topics
No ratings yet
Topics
11 pages
CSE1703 - Fundamental of Data Science
No ratings yet
CSE1703 - Fundamental of Data Science
6 pages
Data Analysis Problems
No ratings yet
Data Analysis Problems
12 pages
LAB1 - Descriptive Statistics
No ratings yet
LAB1 - Descriptive Statistics
4 pages
Analysis of Factors in Flight Delay: Yiyang Xu, Luyao Liu, Xichen Gao and Fanyu Frank Zeng
No ratings yet
Analysis of Factors in Flight Delay: Yiyang Xu, Luyao Liu, Xichen Gao and Fanyu Frank Zeng
7 pages
Week2 Cheat Sheet Data Wrangling With Tidyverse
No ratings yet
Week2 Cheat Sheet Data Wrangling With Tidyverse
4 pages
Data Collection & Univariate Analysis
No ratings yet
Data Collection & Univariate Analysis
196 pages
Course: Applied Statistics Projects: Bui Anh Tuan March 1, 2022
No ratings yet
Course: Applied Statistics Projects: Bui Anh Tuan March 1, 2022
9 pages
STAT721 Test1 2022 Solutions
No ratings yet
STAT721 Test1 2022 Solutions
5 pages
Lab 06
No ratings yet
Lab 06
2 pages
GROUP 07 CLASS CC02 Ê
No ratings yet
GROUP 07 CLASS CC02 Ê
36 pages
Flight Delays and Passenger Preferences: An Axiomatic Approach
No ratings yet
Flight Delays and Passenger Preferences: An Axiomatic Approach
25 pages
Predicting Flight Delays
No ratings yet
Predicting Flight Delays
5 pages
Practical 9 - Time-Series Forecasting
No ratings yet
Practical 9 - Time-Series Forecasting
5 pages
Lab EDA and Hypothesis Testing
No ratings yet
Lab EDA and Hypothesis Testing
2 pages
Loading Datasets From Excel/CSV: A) Local R Database Dataset
No ratings yet
Loading Datasets From Excel/CSV: A) Local R Database Dataset
4 pages
Group 07 Class CC02
No ratings yet
Group 07 Class CC02
38 pages
IT223 - Assignment #1
No ratings yet
IT223 - Assignment #1
2 pages
Data Analysis with R for Beginners
No ratings yet
Data Analysis with R for Beginners
4 pages
22CS5PEDEV
No ratings yet
22CS5PEDEV
5 pages
Presentation On Flight Price Prediction
No ratings yet
Presentation On Flight Price Prediction
30 pages
Assignment1 Code and Conclude DSA Nikhil Mishra
No ratings yet
Assignment1 Code and Conclude DSA Nikhil Mishra
36 pages
CS3352 Foundations of Data Science Apr May 2024 Question Paper Download 2
No ratings yet
CS3352 Foundations of Data Science Apr May 2024 Question Paper Download 2
7 pages
Task 4P-1
No ratings yet
Task 4P-1
5 pages
BDM - Mining Over Datasets
No ratings yet
BDM - Mining Over Datasets
20 pages
Fly or Drive Lab
No ratings yet
Fly or Drive Lab
6 pages
Apprenticeship - From Theory To Method and Back Again (Suny - Michael W - Coy
No ratings yet
Apprenticeship - From Theory To Method and Back Again (Suny - Michael W - Coy
336 pages
Design of Cable Trench
78% (9)
Design of Cable Trench
4 pages
Admission To Foundation Program of Tianjin University 2025
No ratings yet
Admission To Foundation Program of Tianjin University 2025
5 pages
TNCT Q1 COT On Roles of Parts of A Whole
No ratings yet
TNCT Q1 COT On Roles of Parts of A Whole
43 pages
Heat Transfer CHE F241: Basic Concepts
No ratings yet
Heat Transfer CHE F241: Basic Concepts
36 pages
+3 Final - Programme - 2015
No ratings yet
+3 Final - Programme - 2015
4 pages
Time Table Summer 2024 SMME V1.1
No ratings yet
Time Table Summer 2024 SMME V1.1
1 page
Expanded World Creation for SWN
No ratings yet
Expanded World Creation for SWN
8 pages
Size 365 Days of Mathematics For Class X A5
No ratings yet
Size 365 Days of Mathematics For Class X A5
202 pages
Zyzz Bible
No ratings yet
Zyzz Bible
66 pages
Organization and Management: Schools Division of Dipolog City Dipolog City Government
No ratings yet
Organization and Management: Schools Division of Dipolog City Dipolog City Government
20 pages
Unit Test Integral Calculus Set A
No ratings yet
Unit Test Integral Calculus Set A
4 pages
Concave vs Convex Mirror Quiz
100% (4)
Concave vs Convex Mirror Quiz
5 pages
Full Chapter of Social Psychology 10th Edition by Saul Kassin Ebook and TestBank Bundle EPUB DOCX PDF Download Now
No ratings yet
Full Chapter of Social Psychology 10th Edition by Saul Kassin Ebook and TestBank Bundle EPUB DOCX PDF Download Now
405 pages
European Steel and Alloy Grades: 10crmo9-10 (1.7380)
No ratings yet
European Steel and Alloy Grades: 10crmo9-10 (1.7380)
3 pages
Question 1213992
No ratings yet
Question 1213992
6 pages
Thermo - 6
0% (1)
Thermo - 6
14 pages
Grade 5 Term 3 Lessons Plans
No ratings yet
Grade 5 Term 3 Lessons Plans
132 pages
Exploring Grammar in Writing Article
No ratings yet
Exploring Grammar in Writing Article
4 pages
2007 02 17 GENV Cofimvaba Landfill Site Phase 2
No ratings yet
2007 02 17 GENV Cofimvaba Landfill Site Phase 2
42 pages
Exhumation Report: Etafuleni Cemetery
No ratings yet
Exhumation Report: Etafuleni Cemetery
5 pages
English Portfolio Class 12
No ratings yet
English Portfolio Class 12
8 pages
Mpisane Bonga Basil 2015
No ratings yet
Mpisane Bonga Basil 2015
86 pages
MCV4U Chapter 1 Assignment - A9d0c550a10552355394502 - 240213 - 153054
No ratings yet
MCV4U Chapter 1 Assignment - A9d0c550a10552355394502 - 240213 - 153054
2 pages
(Gabarito Dia 1) Quarto Bernoulli 2022
No ratings yet
(Gabarito Dia 1) Quarto Bernoulli 2022
50 pages
Remote Sensing: Radiometric Variations of On-Orbit FORMOSAT-5 RSI From Vicarious and Cross-Calibration Measurements
No ratings yet
Remote Sensing: Radiometric Variations of On-Orbit FORMOSAT-5 RSI From Vicarious and Cross-Calibration Measurements
18 pages
Potentiometre
No ratings yet
Potentiometre
3 pages
JavaScript Developer I Demo
No ratings yet
JavaScript Developer I Demo
5 pages
Development of A Pico-Hydro Electric Generator Wit
No ratings yet
Development of A Pico-Hydro Electric Generator Wit
10 pages