Problem1:: Pharmaceuticals - Csv. For Each Firm, The Following Variables Are Recorded

The document discusses using hierarchical cluster analysis on two datasets: the first contains crime, poverty, and income data for US cities, with the analysis finding 3 clusters; the second contains financial data for pharmaceutical companies, with the analysis identifying 4 clusters based on variables like market capitalization, profit margins, and growth rates. Association rule mining is also applied to course enrollment data, identifying the 6 strongest relationships between combinations of statistics courses.

Uploaded by

sumit kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

794 views5 pages

Problem1:: Pharmaceuticals - Csv. For Each Firm, The Following Variables Are Recorded

Uploaded by

sumit kumar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

Problem1:

Given data set (raw1) talk about crime, poverty and income of different US cities.
Use Hierarchical cluster analysis to explore and analyze the given dataset as follows:
a. Which method you are Using and Why?
b. How many clusters? And why
c. Distribution of cluster
Ans:
I have used Agglomerative clustering. This is because agglomerative method is less
rigid than divisive clustering. I have used the hclust function for the same. The code
we used for doing this in R is as follows:

>input <- read.csv("raw1.csv", header = T)

> dim(input)
[1] 41 4
> mydata <- input[1:41, 2:4]
> dim(mydata)
[1] 41 3
> normalized_data <- scale(mydata)
> View(normalized_data)
>d<-dist(normalized_data,method=”Euclidean”)
> hc <- hclust(d, method = "complete")
> plot(hc)
> plot(hc,labels = input$City,hang = -4)

Problem2
Pharmaceutical Industry. An equities analyst is studying the pharmaceutical industry and
would like your help in exploring and understanding the financial data collected by her firm.
Her main objective is to understand the structure of the pharmaceutical industry using some
basic financial measures.
Financial data gathered on 21 firms in the pharmaceutical industry are available in the file
Pharmaceuticals.csv. For each firm, the following variables are recorded:
1. Market capitalization (in billions of dollars)
2. Beta
3. Price/earnings ratio
4. Return on equity
5. Return on assets
6. Asset turnover
7. Leverage
8. Estimated revenue growth
9. Net profit margin
10. Median recommendation (across major brokerages)
11. Location of firm’s headquarters
12. Stock exchange on which the firm is listed
Use Hierarchical cluster analysis to explore and analyze the given dataset as follows:
a. Use only the numerical variables (1 to 9) to cluster the 21 firms. Justify the various
Choices made in conducting the cluster analysis, such as weights for different variables,
The specific clustering algorithm(s) used, the number of clusters formed, and so on.
b. Interpret the clusters with respect to the numerical variables used in forming the
Clusters.
c. Is there a pattern in the clusters with respect to the numerical variables (10 to 12)?
(Those not used in forming the clusters)

Ans:

> data<-read.csv("Pharmaceuticals.csv",header=T)

> data

> dim(data)

> pharma<-data[1:21,1:9]

> pharma

> pharma<-data[1:21,3:11]

> abdata<-scale(pharma)

> View(abdata)

> fit<-hclust(abdata)

> d<-dist(abdata,method="euclidean")

> plot(d)

> fit<-hclust(d)

> plot(fit)

> plot(fit,labels=data$Market_Cap,hang=-1)

> plot(fit,labels=data$Name,hang=-1)

> plot(fit,labels=data$Location,hang=-1)
C.
Problem3
Identifying Course Combinations. The Institute for Statistics Education at Statistics.com
offers online courses in statistics and analytics, and is seeking information that will help in
packaging and sequencing courses. Consider the data in the file Course-Topics.csv, the first
few rows of which are shown in below Table (Coursetopics.csv). These data are for
purchases of online statistics courses at Statistics.com. Each row represents the courses
attended by a single customer. The firm wishes to assess alternative sequencings and
bundling of courses. Use association rules to analyse these data, and draw course frequency
and find out top 6 rules based on lift value using support and confidence are .01 and .5
respectively.

ct <- read.csv("Coursetopics.csv")
ct.mat <- as.matrix(ct)
ct.tran <- as(ct.mat,"transactions")
inspect(ct.tran)
rules.all <- apriori(ct.tran,parameter=list(minlen=2, supp=0.01,conf=0.5))
inspect(sort(rules.all,by="lift")[1:6])

Hedge Fund Setup & Analysis
100% (1)
Hedge Fund Setup & Analysis
32 pages
KVA Anusha - PGP12021 - BA
100% (1)
KVA Anusha - PGP12021 - BA
8 pages
The Book of Me - Life Coach Yourself To Success PDF
0% (1)
The Book of Me - Life Coach Yourself To Success PDF
192 pages
Day Trading Volume Analysis Guide
100% (1)
Day Trading Volume Analysis Guide
5 pages
RFBT Republic Act No11232 Revised Corporation Code of The Philippines With Answers Compress
No ratings yet
RFBT Republic Act No11232 Revised Corporation Code of The Philippines With Answers Compress
63 pages
Demutualization of Stock Exchange
No ratings yet
Demutualization of Stock Exchange
5 pages
Case Study - Joy of Running PDF
No ratings yet
Case Study - Joy of Running PDF
4 pages
Finance Terminology - List of Financial Terms With Examples
No ratings yet
Finance Terminology - List of Financial Terms With Examples
7 pages
PGP12101 B Akula Padma Priya DA
No ratings yet
PGP12101 B Akula Padma Priya DA
20 pages
Indirect and Mutual Holdings: Answers To Questions 1
No ratings yet
Indirect and Mutual Holdings: Answers To Questions 1
29 pages
Case Study: Appex Corporation: Team # 9
No ratings yet
Case Study: Appex Corporation: Team # 9
10 pages
Hernandez Nievera Vs Hernandea
No ratings yet
Hernandez Nievera Vs Hernandea
1 page
Executive Shirt Company
No ratings yet
Executive Shirt Company
6 pages
SecB Group7 ODD Case
No ratings yet
SecB Group7 ODD Case
2 pages
Marketing to Rural India
No ratings yet
Marketing to Rural India
3 pages
Netflix's Financial Growth Analysis
100% (1)
Netflix's Financial Growth Analysis
27 pages
Cluster Analysis in R TML
No ratings yet
Cluster Analysis in R TML
5 pages
Polaroid Debt Strategy & Rating Improvement
No ratings yet
Polaroid Debt Strategy & Rating Improvement
8 pages
Building Effective One On One Work Relationship 20110818
No ratings yet
Building Effective One On One Work Relationship 20110818
1 page
Lululemon Final Report
No ratings yet
Lululemon Final Report
27 pages
SEM I Protect Your Company or Cousin
No ratings yet
SEM I Protect Your Company or Cousin
9 pages
Group3 - Pilgrim Bank (A) Customer Profitability
No ratings yet
Group3 - Pilgrim Bank (A) Customer Profitability
13 pages
05-24-01 Frankfurter PDF
No ratings yet
05-24-01 Frankfurter PDF
13 pages
MCQ's On Financial Management
No ratings yet
MCQ's On Financial Management
22 pages
Iifl Jack - 1-2
No ratings yet
Iifl Jack - 1-2
42 pages
Cost of Capital
No ratings yet
Cost of Capital
39 pages
Document Behavioral Finance Notes...
No ratings yet
Document Behavioral Finance Notes...
11 pages
Exhibit 3 - Materials Inventory in 2010 (April 2010 - March 2011)
100% (1)
Exhibit 3 - Materials Inventory in 2010 (April 2010 - March 2011)
54 pages
Class 10 - Statistics SMT1-2019 2020
No ratings yet
Class 10 - Statistics SMT1-2019 2020
109 pages
James Burke: A Career in American Business: Organizational Behaviour-II Group:B
100% (1)
James Burke: A Career in American Business: Organizational Behaviour-II Group:B
6 pages
100% (2)
11 pages
Group 2 - Appex Corporation
100% (3)
Group 2 - Appex Corporation
12 pages
What Do You Think Hilton Leadership Should Do After The Blackstone Acquisition? Should They Further Invest in CRM or Simply Maintain The Status Quo?
No ratings yet
What Do You Think Hilton Leadership Should Do After The Blackstone Acquisition? Should They Further Invest in CRM or Simply Maintain The Status Quo?
1 page
ISM Case Analysis (Cisco Systems) : Group 13 Section - A
No ratings yet
ISM Case Analysis (Cisco Systems) : Group 13 Section - A
6 pages
Acc 411 Advanced Financial Accounting 2
No ratings yet
Acc 411 Advanced Financial Accounting 2
99 pages
CP Damini Jyoti PDF
100% (1)
CP Damini Jyoti PDF
90 pages
Case Title: Wendy Peterson Situation Analysis
No ratings yet
Case Title: Wendy Peterson Situation Analysis
1 page
ELP Update VCC Structure Recommended For Funds in GIFT City
No ratings yet
ELP Update VCC Structure Recommended For Funds in GIFT City
10 pages
0092aSCM in HAL
No ratings yet
0092aSCM in HAL
17 pages
Game Theory Exam Scenarios
No ratings yet
Game Theory Exam Scenarios
3 pages
Coca Cola - Portfolio Project
No ratings yet
Coca Cola - Portfolio Project
15 pages
Summary 1
No ratings yet
Summary 1
5 pages
Psi Case Study (Epgp-11-026)
No ratings yet
Psi Case Study (Epgp-11-026)
2 pages
Hyrule Cinema Pricing Strategy Analysis
100% (1)
Hyrule Cinema Pricing Strategy Analysis
14 pages
CH 032
No ratings yet
CH 032
57 pages
PGP2 Nict 2013PGPM039
No ratings yet
PGP2 Nict 2013PGPM039
5 pages
Hewlitt Corp Financial Planning Guide
No ratings yet
Hewlitt Corp Financial Planning Guide
2 pages
The Medicines Company Presentation Final Original
100% (1)
The Medicines Company Presentation Final Original
24 pages
Metropolitan Research Inc. Case Study
No ratings yet
Metropolitan Research Inc. Case Study
6 pages
Question: Erika and Kitty, Who Are Twins, Just Received $30,000 Each For Their 25th Birthday. They Both Hav..
No ratings yet
Question: Erika and Kitty, Who Are Twins, Just Received $30,000 Each For Their 25th Birthday. They Both Hav..
4 pages
The Decision Dilemma Part A
0% (1)
The Decision Dilemma Part A
4 pages
Project Report On Investment Banking and Financial Markets in Kotak Mahindra Bank
No ratings yet
Project Report On Investment Banking and Financial Markets in Kotak Mahindra Bank
10 pages
Optimizing CRU Rental Profitability
No ratings yet
Optimizing CRU Rental Profitability
5 pages
VAR Package Pricing at Mission Hospital
No ratings yet
VAR Package Pricing at Mission Hospital
6 pages
Mygola's Future in Online Travel Planning
No ratings yet
Mygola's Future in Online Travel Planning
5 pages
Chapter 15 Solutions: Solutions To Questions For Review and Discussion
No ratings yet
Chapter 15 Solutions: Solutions To Questions For Review and Discussion
28 pages
Genentech - Capacity Planning: A Case Study On
No ratings yet
Genentech - Capacity Planning: A Case Study On
7 pages
Precise Software Insight Strategy
No ratings yet
Precise Software Insight Strategy
4 pages
A00006 PDF Eng
No ratings yet
A00006 PDF Eng
20 pages
Chapter13 Slides
No ratings yet
Chapter13 Slides
24 pages
Three Squirrels and A Pile of Nuts
No ratings yet
Three Squirrels and A Pile of Nuts
6 pages
L19 - Chi Square Test 1
No ratings yet
L19 - Chi Square Test 1
17 pages
Moore Medical's System Upgrade Choices
No ratings yet
Moore Medical's System Upgrade Choices
8 pages
Aravind Eye Care Model Analysis
100% (1)
Aravind Eye Care Model Analysis
8 pages
Advanced Marketing Management: Case Study: Culinarian Cookware: Pondering Price Promotion
No ratings yet
Advanced Marketing Management: Case Study: Culinarian Cookware: Pondering Price Promotion
3 pages
Xiaomi Ansoff Matrix - NAIM
No ratings yet
Xiaomi Ansoff Matrix - NAIM
2 pages
Master Operations Scheduling Game - Work Sheet: (Write Your Names or Roll No.)
No ratings yet
Master Operations Scheduling Game - Work Sheet: (Write Your Names or Roll No.)
1 page
Moore Medical's CRM & ERP Analysis
No ratings yet
Moore Medical's CRM & ERP Analysis
4 pages
Chapters 9 and 10 Edited
No ratings yet
Chapters 9 and 10 Edited
18 pages
NIFTY Blue Chip Fund Overview
No ratings yet
NIFTY Blue Chip Fund Overview
1 page
End Term Exam
No ratings yet
End Term Exam
9 pages
Pramanik Containers Case Analysis
No ratings yet
Pramanik Containers Case Analysis
7 pages
CLUSTERING ANALYSIS State Wise Health PDF
No ratings yet
CLUSTERING ANALYSIS State Wise Health PDF
14 pages
Donner Group 4
No ratings yet
Donner Group 4
6 pages
Solution - Canonical Decision Problem
No ratings yet
Solution - Canonical Decision Problem
24 pages
Quiz CS
No ratings yet
Quiz CS
1 page
Wausau Equipment Company
No ratings yet
Wausau Equipment Company
6 pages
The Legal Opinion of Advising The Minority Shareholders of TNM Plc.
No ratings yet
The Legal Opinion of Advising The Minority Shareholders of TNM Plc.
8 pages
Republic vs. City of Parañaque
No ratings yet
Republic vs. City of Parañaque
13 pages
History of IDBI Bank
No ratings yet
History of IDBI Bank
65 pages
The Mexico-China Sourcing Game: Teaching Global Dual Sourcing
No ratings yet
The Mexico-China Sourcing Game: Teaching Global Dual Sourcing
8 pages
Group10 Appex Final Final
100% (3)
Group10 Appex Final Final
25 pages
Individual Assignment AFM-A182
No ratings yet
Individual Assignment AFM-A182
2 pages
F105 June 2014 Examiners Report
No ratings yet
F105 June 2014 Examiners Report
14 pages
Business Analytics - Prediction Model
No ratings yet
Business Analytics - Prediction Model
3 pages
Admin Law Cases I (Digests)
No ratings yet
Admin Law Cases I (Digests)
5 pages
Lim, Duran & Associates For Petitioner. Renato J. Dilag For Private Respondent
No ratings yet
Lim, Duran & Associates For Petitioner. Renato J. Dilag For Private Respondent
6 pages
Clean Edge
No ratings yet
Clean Edge
9 pages
Classic Knitwear Case Analysis
No ratings yet
Classic Knitwear Case Analysis
5 pages

Problem1:: Pharmaceuticals - Csv. For Each Firm, The Following Variables Are Recorded

Uploaded by

Problem1:: Pharmaceuticals - Csv. For Each Firm, The Following Variables Are Recorded

Uploaded by

Problem1:

>input <- read.csv("raw1.csv", header = T)

You might also like