
Project Handbook GFQR 1026

GFQR 1026 Big Data in X


(Second Semester, 2023-2024)
Group Project Handbook

Overview:
The Group Project is a course requirement and carries 25% of the overall course assessment. Each
group must consist of NO MORE THAN 4 students from your own section, and the project group must be
the same as your case study group. Once you have enrolled in a group, your instructors will be
permitted to share your email address with the other members of that group. If you wish to opt out
of email sharing, please send a consent form to your instructors before the enrollment deadline,
indicating that you do not give permission for email sharing.

The purpose of the group project is to test how well students understand the concepts and skills taught
in the course and can apply them to analyse a real case. Students are required to select a topic of
interest, clean and analyse the data, and draw conclusions from the findings. During the analysis, you
need to examine large and varied data sets to gain insights into what they contain, such as hidden
patterns, unknown correlations, market trends and customer preferences. For example, you may study a
company's sales performance in 2022/23, or how well Instagram can be used to sell cosmetics.

Two datasets (Life Expectancy (WHO) and Computer Education Survey) are provided. Students may use
one of the given datasets or find their own (no extra marks will be given for doing so). If you want
to use your own dataset, you must email your instructor for approval; otherwise, marks will be
deducted.

References for the datasets:


1. https://www.kaggle.com/datasets/kumarajarshi/life-expectancy-who/
2. https://data.world/technology/stack-overflow-developer-survey/workspace/file?filename=2021-survey_results_public.csv

Requirements:
1. Group Report
Prepare a tidy report of around 2000 words in MS Word format (file name gpxx-report.docx, where xx
is your group number). The report must include the following:
a) Introduction (dataset introduction)
b) The characteristics of the Big Data (4Vs) in the industry/field you choose (Health/Education)
c) The benefits of using the Big Data in that industry/field (Health/Education)

©COMP, HKBU 2024. All Rights Reserved. This content is copyright protected and shall not be shared, uploaded or
distributed.

d) Objectives of your study


• Describe what hidden patterns, correlations, market trends and customer preferences you want
to find out.
• Explore other insights that can be obtained from the dataset.
e) Data cleaning and transformation using Excel
• Describe the data used in your study and how they are related to your study.
• Describe the functions/skills you used in data cleansing or transformation, with some
screenshots captured from the Excel file (e.g. VLOOKUP, grouping, filtering, Text to Columns,
replacing missing values, etc.).
f) Analysis results with appropriate charts
• Descriptive results (fewer than 50 words for each objective)
• Implication, reason or suggestion (fewer than 120 words for each objective)
Notes:
✓ You have to use the SAME software for data analysis and visualization (either Excel or Tableau
but NOT BOTH); otherwise, marks will be deducted. For cleaning, you should use Excel.
✓ Rename the worksheets properly in Excel and (or) Tableau.
✓ Use appropriate chart titles
✓ Include data labels in the charts
✓ Groups of 3-4 members must include at least 6 meaningful objectives. Groups of 1-2 members
must include at least 4 meaningful objectives.
✓ Place the charts in the PowerPoint and Word documents in a readable format (adjust the font size!)
g) Conclusion
h) Individual assessment form (you can download it from buelearning and attach it to the end of
your report; see the sample form at the end of this handbook)

Formatting requirements
✓ Suggested font size is 12 and line spacing is 1.5
✓ Use consistent font and font size
✓ Insert a cover page (by using Insert → Cover Page) that includes the project topic, the course
name and section number, the group number, and the members' names and SID numbers
✓ Refer to the sample report for more details

2. Oral presentation
a) Prepare a PowerPoint file (with the file name gpxx-report.pptx, where xx is your group number)
for the oral presentation.
b) Only include background information, objectives, data analysis with charts and insights
c) Each student will be given an individual oral presentation score. All students must present
in English. (No marks will be given if you fail to attend the oral presentation.)
d) The duration of the presentation is around 10-15 minutes including Q & A

Submission:
Deadline: 8 April 2024 5pm (Mon.)

Submit softcopies of the following files to BU MOODLE (no hardcopy submission is required):

1. A Word file (gpxx-report.docx): put it in the Turnitin submission box [1]


2. An Excel file (gpxx-report.xlsx): show how you clean and analyze data (with pivot tables and charts)
3. A Tableau file (gpxx-report.twbx): show how you analyze the data (with charts)
4. A PowerPoint file (gpxx-report.pptx) for oral presentation
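
As a quick check of the naming pattern above, the following is a minimal Python sketch that builds the expected file names for one group. It is only an illustration: the handbook does not say how xx is formatted, so zero-padding to two digits is an assumption, and the function name and the used_tableau flag are hypothetical.

```python
def expected_filenames(group_number: int, used_tableau: bool = False) -> list[str]:
    """Build the submission file names described in the list above."""
    # Assumption: "xx" is the group number zero-padded to two digits (7 -> "07").
    prefix = f"gp{group_number:02d}-report"
    extensions = [".docx", ".xlsx", ".pptx"]
    if used_tableau:
        # Assumption: the Tableau workbook is only relevant if Tableau was used.
        extensions.insert(2, ".twbx")
    return [prefix + ext for ext in extensions]


# Example: group 7, analysis done in Tableau.
print(expected_filenames(7, used_tableau=True))
```

Confirm the exact format of xx with your instructor if in doubt.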

Note:
✓ The files gpxx-report.xlsx, gpxx-report.twbx and gpxx-report.pptx should be placed in another
submission box, not Turnitin submission box.
✓ As your project files will be collected by computer at 5:00 p.m. sharp on the deadline day and the
network traffic may be very heavy on that day, you are strongly advised to upload your files one
or two days before the deadline
✓ Penalties will be imposed for acts of academic dishonesty such as plagiarism, or the submission
of materials for assessment that are not your own work. A student found to have committed an act
of plagiarism shall receive an “F” grade for the course. You are strongly advised to read the
handbook on Avoiding Plagiarism, especially the section on “The Cost of Plagiarism”. The
handbook is available on the Academic Registry website at
http://buar.hkbu.edu.hk/index.php/current_students_and_alumni/academic_guidelines/avoiding_plagiarism

A serious penalty will be imposed for late submission!

Assessment criteria:
1. Group Report and PowerPoint (21%)
a. Characteristics, Benefits of using the related Big Data
b. Appropriateness of data use and objectives
c. Data cleansing and data transformation
d. Data analysis using the right tool(s) and the right method(s)
e. Result interpretation/Insights
f. PowerPoint design and content
2. Oral presentation (4%)
a. Presentation skills
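
The two weights above add up to the 25% stated in the Overview. As a rough illustration of the arithmetic, here is a minimal Python sketch. It assumes, purely for the example, that each component is marked out of 100; the handbook does not state the marking scale, and the function name is hypothetical.

```python
REPORT_WEIGHT = 21  # Group Report and PowerPoint, % of the course grade
ORAL_WEIGHT = 4     # Oral presentation, % of the course grade


def project_contribution(report_score: float, oral_score: float) -> float:
    """Percentage points the project adds to the overall course grade.

    Assumption: both scores are marks out of 100.
    """
    return (report_score * REPORT_WEIGHT + oral_score * ORAL_WEIGHT) / 100


# The two components together make up the project's 25% weight.
assert REPORT_WEIGHT + ORAL_WEIGHT == 25

# Example: 80/100 for the report and PowerPoint, 70/100 for the oral presentation.
print(project_contribution(80, 70))
```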

[1] Turnitin is an academic plagiarism checker for teachers and students. The first time that you
submit your group report (docx) to Turnitin, you are allowed to see the percentage of similarity.
This percentage will not be released to the student in subsequent submissions.

Individual Assessment Form (sample)


Tasks                                                           Student A   Student B   Student C   Student D

Introduction (company background and the dataset description)

The characteristics of the Big Data (4Vs)

The benefits of using the Big Data

Objectives

Data cleaning and transformation

Analysis results

Implication and suggestion

Conclusion

PowerPoint content and design

Group xx   Student Name

A          Chan Tai Man (xxxxxxxx)
B          Chan Siu Man (xxxxxxxx)
C          Chan Tai Ming (xxxxxxxx)
D          Chan Sai Ming (xxxxxxxx)
