Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
28 views4 pages

Weak and Bright Student Assignment

Qb

Uploaded by

ahmadtaqu38
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
28 views4 pages

Weak and Bright Student Assignment

Qb

Uploaded by

ahmadtaqu38
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

A group of two Weak Students must take the help of one Bright Student to

solve these questions.

12 Application-Level:

1. List the examples of common stream sources in a data stream management system.
CO4, PO1
2. Explain the challenges faced in stream processing.
CO4, PO2
3. Demonstrate how the Bloom filter can be used to filter streams with an example.
CO4, PO5
4. Describe the Count-Distinct Problem and its relevance in stream analytics.
CO4, PO1
5. Explain the space requirements for the Flajolet-Martin Algorithm.
CO4, PO2
6. List the advantages of using sampling techniques in stream data processing.
CO4, PO7
7. Describe how the Datar-Gionis-Indyk-Motwani (DGIM) Algorithm handles the counting
of ones in a window.
CO4, PO3
8. Explain how decaying windows are used in stream processing.
CO4, PO5
9. List the key differences between content-based recommendations and collaborative
filtering.
CO5, PO1
10. Describe the role of social graphs in recommendation systems.
CO5, PO6
11. Explain how clustering techniques can be used for social network graphs.
CO5, PO4
12. Demonstrate how product recommendations are generated using collaborative filtering
in an e-commerce platform.
CO5, PO5

8 Higher-Order Thinking Skills Questions:

1. Define collaborative filtering. Using an example of an e-commerce site like Flipkart or


Amazon, describe how it can be used to provide user recommendations.
CO5, PO11
2. Determine communities for the given social network graph using the Girvan-Newman
algorithm.
CO5, PO2
3. Suppose the stream is S=(2,1,6,1,5,9,2,3,5)S = (2, 1, 6, 1, 5, 9, 2, 3,
5)S=(2,1,6,1,5,9,2,3,5). Let hash functions h(x)=ax+bmod 16h(x) = ax + b \mod
16h(x)=ax+bmod16 for some aaa and bbb, and treat the result as a 4-bit binary integer.
Show how the Flajolet-Martin algorithm will estimate the number of distinct elements,
h(x)=4x+1mod 16h(x) = 4x + 1 \mod 16h(x)=4x+1mod16.
CO4, PO4
4. With a neat sketch, explain the architecture of the data-stream management system
(DSMS).
CO4, PO3
5. Analyze the advantages and limitations of the DGIM algorithm for answering queries
about the number of ones in a stream window.
CO4, PO5
6. Illustrate how clustering techniques can be applied to discover communities in a large
social network graph.
CO5, PO4
7. Develop a product recommendation system using a combination of content-based and
collaborative filtering techniques for an online retail platform.
CO5, PO11
8. Explain how decaying windows can be used to track the popularity of a topic in real-time
social media streams.
CO4, PO7

1. Flajolet-Martin Algorithm: Estimating Distinct Elements

Given a data stream S=(7,4,2,7,9,2,5,4,8,9)S = (7, 4, 2, 7, 9, 2, 5, 4, 8, 9)S=(7,4,2,7,9,2,5,4,8,9),


use the hash function h(x)=(3x+1)mod 16h(x) = (3x + 1) \mod 16h(x)=(3x+1)mod16 to compute
the binary hash values. Based on the Flajolet-Martin algorithm, estimate the number of distinct
elements in the stream.

2. Bloom Filter: Testing Membership

Suppose a Bloom filter is created with 5 hash functions and a bit array of size 20. You insert the
following elements into the filter: A,B,CA, B, CA,B,C. After inserting, you want to test if the
element DDD is in the set. The hash functions h1,h2,…,h5h_1, h_2, \dots, h_5h1​,h2​,…,h5​
return the following bit positions for DDD: 2, 7, 12, 15, and 18. Check whether DDD is likely in
the set or not.

3. DGIM Algorithm: Counting Ones in a Window

A binary stream consists of the following sequence of bits: S=101110010111S =


101110010111S=101110010111. You are using the DGIM algorithm to approximate the number
of 1s in the last 10 bits of the stream. Using the DGIM method of representing consecutive ones
in buckets, calculate the approximate number of ones in the last 10 bits of the stream.
4. Degree Calculation

Consider a small social network graph with 6 nodes connected as follows:

1–2,1–3,2–4,3–5,4–6,5–6

Calculate the degree for the all the nodes.

5. Flajolet-Martin Algorithm: Combining Estimates

Consider three different hash functions h1,h2,h3h_1, h_2, h_3h1​,h2​,h3​, applied to a


stream of data resulting in the following maximum number of trailing zeros for each hash
function: 3, 4, and 2 respectively. Using the Flajolet-Martin algorithm, combine these
estimates to get a final estimate for the number of distinct elements in the stream.

Course Outcomes

CSC702 .1 Understand the building blocks of Big Data Analytics.


CSC702 .2 Apply fundamental enabling techniques like Hadoop and MapReduce in solving real
world problems.
CSC702 .3 Understand different NoSQL systems and how it handles big data.
CSC702 .4 Apply advanced techniques for emerging applications like stream analytics.
CSC702 .5 Achieve adequate perspectives of big data analytics in various applications like
recommender systems, social media applications, etc.
CSC702 .6 Apply statistical computing techniques and graphics for analyzing big data.

Program Outcomes

Apply the knowledge of mathematics, science, engineering fundamentals, and


Engineering
PO1 an engineering specialization to the solution of complex engineering
knowledge problems.

Identify, formulate, review research literature, and analyse complex


PO2 Problem analysis Engineering problems reaching substantiated conclusions using first principles
of mathematics, natural sciences, and engineering sciences.

Design solutions for complex engineering problems and design system


Design/development components or processes that meet the specified needs with appropriate
PO3 consideration for the public health and safety, and the cultural, societal, and
of solutions
environmental considerations.

Conduct Use research-based knowledge and research methods including design of


PO4 investigations of experiments, analysis and interpretation of data, and synthesis of the
complex problems information to provide valid conclusions.

PO5 Modern tool usage Create, select, and apply appropriate techniques, resources, and modern
engineering and IT tools including prediction and modelling to complex
engineering activities with an understanding of the limitations.

Apply reasoning informed by the contextual knowledge to assess societal,


The engineer and
PO6 health, safety, legal and cultural issues and the consequent responsibilities
society relevant to the professional engineering practice.

Understand the impact of the professional engineering solutions in societal


Environment and
PO7 and environmental contexts, and demonstrate the knowledge of, and need for
sustainability sustainable development.

Apply ethical principles and commit to professional ethics and responsibilities


PO8 Ethics and norms of the engineering practice.

Individual and Function effectively as an individual, and as a member or leader in diverse


PO9 teams, and in multidisciplinary settings.
team work

Communicate effectively on complex engineering activities with the


engineering community and with society at large, such as, being able to
PO10 Communication comprehend and write effective reports and design documentation, make
effective presentations, and give and receive clear instructions.

Demonstrate knowledge and understanding of the engineering and


Project management
PO11 management principles and apply these to one’s own work as a member and
and finance leader in a team, to manage projects and in multidisciplinary environments.

Recognize the need for, and have the preparation and ability to engage in
PO12 Life-long learning independent and life-long learning in the broadest context of technological
change.

You might also like