Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
7 views23 pages

Coding & Tech Lesson 5 Slides

The document provides an overview of big data, contrasting it with traditional data by highlighting its unstructured nature, larger volume, and complexity. It discusses tools for managing big data, such as Apache Hadoop and MongoDB, and emphasizes the importance of data visualization and business intelligence in making data-driven decisions. Additionally, it outlines applications of big data across various sectors, including social media, e-commerce, and healthcare.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views23 pages

Coding & Tech Lesson 5 Slides

The document provides an overview of big data, contrasting it with traditional data by highlighting its unstructured nature, larger volume, and complexity. It discusses tools for managing big data, such as Apache Hadoop and MongoDB, and emphasizes the importance of data visualization and business intelligence in making data-driven decisions. Additionally, it outlines applications of big data across various sectors, including social media, e-commerce, and healthcare.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 23

DIPLOMA IN

Coding and
Technology
Lesson: Big Data
Traditional data vs big data
Tools for big data
Applications for big data

Lesson Objectives
Traditional data vs
big data
DID YOU
KNOW?

Fun fact
Every two days we generate
as much data as we did from
the beginning of time up until
2003.
(Source: Disruptor Daily)
What is big data?
Big data is characterised by it’s large
volume of diverse data, from various
sources with distributed and
decentralised control.
Traditional data
• Data is structured
• Much smaller size
• Often centralised
• Easier to manipulate
• Compatible with traditional
database
Big data
• Unstructured
• Significantly larger than traditional
data
• Data sources are distributed
• Much more complex to manipulate
• Requires special database tools
Processing and
storing big data

Processing and storing big data is not


as straightforward as with traditional
databases.
• Data sources from various
repositories
• Data is not standardised
• Requires more computing power
• Unknown correlations to reveal
meaningful information
Processing
and storing
big data

(Source: Data Warehouse Information)


The three
V’s of big
data
Most organisations, big Volume Velocity Variety
or small manage a
considerable amount of
data generated through
its various data sources
and business processes.
Tools for big data
Tools to manage
big data

Apache Hadoop MongoDB KNIME


Extraction, transform
and load (ETL)
ETL refers to the process of retrieving
data from a number or sources into a
data warehouse.
• Extraction
• Transform
• Load
Data visualisation
tools
Data visualisation enables us to
perform graphical representations of
information and data - allowing us to
identify trends and patterns in our
data.
Types of data
visualisation

Dashboards Graphs Infographics


Business
intelligence (BI)

BI includes the use of technologies


and skills for the collection,
integration and analysis of
organisational information. BI
systems are data driven systems that
support executive decisions.
Trends in business
intelligence
• Predictive analytics
• Operational business
intelligence
• Increased AI adoption
• Digitisation and competitive
advantage
Applications of big data
Data-driven
decisions
• Which data to acquire?
• How to represent data?
• How to extract, clean and store
data?
• Which tools to use to analyse
data?
Data-driven
decisions

(Source: Elgendy and Elragal)


Big data in social
media
• Online forums
• Social media services
- Facebook
- Twitter
- Instagram
• Web blogs
Big data in e-commerce
• Transaction records
• Business reviews
• Membership records
• Credit card fraud
• Marketing analysis
Big data in
healthcare
• Digital medical records
• Disease outbreak tracking
• Epidemiology data
• Biomedical data

You might also like