DIPLOMA IN
Coding and
Technology
Lesson: Big Data
Traditional data vs big data
Tools for big data
Applications for big data
Lesson Objectives
Traditional data vs
big data
DID YOU
KNOW?
Fun fact
Every two days we generate
as much data as we did from
the beginning of time up until
2003.
(Source: Disruptor Daily)
What is big data?
Big data is characterised by it’s large
volume of diverse data, from various
sources with distributed and
decentralised control.
Traditional data
• Data is structured
• Much smaller size
• Often centralised
• Easier to manipulate
• Compatible with traditional
database
Big data
• Unstructured
• Significantly larger than traditional
data
• Data sources are distributed
• Much more complex to manipulate
• Requires special database tools
Processing and
storing big data
Processing and storing big data is not
as straightforward as with traditional
databases.
• Data sources from various
repositories
• Data is not standardised
• Requires more computing power
• Unknown correlations to reveal
meaningful information
Processing
and storing
big data
(Source: Data Warehouse Information)
The three
V’s of big
data
Most organisations, big Volume Velocity Variety
or small manage a
considerable amount of
data generated through
its various data sources
and business processes.
Tools for big data
Tools to manage
big data
Apache Hadoop MongoDB KNIME
Extraction, transform
and load (ETL)
ETL refers to the process of retrieving
data from a number or sources into a
data warehouse.
• Extraction
• Transform
• Load
Data visualisation
tools
Data visualisation enables us to
perform graphical representations of
information and data - allowing us to
identify trends and patterns in our
data.
Types of data
visualisation
Dashboards Graphs Infographics
Business
intelligence (BI)
BI includes the use of technologies
and skills for the collection,
integration and analysis of
organisational information. BI
systems are data driven systems that
support executive decisions.
Trends in business
intelligence
• Predictive analytics
• Operational business
intelligence
• Increased AI adoption
• Digitisation and competitive
advantage
Applications of big data
Data-driven
decisions
• Which data to acquire?
• How to represent data?
• How to extract, clean and store
data?
• Which tools to use to analyse
data?
Data-driven
decisions
(Source: Elgendy and Elragal)
Big data in social
media
• Online forums
• Social media services
- Facebook
- Twitter
- Instagram
• Web blogs
Big data in e-commerce
• Transaction records
• Business reviews
• Membership records
• Credit card fraud
• Marketing analysis
Big data in
healthcare
• Digital medical records
• Disease outbreak tracking
• Epidemiology data
• Biomedical data