Using Tensor Flow For Zero Day Attack Detection

The paper discusses a methodology for detecting zero-day attacks using machine learning techniques on Twitter data, achieving an 80% success rate. It highlights the strengths of utilizing social media for proactive threat detection while noting limitations such as reliance on Twitter data and the need for more sophisticated detection methods. The authors suggest improvements, including the integration of multiple machine learning models and user metadata analysis to enhance detection accuracy.

Uploaded by

Shaf Alam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views25 pages

Using Tensor Flow For Zero Day Attack Detection

Uploaded by

Shaf Alam

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 25

1

TABLE OF CONTENTS

CRITIQUE OF PAPER.........................................................................................2
DESCRIPTION OF DATASET..............................................................................4
RAW RESULTS OF ALL EXPERIMENTS..............................................................6
NOVELTY TO THE PAPER................................................................................25
CODE.............................................................................................................25
2

CRITIQUE OF PAPER
1. Background and Motivation:
o The paper addresses the challenge of detecting zero-day attacks using
social media data, specifically focusing on Twitter. The motivation lies in
the need for proactive threat detection, especially for emerging risks that
lack pre-existing anti-malware measures.
2. Methodology and Approach:
o The authors proposed using machine learning techniques, particularly
TensorFlow, to analyze Twitter data. Word categorization is employed to
identify vulnerabilities and counteract zero-day attacks swiftly. The
integration of the Natural Language Toolkit (NLTK) aids in extracting
targeted words across various languages. The study demonstrates an
80% success rate in detecting zero-day attacks using their tool.
o The process of data collection from Twitter is described, including the use
of a crawling procedure that mimics human behavior to bypass limitations
in Twitter's search functionality. This is a clever approach, but the paper
does not sufficiently address ethical considerations and the potential
biases introduced by this method. Discussing these aspects would
strengthen the methodology section.
3. Strengths:
o Utilizing social media data as a proactive tool for early detection and
mitigation of zero-day attacks enhances cybersecurity measures. The
deep character-level anomaly detection technique showcased efficacy in
detecting zero-day threats.
o The paper clearly outlines the use of TensorFlow and NLTK for
data processing and analysis, which is a standard and effective
approach for text-based machine learning tasks. The authors also
mention using real Twitter data, which is crucial for real-world
applicability.
3

o Highlights the potential of using publicly available information on

Twitter for early detection of zero-day attacks, which could be
valuable for security researchers and organizations.
4. Limitations and Considerations:
o The study focuses solely on Twitter data, which may not cover all social
media platforms.
o The effectiveness of the NLTK-based word extraction method may vary
across different languages and contexts.
o The success rate of 80% leaves room for improvement, and false
positives/negatives should be carefully evaluated.
o Limitations in the capabilities of the crawlers, such as:

- Unable to refresh at a higher frequency than once per second

- Unable to perform real-time searches

- Unable to directly target specific individuals or groups on Twitter

- Reliance on Twitter's built-in search functionality, which restricted the

crawlers' ability to operate at higher speeds or target specific entities -
Need for more robust computational resources and advanced crawling
techniques to improve the efficiency and effectiveness of data collection in
future research.

- The zero-day detection mechanism presented in this paper use total

linear data set having only one attribute that is key word “zero day” the
more sophisticated ATP use advance tactics, techniques and procedure
not consider in data set used to train the model moreover model is totally
dependent on the keyword search like zero day and similar it is not
detection any zero day on its behavior tactics techniques and procedure
so it is a very wavered approach for detection.
4

DESCRIPTION OF DATASET
1. Dataset Creation:
o The research aimed to generate a dataset for their model, specifically
focusing on social media data.
o They selected Twitter as the social media platform for data collection.
2. Data Collection Approaches:
o To create the dataset, the following approaches were used:
 Crawling Procedure Implementation:
 Robots were programmed to mimic human behavior on
Twitter.
 These robots operated web browsers in a shadow mode,
navigating Twitter as if they were human users.
 Data Extraction:
 The robots browsed through Twitter, identifying relevant data
related to specific keywords (e.g., “zero day”).
 Extracted data was stored in a database for further
processing.
 Human-like Scrolling Behavior:
 The crawling procedure replicated typical human scrolling
behavior on Twitter.
 Unlike the platform’s interface, which limits scrolling based
on scroll-down count, the robots could scroll indefinitely.
 This ensured comprehensive data collection.
 Consideration of Twitter’s Response Time:
 The study accounted for Twitter’s response time during data
collection.
 Understanding and optimizing the crawling process based
on response time were emphasized.

3. Missing Items from Dataset

o Size of the dataset: The paper doesn't mention the number of tweets
collected.
o Time period of data collection: It's unclear when the tweets were
collected.
o Specific keywords used: The exact keywords or search queries used to
gather tweets related to zero-day attacks aren't provided.
o Data labeling: The paper mentions manual intervention for handling
certain cases, implying some level of human labeling for training and/or
evaluating the model. However, the labeling process and the inter-rater
reliability (if multiple annotators were involved) aren't discussed.
4. A graph was created to illustrate the relationship between the model's
performance and Twitter's response time. This visual representation helped in
understanding how the model's effectiveness was influenced by the time it took
for Twitter to respond to requests.
6

RAW RESULTS OF ALL EXPERIMENTS

Using Tensor Flow for Zero Day attack Detection
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25

NOVELTY TO THE PAPER

Instead of relying solely on keyword-based search for data collection, the authors could
explore more sophisticated methods like Combining multiple machine learning
models (e.g., CNN, RNN, Transformer) to improve detection accuracy and robustness
and incorporating techniques to provide insights into why the model classifies certain
tweets as indicative of zero-day attacks, enhancing trust and interpretability. Moreover,
analyzing user metadata (e.g., account age, follower/following ratio, tweet history) to
identify potentially malicious accounts spreading zero-day information can also be
employed\ along with use of computer vision techniques to analyze any content for
potential zero-day information. Methodologies can be explored to integrate the zero-day
detection system with existing security information and event management tools to
provide actionable alerts and facilitate rapid response.

CODE
Code file is submitted along with this paper as a jupyter notebook.

PMC Final Evaluation Report
76% (21)
PMC Final Evaluation Report
50 pages
Hino J08c Engine Manual
0% (1)
Hino J08c Engine Manual
2 pages
E&M Voice Card For LOOP-AM3440-A/B/C User'S Manual
No ratings yet
E&M Voice Card For LOOP-AM3440-A/B/C User'S Manual
32 pages
Lesson4 Peripheral Devices
75% (4)
Lesson4 Peripheral Devices
4 pages
8615 1
100% (1)
8615 1
15 pages
3-SDU Version 5.10 Release Notes
100% (2)
3-SDU Version 5.10 Release Notes
45 pages
TVL CSS11 - Q3 - M1
100% (2)
TVL CSS11 - Q3 - M1
13 pages
Detailed Drawing Exercises: Solidworks Education
No ratings yet
Detailed Drawing Exercises: Solidworks Education
51 pages
RM68120 LCD
No ratings yet
RM68120 LCD
289 pages
Owners' Perspective in Construction
100% (1)
Owners' Perspective in Construction
286 pages
Trimble Guide PDF
No ratings yet
Trimble Guide PDF
60 pages
Grade 11 ICT Collaboration Guide
No ratings yet
Grade 11 ICT Collaboration Guide
4 pages
Parts List Lista de Peças Tdmg30: Toyama Part Number Name Portugues
No ratings yet
Parts List Lista de Peças Tdmg30: Toyama Part Number Name Portugues
18 pages
Technical Words in KiSwahili
100% (1)
Technical Words in KiSwahili
16 pages
Fake News Detection Using Machine Learning: Project Report On
No ratings yet
Fake News Detection Using Machine Learning: Project Report On
57 pages
Untitled
100% (2)
Untitled
66 pages
Detection and Visualization of Misleading Content On Twitter
No ratings yet
Detection and Visualization of Misleading Content On Twitter
16 pages
Spammer Detection and Fake User Identification On Social Networks
No ratings yet
Spammer Detection and Fake User Identification On Social Networks
9 pages
Twitter Suspicious URL Detector
No ratings yet
Twitter Suspicious URL Detector
5 pages
Anomaly Detection in Social Networks Twitter Bot
No ratings yet
Anomaly Detection in Social Networks Twitter Bot
11 pages
SNA Project Presentation
No ratings yet
SNA Project Presentation
18 pages
SNA Group9 Project Report
No ratings yet
SNA Group9 Project Report
5 pages
Cyberspace News Prediction of Text and Image
No ratings yet
Cyberspace News Prediction of Text and Image
53 pages
Machine Learning-Problems
No ratings yet
Machine Learning-Problems
1 page
Zero-Day Attack Paper2
No ratings yet
Zero-Day Attack Paper2
25 pages
Excel Module 1 PPT Presentation
No ratings yet
Excel Module 1 PPT Presentation
31 pages
Tracing Down User and Computer Account Deletion in Active Directory - TechNet Blogs
No ratings yet
Tracing Down User and Computer Account Deletion in Active Directory - TechNet Blogs
4 pages
Tweet
No ratings yet
Tweet
63 pages
The Main Objective Is To Detect The Fake News, Which Is A Classic Text Classification
No ratings yet
The Main Objective Is To Detect The Fake News, Which Is A Classic Text Classification
57 pages
Detecting Fake Social Media Profiles Using Blockchain
No ratings yet
Detecting Fake Social Media Profiles Using Blockchain
21 pages
CASE Study 4 Chapter 9 Pg390 Assignment
No ratings yet
CASE Study 4 Chapter 9 Pg390 Assignment
8 pages
Fake News Synopsis
No ratings yet
Fake News Synopsis
10 pages
About Resource Book Chain Management (SCM) " This Resource Book Has Been Designed According
No ratings yet
About Resource Book Chain Management (SCM) " This Resource Book Has Been Designed According
2 pages
Vaibhav DSBDA Project
No ratings yet
Vaibhav DSBDA Project
16 pages
Detecting Emerging Topics in Social Networks Using Anomaly Detection
No ratings yet
Detecting Emerging Topics in Social Networks Using Anomaly Detection
6 pages
A Comparative Study On Fake Profile Identification Using Different Machine Learning Techniques
No ratings yet
A Comparative Study On Fake Profile Identification Using Different Machine Learning Techniques
11 pages
1 s2.0 S0140366422004248 Main
No ratings yet
1 s2.0 S0140366422004248 Main
11 pages
Kuwait CS
No ratings yet
Kuwait CS
8 pages
Fake Profile Detection in Social Media Using NLP: About The Project
100% (1)
Fake Profile Detection in Social Media Using NLP: About The Project
33 pages
Saloni CSL Report
No ratings yet
Saloni CSL Report
10 pages
MAJOR PROJECT REPORT On Machine Learning Model To Determine Fake News
No ratings yet
MAJOR PROJECT REPORT On Machine Learning Model To Determine Fake News
52 pages
Automated Emerging Cyber Threat Identification and Profiling Based On Natural Language Processing
No ratings yet
Automated Emerging Cyber Threat Identification and Profiling Based On Natural Language Processing
16 pages
Spam Review Detection Using Linguistic Methods For Specified User in Twitter
No ratings yet
Spam Review Detection Using Linguistic Methods For Specified User in Twitter
11 pages
ZTA Architecture - Survey Paper
No ratings yet
ZTA Architecture - Survey Paper
13 pages
Compromised Account Detection On Social Networks
No ratings yet
Compromised Account Detection On Social Networks
11 pages
Thales 20-Watt Base Station V12 Data Sheet 2020-08
No ratings yet
Thales 20-Watt Base Station V12 Data Sheet 2020-08
2 pages
A Comparative Study On Fake Job Post Prediction Using Different Machine Learning Techniques
No ratings yet
A Comparative Study On Fake Job Post Prediction Using Different Machine Learning Techniques
11 pages
Machine Learning-Based Secure Data Acquisition For
No ratings yet
Machine Learning-Based Secure Data Acquisition For
10 pages
Security Incident Response Guide
No ratings yet
Security Incident Response Guide
5 pages
A Framework To Predict Social Crimes Using Twitter Tweets
No ratings yet
A Framework To Predict Social Crimes Using Twitter Tweets
5 pages
Increasing The Veracity of Event Detection On Social Media Networks Through User Trust Modeling
No ratings yet
Increasing The Veracity of Event Detection On Social Media Networks Through User Trust Modeling
8 pages
2 PB
No ratings yet
2 PB
16 pages
Real-Time Hashtag Event Detection
No ratings yet
Real-Time Hashtag Event Detection
8 pages
B3 Twitter Data
No ratings yet
B3 Twitter Data
68 pages
Fin Irjmets1715854730
No ratings yet
Fin Irjmets1715854730
8 pages
Batch 2
No ratings yet
Batch 2
21 pages
Fake Account Detection Using Machine Learning Techniques
100% (1)
Fake Account Detection Using Machine Learning Techniques
7 pages
Fake News Classifier Project Report
No ratings yet
Fake News Classifier Project Report
5 pages
Print Mo Na Toh
No ratings yet
Print Mo Na Toh
56 pages
Hate Speech Detection Using LSTM and NLP Sushan Pratihar 3 Page
No ratings yet
Hate Speech Detection Using LSTM and NLP Sushan Pratihar 3 Page
13 pages
CNN-Based License Plate Recognition
No ratings yet
CNN-Based License Plate Recognition
6 pages
For Fake or Real Disaster Tweet Analysis of Machine Learning Algorithms
No ratings yet
For Fake or Real Disaster Tweet Analysis of Machine Learning Algorithms
23 pages
Twitter Spam Detection Methods
No ratings yet
Twitter Spam Detection Methods
45 pages
A Review On Threat Detection Approaches in Social Networks: Ghadeer Al-Turaif and Fethi Fkih
No ratings yet
A Review On Threat Detection Approaches in Social Networks: Ghadeer Al-Turaif and Fethi Fkih
9 pages
F701 Coalescing Filters Guide
No ratings yet
F701 Coalescing Filters Guide
2 pages
Fake News Detection System Report
No ratings yet
Fake News Detection System Report
29 pages
Sensors 23 01805
No ratings yet
Sensors 23 01805
24 pages
Kodak-Axpert King II Twin 20220531
No ratings yet
Kodak-Axpert King II Twin 20220531
2 pages
A Methodology To Quickly Perform Opinion Mining and Build Supervised Datasets Using Social Networks Mechanics
No ratings yet
A Methodology To Quickly Perform Opinion Mining and Build Supervised Datasets Using Social Networks Mechanics
12 pages
Java Syllabus
No ratings yet
Java Syllabus
4 pages
Embedded Systems Architecture Guide
No ratings yet
Embedded Systems Architecture Guide
24 pages
Fake Account Detection
No ratings yet
Fake Account Detection
33 pages
04-Division 16-Section 16040 Power Monitor-Version 2.0
No ratings yet
04-Division 16-Section 16040 Power Monitor-Version 2.0
5 pages
PICME Case Study Jotun
No ratings yet
PICME Case Study Jotun
2 pages
Analyzing and Ranking Prevalent News Over Social Media
No ratings yet
Analyzing and Ranking Prevalent News Over Social Media
12 pages
Press Brake Basics for Students
No ratings yet
Press Brake Basics for Students
3 pages
Synopsis of Project Work
No ratings yet
Synopsis of Project Work
31 pages
Machine Learning Methods For Secure Internet of Things Against Cyber Threats Synopsis
No ratings yet
Machine Learning Methods For Secure Internet of Things Against Cyber Threats Synopsis
4 pages
Social Media Fake Account Detection Report 20pages
No ratings yet
Social Media Fake Account Detection Report 20pages
8 pages
Fast Changes On The Earth's Surface Activity Research Poster in Violet Grey Orange Hand Drawn Style
No ratings yet
Fast Changes On The Earth's Surface Activity Research Poster in Violet Grey Orange Hand Drawn Style
1 page
Improving Accuracy of Twitter Fake Profile Detection Using Deep Learning
No ratings yet
Improving Accuracy of Twitter Fake Profile Detection Using Deep Learning
5 pages
Fake Social Media Profile Detection
No ratings yet
Fake Social Media Profile Detection
10 pages
Fake News Camera Ready
No ratings yet
Fake News Camera Ready
6 pages
Digital Electronics and Software Enggc 3rd Semester Btech Short Notes
No ratings yet
Digital Electronics and Software Enggc 3rd Semester Btech Short Notes
10 pages
0795 A - LEVEL Computer SC P1
No ratings yet
0795 A - LEVEL Computer SC P1
5 pages
Ccs335-Cloud Computing PPT Unit I
No ratings yet
Ccs335-Cloud Computing PPT Unit I
62 pages
A Peck of Pickled Peppers Peter Piper Picked
No ratings yet
A Peck of Pickled Peppers Peter Piper Picked
5 pages
Lehle-P-ISO Manual EN v1.0
No ratings yet
Lehle-P-ISO Manual EN v1.0
14 pages
A Rapid Review of Clustering Algorithms
No ratings yet
A Rapid Review of Clustering Algorithms
14 pages
Malware Analysis Project
No ratings yet
Malware Analysis Project
13 pages
Bda Exp1
No ratings yet
Bda Exp1
4 pages
VT825t Remote Monitoring System (Brochure) Vutlan v1.2
No ratings yet
VT825t Remote Monitoring System (Brochure) Vutlan v1.2
5 pages
Paper 1
No ratings yet
Paper 1
8 pages
VDIAZ - MT DetectingMaliciousProfilesTwitter
No ratings yet
VDIAZ - MT DetectingMaliciousProfilesTwitter
66 pages
Twitter Sentiment Analysis
No ratings yet
Twitter Sentiment Analysis
5 pages
13 Paper 01032022 IJCSIS Camera Ready Pp111-118
No ratings yet
13 Paper 01032022 IJCSIS Camera Ready Pp111-118
8 pages