Saloni CSL Report

Uploaded by

appuprakash040

A Report On

Review Of Machine Learning-Based Zero-Day Attack

Detection: Challenges And Future Directions

Submitted in partial fulfillment of the requirement of

University of Mumbai for the Degree of

Bachelor of Technology
In
Computer Engineering

Submitted By
Saloni Dongare

Supervisor
Prof. Payel Thakur

Department of Computer Engineering

PILLAI COLLEGE OF ENGINEERING
New Panvel – 410 206
UNIVERSITY OF MUMBAI
Academic Year 2024–25
Table of Contents
1. Introduction
2. Implementation
3. Challenges
4. Conclusion and Future Work
4.1 Conclusion
4.2 Future Work

1. Introduction
Zero-day attacks exploit vulnerabilities that are still unknown to defenders, allowing attackers to
bypass defenses without being detected. Because the targeted flaws have not been publicly
disclosed, security teams have no time to patch systems before exploitation. Traditional detection
methods, such as signature-based systems, rely on known patterns to identify attacks and are
ineffective against zero-day threats, for which no such patterns exist. Anomaly-based
detection systems, while useful, can suffer from false positives or fail to detect more subtle
attacks. Machine learning (ML) has emerged as a promising tool for detecting zero-day attacks.
ML models analyze patterns in data and adapt to detect unusual behavior, potentially identifying
unknown attacks. Different types of ML models, such as supervised and unsupervised learning,
have been explored for zero-day detection. However, challenges persist, particularly the
lack of data on zero-day attacks, which makes training these models difficult. Researchers often
assume that zero-day attacks resemble known attacks, but this assumption needs to be validated.

Furthermore, evaluation of these models is challenging due to limited real-world testing data and
the absence of standardized benchmarks. Despite these challenges, ML-based approaches show
potential for improving zero-day attack detection. This report reviews existing ML models and
identifies gaps in current detection methods.

2. Implementation

2.1 OUTLIER-BASED ZERO-DAY ATTACK DETECTION USING NORMAL DATA

A. One-Class SVM-based Detection

● One-Class SVM learns a decision boundary around normal data using non-linear kernels.
Points that fall outside the boundary are considered outliers, i.e., potential zero-day
attacks.

● Training finds a hyperplane in the kernel-induced feature space that separates the normal
data from the origin with maximum margin, keeping normal data on one side of the plane.
● The kernel function (e.g., a Gaussian kernel) defines this feature space, and the model's
boundary is obtained by solving a quadratic programming problem.
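
The boundary-learning idea above can be illustrated with a minimal sketch. It assumes scikit-learn's OneClassSVM and synthetic two-feature data standing in for benign traffic; the values of nu and gamma are illustrative choices, not settings taken from the reviewed studies.

```python
import numpy as np
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)

# Synthetic stand-in for benign flow features (assumption: 2 features, unit variance).
normal_train = rng.normal(loc=0.0, scale=1.0, size=(500, 2))

# nu upper-bounds the fraction of training points treated as outliers;
# gamma sets the width of the Gaussian (RBF) kernel.
model = OneClassSVM(kernel="rbf", nu=0.05, gamma=0.5)
model.fit(normal_train)

# One point near the bulk of the normal data, one far outside it.
queries = np.array([[0.1, -0.2], [8.0, 8.0]])
labels = model.predict(queries)            # +1 = inside the boundary, -1 = outlier
scores = model.decision_function(queries)  # positive inside, negative outside

for point, label, score in zip(queries, labels, scores):
    verdict = "normal" if label == 1 else "potential zero-day"
    print(point, verdict, round(float(score), 3))
```

Only normal data is used for fitting, so the far-away query is rejected without the model ever having seen an attack sample.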

B. Autoencoder-based Detection

● Autoencoders, a type of neural network, are trained to reconstruct their input. Trained on
normal data, they reconstruct normal data with low error and outliers with high error, so the
reconstruction error helps determine whether new data is an outlier.
● The autoencoder consists of an encoder that compresses the input and a decoder that
reconstructs it. A data point is considered anomalous if the reconstruction error exceeds a
threshold.
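
The reconstruction-error test can be sketched as follows. To stay short and deterministic, the sketch swaps the neural network for a linear autoencoder whose weights are obtained in closed form via SVD (equivalent to PCA). The synthetic data, the 2-unit bottleneck and the 99th-percentile threshold are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic "normal" data: 8 observed features driven by 2 hidden factors plus small noise.
mixing = rng.normal(size=(2, 8))
normal = rng.normal(size=(1000, 2)) @ mixing + 0.05 * rng.normal(size=(1000, 8))
train, held_out = normal[:800], normal[800:]

# Linear autoencoder with a 2-unit bottleneck. For a linear model the
# reconstruction-optimal weights span the top principal directions, so SVD yields them.
mean = train.mean(axis=0)
_, _, vt = np.linalg.svd(train - mean, full_matrices=False)
weights = vt[:2]  # shape (2, 8): encoder weights; the transpose acts as decoder

def encode(x):
    return (x - mean) @ weights.T  # compress 8 features to 2

def decode(z):
    return z @ weights + mean      # reconstruct 8 features from 2

def reconstruction_error(x):
    return np.mean((x - decode(encode(x))) ** 2, axis=1)

# Threshold: 99th percentile of the training error (an illustrative choice).
threshold = np.percentile(reconstruction_error(train), 99)

# Outliers that do not follow the 2-factor structure of the normal data.
outliers = rng.normal(scale=3.0, size=(50, 8))

false_alarm_rate = float(np.mean(reconstruction_error(held_out) > threshold))
detection_rate = float(np.mean(reconstruction_error(outliers) > threshold))
print(f"threshold={threshold:.4f} false alarms={false_alarm_rate:.2%} detected={detection_rate:.2%}")
```

A non-linear autoencoder would replace encode and decode with trained network layers; the thresholding step stays the same.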

C. Performance Comparison

● One-Class SVM and autoencoder models were trained on CIC-IDS2017 and NSL-KDD
datasets, which include various types of benign and attack traffic.
● Autoencoders generally outperform One-Class SVM for more complex zero-day attacks,
as their performance scales better with complex data.
● Both models have low false-positive rates, though detection accuracy varies with the attack
type: attacks that differ strongly from normal data, such as Hulk and DDoS, have high
detection rates, while others, such as DoS-SlowHTTPTest, are detected poorly.

D. Ensemble of Autoencoders

● An ensemble approach, such as the Kitsune framework, enhances zero-day detection by
combining multiple autoencoders, each monitoring different network features.
● The ensemble method was evaluated using a test bed involving IP camera surveillance
systems, and Kitsune demonstrated performance comparable to offline anomaly detectors
like Isolation Forests and Gaussian Mixture Models. However, its performance varied
depending on the type of attack.
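
A rough sketch of the ensemble idea follows; it is not Kitsune's implementation. Each member is a small linear autoencoder (fitted via SVD as a stand-in for a neural one) that watches one hypothetical feature group. The sketch combines the members by taking the maximum of their normalised errors, whereas Kitsune feeds the per-member errors into a further output autoencoder.

```python
import numpy as np

rng = np.random.default_rng(1)

class LinearAutoencoder:
    """Linear autoencoder fitted in closed form via SVD (stand-in for a small neural one)."""

    def __init__(self, bottleneck):
        self.bottleneck = bottleneck

    def fit(self, x):
        self.mean = x.mean(axis=0)
        _, _, vt = np.linalg.svd(x - self.mean, full_matrices=False)
        self.weights = vt[: self.bottleneck]
        return self

    def rmse(self, x):
        centered = x - self.mean
        reconstructed = centered @ self.weights.T @ self.weights
        return np.sqrt(np.mean((centered - reconstructed) ** 2, axis=1))

# Hypothetical feature groups; Kitsune derives its own grouping from the data.
groups = [[0, 1, 2], [3, 4, 5], [6, 7, 8]]

def make_normal(n):
    # Within each group, all three features follow one shared hidden factor plus noise.
    factors = rng.normal(size=(n, 3))
    return np.repeat(factors, 3, axis=1) + 0.05 * rng.normal(size=(n, 9))

train = make_normal(1000)
members = [LinearAutoencoder(bottleneck=1).fit(train[:, g]) for g in groups]
# Per-member scale: 99th percentile of that member's own training RMSE.
scales = [np.percentile(m.rmse(train[:, g]), 99) for m, g in zip(members, groups)]

def ensemble_score(x):
    ratios = [m.rmse(x[:, g]) / s for m, g, s in zip(members, groups, scales)]
    return np.max(ratios, axis=0)  # above 1 means at least one member sees an anomaly

# An attack that only disturbs one feature of the second group.
attack = make_normal(20)
attack[:, 4] += 2.0

benign_scores = ensemble_score(make_normal(200))
attack_scores = ensemble_score(attack)
print("benign flagged:", float(np.mean(benign_scores > 1.0)))
print("attack flagged:", float(np.mean(attack_scores > 1.0)))
```

Because each member models only a few features, a disturbance confined to one group stands out in that member's error even when the other groups look normal.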

2.2 SUPERVISED AND HYBRID LEARNING-BASED ZERO-DAY ATTACK DETECTION USING LABELED DATA

A. Evaluation of Supervised Machine Learning Classifiers for Zero-Day Attack Detection

This section focuses on evaluating the effectiveness of six popular machine learning
classifiers—Random Forest, Gaussian Naive Bayes, Decision Tree, Multi-layer Perceptron,
K-Nearest Neighbors, and Quadratic Discriminant Analysis—for detecting zero-day attacks. The
CSE-CIC-IDS2018 dataset was used for training, containing labeled data from a variety of
attacks and benign network activities. After pre-processing to remove noisy features, 25
bi-directional flow features were used for model training. The performance of the classifiers was
evaluated on real-world zero-day attack data, showing that the Decision Tree model
outperformed others, achieving a true-positive rate of 96% and a false-positive rate of 5% at an
optimal tree depth.
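
The evaluation procedure can be sketched on synthetic data, assuming scikit-learn's DecisionTreeClassifier. The classifier is trained on benign traffic and one known attack family, then scored on an attack family withheld from training, which plays the role of the zero-day. The six features, the class shifts and the depth values are invented for illustration and do not reproduce the CSE-CIC-IDS2018 experiment.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(2)
n_features = 6

def sample(n, shift):
    # Only the first two features carry signal; the rest are noise.
    x = rng.normal(size=(n, n_features))
    x[:, :2] += shift
    return x

# Training set: benign traffic plus one known attack family.
x_train = np.vstack([sample(1000, 0.0), sample(1000, 4.0)])
y_train = np.array([0] * 1000 + [1] * 1000)

# Test set: fresh benign traffic plus an attack family absent from training.
x_benign, x_zero_day = sample(500, 0.0), sample(500, 5.0)

results = {}
for depth in (1, 2, 4, 8):
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(x_train, y_train)
    tpr = float(np.mean(tree.predict(x_zero_day) == 1))  # zero-day flows flagged
    fpr = float(np.mean(tree.predict(x_benign) == 1))    # benign flows flagged
    results[depth] = (tpr, fpr)
    print(f"depth={depth}: TPR={tpr:.3f} FPR={fpr:.3f}")
```

The sketch works only because the withheld family resembles the known one in the features the tree splits on, which is the similarity assumption supervised zero-day detection rests on.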

B. Integrating Supervised and Unsupervised Learning for Zero-Day Malware Detection

Comar et al. developed a two-level hybrid detection method using supervised and unsupervised
learning to identify zero-day malware. The approach consists of a macro-level binary classifier
(Random Forest) to identify malicious traffic, followed by a micro-level classifier (multi-class
Support Vector Machine) to distinguish between known and zero-day malware. This layered
approach successfully identified both known and zero-day malware, with an AUC score of 91%.
However, the F1 score for zero-day detection was lower at 0.50, indicating room for
improvement.
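
The two-level control flow can be sketched as below, assuming scikit-learn and synthetic two-feature data. The macro level is a Random Forest as in the text. The micro level uses one one-class SVM profile per known family as a stand-in, which is an assumption of this sketch and not a description of Comar et al.'s classifier. Family names and all parameters are hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(3)

def cluster(center, n):
    return rng.normal(loc=center, scale=0.5, size=(n, 2))

benign = cluster([0.0, 0.0], 600)
known = {"family_a": cluster([5.0, 5.0], 300), "family_b": cluster([5.0, -5.0], 300)}

# Macro level: binary benign-vs-malicious classifier.
x_macro = np.vstack([benign, *known.values()])
y_macro = np.array([0] * len(benign) + [1] * sum(len(v) for v in known.values()))
macro = RandomForestClassifier(n_estimators=50, random_state=0).fit(x_macro, y_macro)

# Micro level: one profile per known malware family (assumption of this sketch).
profiles = {
    name: OneClassSVM(kernel="rbf", nu=0.05, gamma=0.5).fit(x)
    for name, x in known.items()
}

def classify(sample):
    sample = np.asarray(sample, dtype=float).reshape(1, -1)
    if macro.predict(sample)[0] == 0:
        return "benign"
    for name, profile in profiles.items():
        if profile.predict(sample)[0] == 1:
            return name
    return "zero-day"  # flagged as malicious but matching no known profile

print(classify([0.1, 0.0]))    # near the benign cluster
print(classify([5.1, 4.9]))    # near family_a
print(classify([10.0, 10.0]))  # flagged as malicious, far from both known families
```

Separating the two decisions lets the first stage filter traffic cheaply while the second stage decides whether a malicious sample is already known.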

C. Hybrid Learning Using Available Unlabeled Data for Zero-Day Malware Detection

A hybrid approach was proposed that leverages unlabeled data to detect zero-day malware. The
method runs files in a sandbox, extracts API call frequencies, and applies k-means clustering
to the combined labeled and unlabeled data. The geometric distances between each data point
and the cluster centroids are then used as additional features for training classifiers. This approach
was reported to reach 100% detection accuracy when the augmented geometric features were used with
classifiers such as Random Forest and SVM.
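
The feature-augmentation step can be sketched as follows, assuming scikit-learn's KMeans and RandomForestClassifier. Poisson counts stand in for sandbox API call frequencies. The number of API calls, the call rates and the number of clusters are hypothetical, and the accuracy on this easy synthetic task says nothing about real malware.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(4)
n_api_calls = 10  # hypothetical number of monitored API calls

def api_frequencies(rate, n):
    # Poisson counts stand in for per-file API call frequencies from a sandbox run.
    return rng.poisson(lam=rate, size=(n, n_api_calls)).astype(float)

benign_rate = np.full(n_api_calls, 5.0)
malware_rate = np.concatenate([np.full(5, 5.0), np.full(5, 12.0)])

x_labeled = np.vstack([api_frequencies(benign_rate, 100), api_frequencies(malware_rate, 100)])
y_labeled = np.array([0] * 100 + [1] * 100)
x_unlabeled = np.vstack([api_frequencies(benign_rate, 400), api_frequencies(malware_rate, 400)])

# Cluster labeled and unlabeled samples together so the centroids reflect both.
kmeans = KMeans(n_clusters=4, n_init=10, random_state=0)
kmeans.fit(np.vstack([x_labeled, x_unlabeled]))

def augment(x):
    # KMeans.transform returns the distance from each sample to every centroid.
    return np.hstack([x, kmeans.transform(x)])

classifier = RandomForestClassifier(n_estimators=50, random_state=0)
classifier.fit(augment(x_labeled), y_labeled)

x_new = np.vstack([api_frequencies(benign_rate, 50), api_frequencies(malware_rate, 50)])
y_new = np.array([0] * 50 + [1] * 50)
accuracy = float(np.mean(classifier.predict(augment(x_new)) == y_new))
print("augmented feature shape:", augment(x_new).shape, "accuracy:", round(accuracy, 3))
```

The unlabeled samples never receive labels; they influence the classifier only through the centroid positions that define the extra distance features.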

3. Challenges
The primary challenge of zero-day attacks is the lack of prior knowledge, making them nearly
impossible to detect with traditional IDSs that rely on predefined signatures. Attackers can
exploit these unknown vulnerabilities for long periods before any defensive measures are
developed, allowing significant damage to be inflicted. In rapidly evolving smart community
environments, this challenge is amplified, as interconnected systems handling critical
infrastructure are prime targets. The constantly changing nature of zero-day threats leaves
conventional security methods struggling to keep up, often resulting in severe breaches and
financial losses. Additionally, the sheer volume of potential vulnerabilities in modern software
and hardware increases the difficulty of maintaining comprehensive defenses. Attackers can
innovate faster than defenders can patch, making zero-day attacks especially dangerous in
environments where data security and system availability are critical. The lack of transparency in
proprietary systems further complicates efforts to detect and prevent these attacks.

4. Conclusion and Future Work

4.1 Conclusion
Zero-day attacks are frequent, lasting an average of 312 days before detection and causing
significant financial damage, with costs averaging $1.2 million per attack. Machine Learning
(ML)-based detection methods offer the most promising solution for identifying these attacks.
This review explored various ML approaches, including unsupervised, supervised, hybrid, and
transfer learning methods. However, key challenges remain, particularly the lack of zero-day
attack data in training sets, which limits the accuracy and robustness of existing models.
Additionally, limited datasets and feature spaces hinder the effectiveness of current detection
methods.

To improve ML-based zero-day detection, future efforts should focus on integrating the latest
ML advancements, incorporating expert knowledge, and developing standardized, data-rich
benchmarks for better evaluation and model improvement.

4.2 Future Work

To overcome the challenges in designing effective ML-based zero-day attack detection, efforts on
several fronts are essential. First, collecting data on zero-day attacks before they become widely
known is critical. Honeypots can be used to gather this data, allowing systems to learn from
real-world attacks before they are publicly disclosed. Another key area is effective feature
engineering, which requires integrating domain expertise. Attackers may disguise their attacks as
legitimate actions, so involving cybersecurity experts in selecting features ensures that new
attacks are detectable within the model’s feature space.

Staying updated with advancements in machine learning is also crucial. Reinforcement Learning
(RL), for example, allows systems to learn by interacting with the environment through trial and
error, making it particularly suited for zero-day attack detection. RL-based methods have shown
success in defending against various types of zero-day attacks, including strategic and random
attacks. Lastly, developing a comprehensive benchmark suite with standardized datasets and
evaluation tools is necessary to further research in ML-based zero-day detection systems. This
would significantly speed up innovation and improve the effectiveness of these systems.
