A multi-layered open-science dataset for phishing, spam, and legitimate email analysis using emotional, motivational, and semantic labels.
This repository contains a new, richly annotated dataset designed for research on LLM-based email security, including phishing detection, spam analysis, emotional manipulation, and automated robustness evaluation under paraphrasing.
The dataset includes:
- Human-written phishing, spam, and legitimate emails
- LLM-generated emails (GPT-4o, DeepSeek-Chat, Grok, Llama 3.3, Gemini, Nova, Mistral, etc.)
- Emotion and motivation labels
- Rephrased/paraphrased variants from three independent LLM pipelines
- Claude 3.5 Sonnet classifications
This repository enables reproducible research on how LLMs interpret, classify, and analyze deceptive online communication.
merged_emails_with_categories.jsonl
Each record contains the following fields (a loading sketch follows this list):
- True category (Phishing, Spam, Valid)
- Human vs. LLM-generated origin
- Rephrasing source (GPT-4o, DeepSeek, RandomAPI, Manual)
- Emotional labels (urgency, fear, authority, etc.)
- Motivational labels (link-click, credential theft, etc.)
- Claude 3.5 Sonnet predicted classification
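A minimal loading sketch; the field names (`true_category`, `rephrasing_source`, `claude_prediction`) are illustrative assumptions and may differ from the actual JSONL keys:

```python
import json
from collections import Counter

# Load the dataset line by line (one JSON object per email).
records = []
with open("merged_emails_with_categories.jsonl", "r", encoding="utf-8") as f:
    for line in f:
        records.append(json.loads(line))

# Quick sanity checks; the key names below are assumed for illustration.
print("Total emails:", len(records))
print("By true category:", Counter(r.get("true_category") for r in records))
print("By rephrasing source:", Counter(r.get("rephrasing_source") for r in records))
```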
- accuracy_validation.py: benchmark for emotional & motivational detection
- category.py: email classification pipeline
- stats.py: strict/relaxed accuracy, confusion matrices, paraphrase robustness
Together, these scripts produce (a metrics sketch follows the list):
- Confusion matrices
- Strict and relaxed classification reports
- Emotional/motivational LLM benchmarking
- Robustness metrics across rephrasing pipelines
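A hedged sketch of how the strict three-class report and confusion matrix could be reproduced with scikit-learn, reusing the `records` list and the assumed field names from the loading sketch above (stats.py may compute these differently):

```python
from sklearn.metrics import classification_report, confusion_matrix

LABELS = ["Phishing", "Spam", "Valid"]

# Assumed field names; adjust to the actual JSONL schema.
y_true = [r["true_category"] for r in records]
y_pred = [r["claude_prediction"] for r in records]

print(classification_report(y_true, y_pred, labels=LABELS, digits=2))
print(confusion_matrix(y_true, y_pred, labels=LABELS))
```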
Phishing and spam emails remain pervasive cybersecurity threats, increasingly strengthened by the use of Large Language Models (LLMs) to generate deceptive content. This work introduces a comprehensive, multi-layered email dataset containing both human-written and LLM-generated messages across phishing, spam, and legitimate categories. Each email is enriched with emotional and motivational labels—capturing cues such as urgency, fear, authority, greed, and link-click incentives—along with paraphrased variants generated by multiple LLM pipelines to test classifier robustness.
We benchmark several modern LLMs for emotional and motivational detection and identify Claude 3.5 Sonnet as the most reliable model for large-scale annotation. We further evaluate its classification accuracy under both strict (three-class) and relaxed (unwanted vs. valid) settings across original and LLM-rewritten emails. Results show that contemporary LLMs can reliably detect harmful messages and emotional manipulation strategies, though distinguishing spam from legitimate emails remains difficult.
All templates, datasets, and source code are released openly to support reproducible research in AI-assisted email security.
- Human-written emails collected from open-source corpora and curated phishing repositories
- LLM-generated emails created to increase stylistic and topical diversity
- Rephrasing via three pipelines (a paraphrasing sketch follows this list):
- DeepSeek-Chat
- GPT-4o
- OpenRouter multi-model pipeline (Gemini, Nova, Grok, Llama, Mistral, etc.)
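A minimal paraphrasing sketch using the OpenAI Python client with GPT-4o; the prompt wording and parameters are assumptions, not the exact pipeline used for this dataset:

```python
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

def rephrase(email_body: str, model: str = "gpt-4o") -> str:
    """Rewrite an email while preserving its meaning (illustrative prompt)."""
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system",
             "content": "Paraphrase the following email. Keep its meaning, links, and intent unchanged."},
            {"role": "user", "content": email_body},
        ],
        temperature=0.7,
    )
    return response.choices[0].message.content
```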
Four LLMs evaluated:
- GPT-4o-mini
- GPT-4.1-mini
- Claude 3.5 Sonnet
- DeepSeek-Chat
Evaluation metrics (illustrated in the sketch after this list):
- Strict accuracy
- Close-enough accuracy
- Jaccard similarity
- Internal consistency across 5 independent runs
- Precision & recall
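A sketch of the set-based metrics; here "close-enough" is read as sharing at least one label with the human annotation, which is an assumption about the exact criterion used:

```python
def jaccard(predicted: set[str], gold: set[str]) -> float:
    """Jaccard similarity between predicted and human-annotated label sets."""
    if not predicted and not gold:
        return 1.0
    return len(predicted & gold) / len(predicted | gold)

def close_enough(predicted: set[str], gold: set[str]) -> bool:
    """Assumed relaxation: correct if the prediction shares any label with the gold set."""
    return bool(predicted & gold)

# Example with emotional labels for a single email.
print(jaccard({"urgency", "fear"}, {"urgency", "authority"}))       # 0.33
print(close_enough({"urgency", "fear"}, {"urgency", "authority"}))  # True
```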
Claude 3.5 Sonnet was selected for full-dataset labeling because it showed the highest agreement with human annotations.
Claude 3.5 Sonnet performed the final classification using the following inputs (prompt sketch below):
- Email body
- Subject line
- Sender metadata
- URL and attachment indicators
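A hedged sketch of what such a call could look like with the Anthropic Python SDK; the prompt, model string, and inputs are assumptions rather than the exact prompt used in category.py:

```python
import anthropic

client = anthropic.Anthropic()  # expects ANTHROPIC_API_KEY in the environment

def classify_email(subject: str, sender: str, body: str,
                   has_url: bool, has_attachment: bool) -> str:
    """Ask Claude for a one-word verdict: Phishing, Spam, or Valid."""
    prompt = (
        "Classify the following email as Phishing, Spam, or Valid. "
        "Answer with one word only.\n\n"
        f"Subject: {subject}\nSender: {sender}\n"
        f"Contains URL: {has_url}\nContains attachment: {has_attachment}\n\n"
        f"{body}"
    )
    message = client.messages.create(
        model="claude-3-5-sonnet-20241022",  # assumed model version string
        max_tokens=10,
        messages=[{"role": "user", "content": prompt}],
    )
    return message.content[0].text.strip()
```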
Evaluated using the following settings (relaxed-mapping sketch below):
- Strict classification (Phishing / Spam / Valid)
- Relaxed classification (Unwanted vs. Valid)
- Robustness to paraphrasing across three pipelines
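The relaxed setting collapses Phishing and Spam into a single Unwanted class; a small sketch of that mapping, reusing the `y_true`/`y_pred` lists and assumed field names from above:

```python
from sklearn.metrics import accuracy_score

def relax(label: str) -> str:
    """Map the three strict classes onto the binary Unwanted/Valid split."""
    return "Valid" if label == "Valid" else "Unwanted"

strict_acc = accuracy_score(y_true, y_pred)
relaxed_acc = accuracy_score([relax(y) for y in y_true],
                             [relax(y) for y in y_pred])
print(f"strict={strict_acc:.3f}  relaxed={relaxed_acc:.3f}")
```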
- Claude 3.5 Sonnet:
  - Jaccard similarity = 0.60
  - Close-enough accuracy = 42%
- Motivational detection is harder, but top models achieve 53–61% close-enough accuracy
- LLMs often infer additional plausible motivations beyond human annotations
Across all email groups (Original, DeepSeek-rephrased, GPT-4o-rephrased, RandomAPI):
- Strict accuracy: ~66–67%
- Relaxed accuracy: ~69–70%
- Phishing detection excellent (F1 ≈ 0.93)
- Spam detection weak (F1 ≈ 0.20–0.23)
- Valid classification moderate (F1 ≈ 0.63)
Maximum deviation from the original group (computed as in the sketch below):
- Strict accuracy deviation: 0.55 percentage points
- Relaxed accuracy deviation: 0.54 percentage points
Rephrasing has minimal impact on classifier performance.
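A sketch of how the per-group deviation (in percentage points) could be computed, assuming a `rephrasing_source` field and an `Original` marker for non-rephrased emails; both names are illustrative:

```python
from collections import defaultdict

# Accumulate per-group accuracy of Claude's predictions; field names are assumed.
correct, total = defaultdict(int), defaultdict(int)
for r in records:
    group = r.get("rephrasing_source", "Original")
    total[group] += 1
    correct[group] += int(r["claude_prediction"] == r["true_category"])

acc = {g: 100.0 * correct[g] / total[g] for g in total}
baseline = acc.get("Original", 0.0)  # assumed marker for the non-rephrased group
for g, a in sorted(acc.items()):
    print(f"{g}: {a:.2f}%  deviation = {abs(a - baseline):.2f} pp")
```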
Running stats.py produces (a LaTeX-export sketch follows the list):
- Strict and relaxed accuracy
- Confusion matrices
- Group-by-group metrics
- Paraphrasing robustness analysis
- LaTeX-ready tables for publications
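A minimal sketch of exporting group-level metrics as a LaTeX table with pandas; the numbers are placeholders within the ranges reported above, and stats.py may format its tables differently:

```python
import pandas as pd

# Placeholder values chosen to lie in the reported ~66-67% / ~69-70% ranges.
table = pd.DataFrame(
    {
        "Strict accuracy (%)": [66.5, 66.9, 66.1, 67.0],
        "Relaxed accuracy (%)": [69.4, 69.8, 69.1, 69.9],
    },
    index=["Original", "DeepSeek", "GPT-4o", "RandomAPI"],
)
print(table.to_latex(float_format="%.1f",
                     caption="Accuracy per rephrasing group",
                     label="tab:robustness"))
```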