AI Phishing Detector

An AI-powered phishing detection system built using ensemble machine learning models. This project analyzes website metadata and predicts whether a URL is legitimate or a phishing attempt, with over 95% accuracy.

Overview

This project demonstrates the use of supervised machine learning to detect phishing websites using a dataset sourced from Kaggle. It uses two base models — Logistic Regression and Random Forest — combined into an ensemble classifier to increase accuracy and reliability.

Features

Detects phishing vs. legitimate websites
Ensemble model: Logistic Regression + Random Forest
Pink-themed confusion matrix for visual analysis
Scalable, modular codebase with clean function separation
Accuracy score, classification report, and visual output

Technologies Used

Python
Pandas – data manipulation
Scikit-learn – machine learning algorithms
Matplotlib – data visualization
VotingClassifier – for ensemble learning

Skills Demonstrated

Building ensemble ML models from scratch
Feature scaling and preprocessing techniques
Model evaluation using confusion matrices and classification reports
Structuring Python projects into clean, reusable functions
Using .gitignore to manage sensitive/local files
Version control with Git + GitHub

How It Works

Loads and preprocesses phishing dataset
Splits into training and testing sets
Scales feature data using StandardScaler
Trains an ensemble model (soft voting classifier)
Evaluates model with accuracy score, confusion matrix, and report
Displays results via terminal + pink-themed visual

Dataset used: Phishing Website Detector (Kaggle)

Note: To run this project, download phishing.csv from the link above and place it in the same directory as the Python script.

Results

Final Accuracy: ~95.7%
Classification Report: Includes precision, recall, F1-score
Visual Analysis: Confusion matrix with styled visualization

Results Visualization

Confusion matrix for the model (pink-themed):

Future Improvements

Deploy the model via Flask or Streamlit for web input
Accept live URLs and perform real-time checks
Experiment with deep learning or NLP for phishing email content
Integrate threat intelligence feeds for smarter detection

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
phishing_confusion_matrix.png		phishing_confusion_matrix.png
phishing_detector.py		phishing_detector.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Phishing Detector

Overview

Features

Technologies Used

Skills Demonstrated

How It Works

Results

Results Visualization

Future Improvements

About

Uh oh!

Releases

Packages

Languages

rbagwandeen/ai-phishing-detector

Folders and files

Latest commit

History

Repository files navigation

AI Phishing Detector

Overview

Features

Technologies Used

Skills Demonstrated

How It Works

Results

Results Visualization

Future Improvements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages