| Sr.No | Contents | Date |
|---|---|---|
| 1 | Project Group Formation | 08/01/2026 – 15/01/2026 |
| 2 | Project Topic Finalization | 19/01/2026 – 29/01/2026 |
| 3 | Identified and analyzed the functional and non-functional requirements of the proposed system | 29/01/2026 – 31/01/2026 |
| 4 | Designed the overall system architecture, workflow, and module interactions | 07/02/2026 – 18/02/2026 |
| 5 | Implementation Phase – I | 19/02/2026 – 28/02/2026 |
| 6 | Developed the web interface and integrated core system functionalities. | 01/03/2026 – 09/03/2026 |
| 7 | Implementation Phase – II | 10/03/2026 – 27/03/2026 |
| 8 | Implementation Phase – III | 28/03/2026 – 01/04/2026 |
| 9 | Conducted functional, performance, and validation testing of the complete system | 05/04/2026 – 27/04/2026 |
| 10 | Evaluated model performance and analyzed the obtained results using relevant metrics | 28/04/2026 – 09/05/2026 |
| 11 | Prepared the project documentation and summarized the conclusions and future scope | 09/05/2026 – 11/05/2026 |
The AI-Based Document Fraud Detection Web System is a deep learning-powered web application designed to automatically detect fraudulent and tampered documents. The system analyzes uploaded document images and classifies them as Genuine or Fraudulent using a Convolutional Neural Network (CNN) model trained on document datasets.
The project aims to reduce manual verification efforts and improve the accuracy, speed, and reliability of document authentication in educational institutions, recruitment agencies, financial organizations, and government sectors.
- Document Image Upload
- Deep Learning-Based Fraud Detection
- Real-Time Prediction
- Genuine/Fake Classification
- Confidence Score Generation
- Responsive Web Interface
- Automated Verification Process
- Easy Deployment and Scalability
User Uploads Document
│
▼
Image Preprocessing
│
▼
Feature Extraction
│
▼
CNN Deep Learning Model
│
▼
Fraud Detection Prediction
│
▼
Display Result on Web Interface
- HTML5
- CSS3
- JavaScrip
- Python
- Flask
- TensorFlow
- Keras
- OpenCV
- NumPy
- Pandas
- CNN (Convolutional Neural Network)
The model is trained using internship certificate and document image datasets containing:
- Genuine Documents
- Fraudulent Documents
- Tampered Certificates
- Modified Document Images
Dataset Structure:
dataset/
│
├── genuine/
│ ├── doc1.jpg
│ ├── doc2.jpg
│ └── ...
│
└── fake/
├── doc1.jpg
├── doc2.jpg
└── ...
Collect genuine and fraudulent document images.
- Resize images
- Normalize pixel values
- Convert image format
- Remove noise
Train a CNN model using TensorFlow/Keras.
Evaluate using:
- Accuracy
- Precision
- Recall
- F1 Score
Deploy trained model using Flask Web Framework.
AI-Document-Fraud-Detection/
│
├── app.py
├── train.py
├── predict.py
├── model/
│ └── fraud_detection_model.h5
│
├── dataset/
│ ├── genuine/
│ └── fake/
│
├── static/
│ ├── css/
│ ├── js/
│ └── uploads/
│
├── templates/
│ ├── index.html
│ └── result.html
│
├── requirements.txt
├── README.md
└── notebook.ipynb
git clone https://github.com/PLACEHOLDER/AI-Document-Fraud-Detection.git
cd AI-Document-Fraud-Detectionpip install -r requirements.txtRun:
python train.pyThe trained model will be saved as:
fraud_detection_model.h5
python app.pyOpen browser:
http://127.0.0.1:5000
- Upload document image.
- Image is preprocessed.
- CNN model extracts features.
- Model predicts fraud probability.
- Result displayed on webpage.
| Metric | Value |
|---|---|
| Accuracy | 84.1% |
| Precision | - |
| Recall | - |
| F1 Score | 0.8403 |
- Educational Certificate Verification
- Recruitment Document Validation
- Government Record Authentication
- Banking and Financial Institutions
- Insurance Claim Verification
- Corporate Background Verification
Pranav Chaudhari
Darshan Shinde
Harsh Mali
Pratik Nikwade
College: R C Patel Institute of Technology
Department: Computer Science and Engineering (Data Science)
Guide: Prof. Priyanka D. Lanjewar
Academic Year: 2025-2026
This project is developed for academic and research purposes only.