Thanks to visit codestin.com
Credit goes to Github.com

Skip to content

A simple GUI-based tool to analyze files for hidden metadata (like GPS, camera model, authorship) and flag potential privacy risks.

Notifications You must be signed in to change notification settings

suv4tha/metadata

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Metadata Privacy Risk Analyzer

🔍 A simple yet effective tool to inspect hidden metadata in files and identify potential privacy risks. This tool supports images (JPEG, PNG), PDFs, and audio files (MP3, WAV) and highlights sensitive metadata like GPS location, device information, author details, and artist info.


Features

  • Extracts metadata from:
    • Images (JPEG, PNG) using exifread
    • PDFs using PyPDF2
    • Audio files (MP3, WAV) using mutagen
  • Detects privacy risks such as:
    • GPS coordinates embedded in photos
    • Device model and make info
    • PDF author and producer details
    • Audio artist and encoding metadata
  • Provides user-friendly output via a web interface built with Gradio
  • Supports local/offline usage
  • Easy to extend for additional file types and metadata checks

Demo

Demo Screenshot


Installation

  1. Clone the repo:

    git clone https://github.com/suv4tha/metadata.git
    cd metadata
  1. Create and activate a Python virtual environment:

    python -m venv venv
    # Windows:
    venv\Scripts\activate
    # Linux / macOS:
    source venv/bin/activate
  2. Install required dependencies:

    pip install -r requirements.txt

Usage

Run the app:

python metadata_tool.py

This will launch a local web interface (usually at http://127.0.0.1:7860) where you can upload your file and inspect metadata risks instantly.


How It Works

  • The app detects the file type based on its extension.

  • Depending on the file type, it uses appropriate libraries to extract metadata:

    • Images: exifread
    • PDFs: PyPDF2
    • Audio: mutagen
  • It analyzes metadata keys for sensitive information like GPS coordinates, device details, author info, and more.

  • The results and any flagged privacy risks are displayed in an easy-to-understand format on the web UI.


Dependencies

Install all with:

pip install gradio exifread PyPDF2 mutagen

Limitations & Future Work

  • Currently supports only a limited set of file types (images, PDFs, audio).

  • Does not modify or remove metadata; only reports risks.

  • Could be extended with:

    • Additional file types (video, documents)
    • Metadata removal or redaction
    • Batch processing support
    • Detailed risk scoring and recommendations

License

This project is open-source and available under the MIT License.


Contact

Created by suv4tha. Feel free to open issues or submit pull requests for improvements!


About

A simple GUI-based tool to analyze files for hidden metadata (like GPS, camera model, authorship) and flag potential privacy risks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages