Gumshoe is a quick script to help locate potential PII in S3 buckets.
poetry shell
poetry install
Create IAM credentials with the appropriate permissions and set them in your environment:
export AWS_ACCESS_KEY_ID=<your-access-key-id>
export AWS_SECRET_ACCESS_KEY=<your-secret-access-key>
You will need at least the following permissions:
s3:ListBuckets3:GetObjects3:GetObjectVersions3:GetObjectVersionAcls3:GetObjectVersionTaggings3:GetObjectVersionTorrents3:GetObjectTorrents3:GetObjectTaggingcomprehend:ContainsPiiEntitiescomprehend:DetectPiiEntities
python gumshoe.py