You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
AI-generated fake sports content — produced by models like Sora, Midjourney, and SDXL — is now virtually indistinguishable from authentic footage. This poses serious threats to sports organizations, broadcasters, and fans.
Existing detection tools fall short because:
They require an original reference — useless when verifying unseen content.
They rely on face-trained models — missing critical sports-specific artifacts such as grass textures, jersey physics, ball blur, crowd bokeh, and shadow angles.
💡 Solution Overview
DeepProvenance is a full-stack platform that combines sports-specific AI forgery detection with blockchain-anchored Reality Certificates to establish a verifiable chain of trust for digital media.
Workflow
Track
Description
🟢 Official Content
Photographer captures → ViT embedding extraction → Reality Certificate minted on Polygon zkEVM
🔵 Reuse Detection
Content uploaded → ANN similarity search against Content DNA Registry → Ownership & authorization verified
🔴 Fake Detection
Content uploaded → ViT classifier + Grad-CAM heatmap → AI/Real verdict with confidence score
Four Verdicts
Verdict
Indicator
Meaning
ORIGINAL VERIFIED
🟢 Green
Uploaded by owner, valid certificate, confirmed authentic
VERIFIED REUSE
🔵 Cyan
Matches registered content, authorized user
UNVERIFIED COPY
🟡 Amber
Matches registered content, unauthorized upload
FAKE / AI MANIPULATED
🔴 Red
AI-generated content, no registry match
✨ Key Features
Sports-Specific AI Detection — ViT model fine-tuned on sports media artifacts (grass, jerseys, ball physics, crowd patterns, lighting, hands)
Grad-CAM Explainability — Visual heatmaps highlighting exactly which regions are synthetic
Content DNA Registry — 768-dimensional visual fingerprints stored in Pinecone vector database for similarity search
Note: The application runs in Demo Mode by default (DEMO_MODE=true), which uses in-memory storage and simulated ML inference — no external services required.
🔌 API Reference
Authentication
Method
Endpoint
Description
POST
/api/v1/auth/register
Register a new creator account
POST
/api/v1/auth/login
Authenticate & receive JWT tokens
POST
/api/v1/auth/refresh
Refresh access token
Upload & Verification
Method
Endpoint
Description
POST
/api/v1/upload/verify
Upload image for AI verification
GET
/api/v1/upload/result/{task_id}
Poll for verification result
POST
/api/v1/upload/mint
Mint a Reality Certificate on-chain
GET
/api/v1/upload/history
Get upload history
Certificates
Method
Endpoint
Description
GET
/api/v1/certificate/{hash}
Get certificate details
POST
/api/v1/certificate/{id}/authorize
Grant user access
DELETE
/api/v1/certificate/{id}/authorize/{user_id}
Revoke user access
GET
/api/v1/certificate/{id}/chain
Get chain of custody
Analytics & ML
Method
Endpoint
Description
GET
/api/v1/analytics/overview
Platform analytics overview
GET
/api/v1/analytics/heatmap
Violation distribution by platform
GET
/api/v1/analytics/audit/{upload_id}
Asset audit trail
GET
/api/v1/ml/model-stats
Model accuracy metrics
POST
/api/v1/ml/detect
Run AI forgery detection (internal)
Admin
Method
Endpoint
Description
POST
/api/v1/admin/takedown
Initiate content takedown
GET
/api/v1/admin/queue
Get human review queue
POST
/api/v1/admin/export/{upload_id}
Generate forensic report PDF
Full interactive API documentation is available at /api/docs when the backend is running.
🤖 ML Pipeline
Model Architecture
Base Model:google/vit-base-patch16-224 (Vision Transformer)
Task: Binary classification — Real vs. AI-Generated sports media
Grad-CAM heatmaps are generated to highlight suspicious regions:
Red zones indicate high confidence of synthetic content
Yellow zones indicate moderate anomaly
Heatmap opacity is adjustable in the UI
Training & Evaluation
# Fine-tune ViT modelcd ml && python train.py
# Evaluate model performancecd ml && python evaluate.py
# Generate Grad-CAM visualizationscd ml && python gradcam.py
⛓️ Smart Contract
RealityCertificate.sol
A non-transferable (soulbound) ERC-721 NFT deployed on Polygon zkEVM testnet.
Real-time charts (30-day trends, verdict breakdown pie chart, platform violation bar chart) and live alert feed
/certificate/:hash
Certificate Explorer
Detailed certificate view with QR code, blockchain link, chain-of-custody timeline, embeddable badge, and exposure score
/admin
Admin Panel
Content moderation queue with search/filter, takedown actions, review workflow, and export capabilities
/developer
Developer
API documentation with code examples (cURL, Python, JavaScript), endpoint reference table, API key management, and rate limits
🧪 Testing
# Run all tests
make test# Backend unit + integration tests
make test-backend
# → pytest tests/ -v --tb=short# Frontend build verification
make test-frontend
# Smart contract tests
make test-contracts
# → npx hardhat test# ML model evaluation
make test-ml
# → python evaluate.py
This project is licensed under the MIT License — see the LICENSE file for details.
Built with ❤️ for authentic sports media
DeepProvenance — because truth in sports matters.
About
AI-powered sports media authenticity platform — detects AI-generated fakes using sports-specific ViT forensics and anchors real content with blockchain Reality Certificates on Polygon zkEVM.