Data leader with a passion for turning insights into impact
I build things that matter at the intersection of data and music. Right now, I'm the founder of Sync Wave Analytics, where I'm creating the analytics infrastructure that helps labels, investors, and artists understand what their catalogs are actually worth—processing 500K+ daily streaming events across Spotify, Deezer, and Luminate.
Before going all-in on music tech, I spent five years at Spotify building the systems behind the scenes—churn prediction models serving 500M users, the data science for Wrapped campaigns, and ML features like Family Mix that shipped to millions. Earlier, I detected fraud patterns in 10M+ daily transactions at JPMorgan Chase (and earned a utility patent for privacy-preserving analytics), and cut my teeth building fantasy sports models at ESPN that powered content for 2M+ weekly users.
I studied Mathematics and Statistics at Rutgers, but what really drives me is turning messy, complex data into something that helps people make better decisions. Whether that's a $100M acquisition call or figuring out which songs deserve a second listen—I love the puzzle.
From sports analytics to streaming at scale to building my own thing
- Founded Lyric Nexus, a B2B music analytics platform processing 500K+ daily streaming events from Spotify, Deezer, and Luminate to deliver catalog valuations, royalty forecasting, and artist performance insights for labels and investors.
- Architected Snowflake data warehouse with 260+ dbt models and automated Slidev reporting, reducing analyst report generation time by 80% while enabling reproducible, audience-tailored insights for stakeholders from investors to artists.
- Developed content-based recommendation engine using audio feature embeddings and genre similarity scoring, achieving 89% precision in sync licensing matches and powering playlist curation for catalog discovery workflows.
- Engineered full-stack platform with Next.js, TypeScript, and Neon PostgreSQL; implemented OAuth 2.0, RBAC, and row-level security enabling multi-tenant access for 15+ enterprise clients with SOC 2-ready architecture.
- Orchestrated ETL pipelines with Prefect and Airbyte ingesting royalty data from 20+ streaming platforms, processing 250GB+ monthly with 99.5% pipeline reliability and sub-hour data freshness for real-time analytics.
- Directed data science and analytics organization of 5 for PE firm acquiring music catalogs, delivering valuation models and due diligence datasets that informed $100M+ in acquisition decisions and secured 3 investor funding rounds.
- Migrated legacy infrastructure to GCP and built serverless ETL pipelines processing 250GB+ transaction data from 200+ royalty sources, reducing data processing costs by 60% while improving cross-team data accessibility.
- Launched Centralized Insights Hub integrating Spotify, Deezer, and Tidal streaming data with catalog financials, enabling track-level revenue attribution and reducing monthly reporting cycles from 2 weeks to 2 days.
- Engineered end-to-end churn prediction pipeline scaled to 500M users using TensorFlow and BigQuery, driving 10M user reactivations and $20M in incremental subscription revenue through targeted re-engagement campaigns.
- Led cross-functional data science for Spotify + Hulu Bundle launch, designing propensity models and A/B testing frameworks that optimized targeting and messaging, delivering $15M lift in paid subscriptions within first quarter.
- Managed DS and Analytics Engineering team for Wrapped 2019-2020, Spotify's most viral annual campaign; introduced matched-market testing methodology that measured 1M incremental listening hours and validated campaign ROI.
- Drove Product Data Science for new premium tier initiative and Spotify Stations app, owning experiment design, UX research, and growth strategy that informed product roadmap decisions impacting 50M+ monthly active users.
- Shipped two ML-powered personalization features: "Family Mix" (first cross-account algorithmic playlist) and emotion-based recommendations using image recognition, both launched to millions of users in production.
- Engineered NLP and ML pipelines processing 10M+ daily credit card transactions for merchant matching and bot detection, achieving 92% classification accuracy and reducing fraud investigation time by 40%.
- Granted utility patent for privacy-preserving synthetic data methods enabling secure cross-institutional analytics.
- Pioneered predictive sports analytics platform for Daily Fantasy, building ML models for lineup optimization across NBA, MLB, NHL that achieved 73% top-quartile accuracy and powered content reaching 2M+ weekly users.
- Developed Monte Carlo simulations for MLB run expectancies with 85% accuracy; featured on ESPN broadcasts.
Personal projects showcasing full-stack development and data engineering
Applications
Full-stack web applications built for scale
An interactive music trivia game where players identify songs from progressively-revealed audio previews. Features a 5-attempt system with adaptive preview lengths, real-time artist search across millions of Deezer tracks, and a retro arcade aesthetic with CRT effects.
A comprehensive fantasy sports analytics platform with live NHL scores, Yahoo Fantasy integration, FELO rating system for manager rankings, and NHL Edge analytics for advanced player performance metrics.
Comprehensive sports analytics platform built on a reusable Slidev template system with NHL betting strategy research (8-12% ROI), goalie performance analysis, and MLB sabermetrics—all powered by D3.js visualizations matching Baseball Savant quality.
Data Engineering
Data pipelines, warehouses, and APIs
A lightweight serverless proxy providing type-safe access to the Deezer music platform. Features auto-generated OpenAPI documentation, sliding window rate limiting, and comprehensive coverage of search, tracks, albums, artists, playlists, and podcasts.
Production-grade dbt data warehouse with 569+ data quality tests, comprehensive transformation layers, and analytics-ready marts spanning sports and fantasy domains.
High-performance FastAPI service with intelligent retry strategies, GCP Secret Manager integration, and comprehensive Pydantic validation for Yahoo Fantasy Sports data.
Technologies and tools I work with