Data Science & Bioinformatics
London, United Kingdom
I’m a biomedical scientist with 7+ years of experience in clinical laboratories and a growing focus on bioinformatics, data science, and precision medicine.
I bridge wet-lab techniques with computational workflows - from stem cell research to NGS data pipelines - combining biological expertise with data-driven insights.
- 7+ years of experience across clinical, stem-cell, and ATMP laboratories (UCLH, CooperGenomics)
- Experience in flow cytometry, NGS library prep, and clinical data reporting
- Skilled in Python, R, SQL, and Bash for genomic analysis and automation
- Cloud-native mindset: Google Cloud, AWS, and Kubernetes for scalable bioinformatics pipelines
- Freelance AI data annotator (Mercor, Outlier, micro1, Alignerr) supporting model training for biomedical data
- Currently pursuing MSc Bioinformatics @ Atlantic Technological University (remote)
Programming: Python · R · SQL · Bash · Git
Bioinformatics: Bowtie2 · HISAT2 · SAMtools · BEDtools · DESeq2 · Bioconductor
Data Viz: Tableau · ggplot2 · seaborn · matplotlib
Cloud: AWS · Google Cloud · Kubernetes Engine
AI/ML: Data annotation · model evaluation · prompt design
| Project | Description | Tools |
|---|---|---|
| Genomic Data Science | End-to-end RNA-seq & variant analysis using HISAT2, StringTie, and DESeq2 | Bash · Python · R |
| Salifort Motors | Predictive modelling to understand drivers of employee turnover and inform retention strategy | XGBoost · NumPy · SciPy · scikit-learn · Pandas · Statsmodels |
| Tik Tok Project | Exploratory analysis of engagement metrics to uncover content trends and optimisation levers | Matplotlib · Seaborn · Plotly · SciPy |
| Bellabeat Case Study | Fitbit data analysis and Tableau dashboard | R · dplyr · Tableau |
| AWS Solution Architecture | Cloud deployment diagrams & IaC design | AWS · ECS · S3 · Aurora |
| Cyclistic BI Capstone | Data integration and visualization for business insights | BigQuery · Tableau |
| Portfolio Website | Personal website showcasing bioinformatics and data projects | HTML · CSS · JS |
MSc Bioinformatics - Atlantic Technological University (Remote) - 2025–Present
MSc Cell & Gene Therapy - University College London - 2021–2023
BSc Biomedical Science - University of Catania - 2014–2017
Certifications
Google: Data Analytics · Advanced Data Analytics · IT Automation with Python · Project Management · Business Intelligence; Cloud: Architecting with Google Kubernetes Engine
AWS: Cloud Practitioner Essentials · Cloud Solutions Architect
Bioinformatics:
Johns Hopkins University Genomic Data Science Specialization
Wellcome: Bioinformatics for Biologists: An Introduction to Linux, Bash Scripting, and R; Analysing and Interpreting Genomics Datasets
Data Science: freeCodeCamp: Data Analysis with Python; Relational Databases; Scientific Computing with Python & Databases; DE<code>LIFE: Genomes, Networks & Pathways; Data Science & Machine Learning with Python
- Ecological and Conservational Data Science
- Protein–protein interaction (PPI) networks with Cytoscape
- Graph theory & network analysis in genomics
- Cloud-native workflows with Docker & Snakemake
- Data visualisation storytelling (Tableau + HTML integration)