Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View barbavegeta's full-sized avatar

Block or report barbavegeta

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
barbavegeta/README.md

Hi, I’m Salvatore

Data Science & Bioinformatics
London, United Kingdom


About Me

I’m a biomedical scientist with 7+ years of experience in clinical laboratories and a growing focus on bioinformatics, data science, and precision medicine.
I bridge wet-lab techniques with computational workflows - from stem cell research to NGS data pipelines - combining biological expertise with data-driven insights.

  • 7+ years of experience across clinical, stem-cell, and ATMP laboratories (UCLH, CooperGenomics)
  • Experience in flow cytometry, NGS library prep, and clinical data reporting
  • Skilled in Python, R, SQL, and Bash for genomic analysis and automation
  • Cloud-native mindset: Google Cloud, AWS, and Kubernetes for scalable bioinformatics pipelines
  • Freelance AI data annotator (Mercor, Outlier, micro1, Alignerr) supporting model training for biomedical data
  • Currently pursuing MSc Bioinformatics @ Atlantic Technological University (remote)

Tech Stack

Programming: Python · R · SQL · Bash · Git
Bioinformatics: Bowtie2 · HISAT2 · SAMtools · BEDtools · DESeq2 · Bioconductor
Data Viz: Tableau · ggplot2 · seaborn · matplotlib
Cloud: AWS · Google Cloud · Kubernetes Engine
AI/ML: Data annotation · model evaluation · prompt design


Featured Projects

Project Description Tools
Genomic Data Science End-to-end RNA-seq & variant analysis using HISAT2, StringTie, and DESeq2 Bash · Python · R
Salifort Motors Predictive modelling to understand drivers of employee turnover and inform retention strategy XGBoost · NumPy · SciPy · scikit-learn · Pandas · Statsmodels
Tik Tok Project Exploratory analysis of engagement metrics to uncover content trends and optimisation levers Matplotlib · Seaborn · Plotly · SciPy
Bellabeat Case Study Fitbit data analysis and Tableau dashboard R · dplyr · Tableau
AWS Solution Architecture Cloud deployment diagrams & IaC design AWS · ECS · S3 · Aurora
Cyclistic BI Capstone Data integration and visualization for business insights BigQuery · Tableau
Portfolio Website Personal website showcasing bioinformatics and data projects HTML · CSS · JS

Education & Certifications

MSc Bioinformatics - Atlantic Technological University (Remote) - 2025–Present
MSc Cell & Gene Therapy - University College London - 2021–2023
BSc Biomedical Science - University of Catania - 2014–2017

Certifications
Google: Data Analytics · Advanced Data Analytics · IT Automation with Python · Project Management · Business Intelligence; Cloud: Architecting with Google Kubernetes Engine
AWS: Cloud Practitioner Essentials · Cloud Solutions Architect
Bioinformatics: Johns Hopkins University Genomic Data Science Specialization Wellcome: Bioinformatics for Biologists: An Introduction to Linux, Bash Scripting, and R; Analysing and Interpreting Genomics Datasets Data Science: freeCodeCamp: Data Analysis with Python; Relational Databases; Scientific Computing with Python & Databases; DE<code>LIFE: Genomes, Networks & Pathways; Data Science & Machine Learning with Python


Currently Exploring

  • Ecological and Conservational Data Science
  • Protein–protein interaction (PPI) networks with Cytoscape
  • Graph theory & network analysis in genomics
  • Cloud-native workflows with Docker & Snakemake
  • Data visualisation storytelling (Tableau + HTML integration)

Connect with Me

Pinned Loading

  1. Genomic_Data_Science_Specialization Genomic_Data_Science_Specialization Public

    Jupyter Notebook

  2. Google_Advanced_Data_Analytics-Tik_Tok_Project Google_Advanced_Data_Analytics-Tik_Tok_Project Public

    Jupyter Notebook

  3. Google_Advanced_Data_Analytics-Salifort_Motors Google_Advanced_Data_Analytics-Salifort_Motors Public

    HTML

  4. Google_Data_Analytics-Bellabeat-Project Google_Data_Analytics-Bellabeat-Project Public

    Bellebeat project from Google Data Analytics Certificate

    R

  5. AWS-Solution-Architect AWS-Solution-Architect Public

    High-level AWS architecture design for migrating on-premises workloads to a cloud-native, fully managed solution, ensuring scalability, fault-tolerance, and operational efficiency.

  6. Google_Business_Intelligence---Google-Fiber Google_Business_Intelligence---Google-Fiber Public