Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View geocarvalho's full-sized avatar
🐼
🐼

Highlights

  • Pro

Organizations

@uclanelsonlab

Block or report geocarvalho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
geocarvalho/README.md

About Me

I’m George Carvalho, a Healthcare Data Scientist and Bioinformatician specializing in clinical genomics, large-scale sequencing analysis, and workflow automation. I build scalable, reproducible pipelines and data systems that transform complex genomic data into actionable insights for research and patient care.

Currently at UCLA Health, I analyze whole genome and RNA sequencing data for rare disease patients, develop cloud-based pipelines, and generate clinical genomics reports that support diagnostic decision-making. My work focuses on improving variant detection, structural variation analysis, and the efficiency and reliability of genomic workflows.

Previously, I worked at G42 Healthcare, developing large-scale genomics analysis solutions, and at Hospital Israelita Albert Einstein, supporting clinical sequencing pipelines and validation processes. These experiences strengthened my ability to operate in regulated clinical environments and collaborate across multidisciplinary teams.

My technical expertise spans Python-based data science, workflow orchestration (Nextflow, Snakemake, WDL), cloud computing (AWS HealthOmics, DNAnexus), and multi-platform sequencing technologies, including Illumina, PacBio, and Oxford Nanopore. I am particularly interested in building a robust bioinformatics infrastructure that enables precision medicine and accelerates scientific discovery.

I thrive in collaborative environments where science, engineering, and medicine intersect, and I am driven by the opportunity to develop solutions that directly improve patient outcomes.

Core Strengths

  • Clinical genomics & rare disease analysis
  • NGS pipeline development & workflow automation
  • Cloud & high-performance computing (AWS, DNAnexus)
  • Variant calling, CNV & structural variation analysis
  • RNA-seq and multi-omics integration
  • Reproducible research & data engineering best practices

Interests

  • Professional interests: I enjoy working in clinical bioinformatics, building and optimizing analysis pipelines, evaluating new tools and methodologies, and strengthening the statistical foundations behind genomic data interpretation.

  • Personal interests: Outside of work, I value spending quality time with my wife and friends, exploring new places together, and trying different cuisines. I also enjoy experimenting with new sports and practice Brazilian Jiu-Jitsu as my primary form of exercise and stress relief.


Feel free to get in touch if you'd like to collaborate on a project.

Pinned Loading

  1. sv-cnv-studies sv-cnv-studies Public

    📚 Relevant papers for CNV and SV approaches

    94 16

  2. genomika/pandas-workshop genomika/pandas-workshop Public

    I Summer Workshop de Análise de Dados com Python e Mr. Pandas

    Jupyter Notebook 5

  3. Material de docker do "descomplicand... Material de docker do "descomplicando docker 2016"
    1
    # Descomplicando Docker 2016
    2
    ## [LinuxTips YouTube](https://www.youtube.com/watch?v=0xxHiOSJVe8&list=PLf-O3X2-mxDkiUH0r_BadgtELJ_qyrFJ_&index=1)
    3
    ## [O que é Docker? | Mundo Docker](https://www.mundodocker.com.br/o-que-e-docker/)
    4
    ---
    5
    
                  
  4. uclanelsonlab/nl-rna-seq_wf uclanelsonlab/nl-rna-seq_wf Public

    RNA-seq for rare diseases pipeline using nextflow

    Nextflow 1

  5. uclanelsonlab/wgs_qc_wf uclanelsonlab/wgs_qc_wf Public

    Workflow used to check QC for WGS data

    Nextflow

  6. uclanelsonlab/nl-wgs_wf uclanelsonlab/nl-wgs_wf Public

    Whole genome sequecing pipeline for germline short-read data

    Nextflow