Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Curated and evolving portfolio of my computational genomics and bioinformatics projects, including data modeling, phylogenetic profiling, and multi-omics integration.

License

Notifications You must be signed in to change notification settings

Shalev-CompBio/Shalev-Evolutionary-Genomics-Portfolio

Repository files navigation

Evolutionary Genomics & Multi-Omics Portfolio

Genomics Language Institution

🧬 Decoding Disease Architecture Through Evolutionary Signals

Shalev Yaacov | M.Sc. Researcher @ Hebrew University (Tabach Lab)
Evolutionary gene clustering, multi-omics integration, and phylogenetic profiling for uncovering IRD & cilia-related disease networks.


📌 Overview

This portfolio gathers the computational frameworks, pipelines, and visualization tools I develop for studying Inherited Retinal Diseases (IRD) and Ciliopathies.

My work focuses on addressing missing heritability in genetic disorders. By integrating Normalized Phylogenetic Profiling (NPP) with transcriptomic, phenotypic, and molecular datasets, I aim to uncover evolutionarily conserved gene modules and new disease candidates.

Note: All datasets in this repository are synthetic, transformed, or strictly demonstrative to ensure confidentiality and compliance with institutional agreements.


🎨 Visual Summary of My Evolutionary Genomics Portfolio

To quickly explore my work visually, check out the interactive portfolio:

Interactive View: Click here to launch

(It includes evolutionary genomics tools, multi-omics visualizations, and project overviews.)


🔬 Key Projects

  • Goal: Systematic characterization of precomputed ciliary gene clusters using evolutionary and functional annotations.
  • Approach: Modular analytical pipeline evaluating cluster composition, annotations, enrichment, and evolutionary or phenotypic coherence.
  • Includes:
    • Automated annotation and GO-term enrichment.
    • Internal-distance and similarity metrics.
    • Detection of submodules and coherent functional subsets.
    • Visualizations of gene-gene relationships and conserved signatures.

  • Goal: Identify phenotype-driven gene modules within IRD-associated gene sets.
  • Approach: Semantic similarity-based clustering using HPO annotations, designed for integration within a broader multi-omics framework.
  • Includes:
    • Silhouette validation and stability assessment.
    • Phenotype similarity matrices and semantic clustering.
    • Prioritization support for unsolved IRD gene candidates (synthetic demonstration only).

  • Goal: Create clear, publication-ready heatmaps and similarity visualizations supporting exploratory multi-omics analyses.
  • Tools: Custom Python and R scripts for species-aligned matrices, clustered heatmaps, annotation layers, and comparison plots.

  • Goal: Demonstrate Local Barcode Segmentation (LBS)-inspired evolutionary profiling using synthetic data.
  • Includes:
    • Synthetic NPP matrices and barcode segments.
    • Segmentation and consensus-profile workflows.
    • Signal vs. noise validation and benchmarking experiments.

🛠️ Tech Stack & Methodology

  • Languages: Python (Pandas, Scikit-learn, SciPy), R (ggplot2, pheatmap, ComplexHeatmap).
  • Core Methods:
    • Normalized Phylogenetic Profiling (NPP).
    • Comparative genomics & Supervised clustering.
    • Multi-omics data integration.
    • Feature engineering & ML-based prioritization.

📬 Contact & Affiliation

  • Lab: Prof. Yuval Tabach Lab, Faculty of Medicine.
  • Institution: Hebrew University of Jerusalem.
  • Role: M.Sc. Candidate in Genomics & Bioinformatics.
  • Location: Jerusalem, Israel.

This repository evolves alongside my thesis research and will continue to expand with new analyses, tools, and polished workflows.

About

Curated and evolving portfolio of my computational genomics and bioinformatics projects, including data modeling, phylogenetic profiling, and multi-omics integration.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published