Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View JohnPaulinePineda's full-sized avatar

Block or report JohnPaulinePineda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
JohnPaulinePineda/README.md

šŸ‘‹ Hi, I'm John!


LinkedIn Badge Ā  Ā  Researchgate Badge Ā  Ā  Github Badge Ā  Ā  Googlescholar Badge Ā  Ā  Tableaupublic Badge

I’m a Data Science and Machine Learning Enthusiast with a unique background that bridges:

  • ⚔ Electronics Engineering
  • šŸ“Š Applied Statistics

šŸ’¼ Professional Background

I bring experience from diverse, high-impact industries — including:

  • āš™ļø High-End Electronics Manufacturing
  • šŸ’³ Fintech
  • 🧬 Biotechnology R&D

My work spans across domains where Data-Driven Insights, Engineering Precision, and Statistical Rigor intersect to solve real-world problems.

šŸš€ What Drives Me

I'm passionate about Lifelong Learning and creating Innovative, End-to-End Projects that:

  • šŸ¤– Harness the power of data and machine learning
  • šŸ› ļø Deliver practical, real-world solutions
  • šŸ“š Contribute to open knowledge

In my spare time, you’ll find me experimenting with cutting-edge ML tools and applying them to meaningful use cases.

šŸ“ Project Showcase

I’ve briefly organized my work below — including:

  • Machine Learning Applications: End-to-end predictive modeling projects showcasing complete pipelines from data to deployment with live interactive web apps.
  • Machine Learning Case Studies: In-depth, statistically rigorous studies that emphasize exploratory modeling, validation, and interpretability.
  • Machine Learning Exploratory Projects: Targeted experiments exploring specific machine learning lifecycle components, techniques, or technologies in isolation.
  • Visual Analytics Projects: Interactive dashboards designed for intuitive data exploration and visual storytelling.
  • Scientific Research Papers: Peer-reviewed co-authored research contributions applying data science methods in academic and scientific contexts.
  • šŸ”µ Completed Projects
  • šŸ”“ Ongoing Work

For a deeper dive into my projects, methodologies, and tools used, check out my šŸ“Œ Project Portfolio Website

Thanks for visiting my GitHub! šŸ»

Feel free to šŸ” explore, šŸ¤ connect, or šŸ‘„ collaborate.


🧠 Machine Learning Applications

Tools Project Title Status Link
Python Badge
Jupyter Badge
Github Badge
Streamlit Badge
Classifying Brain Tumors from Magnetic Resonance Images by Leveraging Convolutional Neural Network-Based Multilevel Feature Extraction and Hierarchical Representation šŸ”µ Notebook
Repository
Application
Python Badge
Jupyter Badge
Github Badge
Streamlit Badge
Estimating Heart Failure Survival Risk Profiles From Cardiovascular, Hematologic And Metabolic Markers šŸ”µ Notebook
Repository
Application
Python Badge
Jupyter Badge
Github Badge
Streamlit Badge
Estimating Lung Cancer Probabilities From Demographic Factors, Clinical Symptoms And Behavioral Indicators šŸ”µ Notebook
Repository
Application

šŸ“ Machine Learning Case Studies

Tools Project Title Status Link
Python Badge
Jupyter Badge
Learning Hierarchical Features for Predicting Multiclass X-Ray Images using Convolutional Neural Network Model Variations šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Discovering Global Patterns in Cancer Mortality Across Countries Via Clustering Analysis šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Identifying Contributing Factors for Countries With High Cancer Rates Using Classification Algorithms With Class Imbalance Treatment šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Uncovering Underlying Constructs of Chronic Disease Indicators Across US States Using Exploratory and Confirmatory Factor Analyses šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Characterizing Life Expectancy Drivers Across Countries Using Model-Agnostic Interpretation Methods for Black-Box Models šŸ”µ Notebook
Repository

šŸ”¬ Machine Learning Exploratory Projects

Tools Project Title Status Link
Python Badge
Jupyter Badge
NannyML Badge
Detecting and Analyzing Machine Learning Model Drift Using Open-Source Monitoring Tools šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
MlFlow Badge
Machine Learning Model Experiment Logging and Tracking Using Open-Source Frameworks šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Detecting and Evaluating Anomalies in Categorical Data Under Supervised and Unsupervised Settings šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
FastAPI Badge
Streamlit Badge
Docker Badge
DockerHub Badge
Render Badge
Containerizing and Deploying Machine Learning API Endpoints on Open-Source Platforms šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Leveraging Ensemble Learning With Bagging, Boosting, Stacking and Blending Approaches šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
FastAPI Badge
Flask Badge
Exploring Modular Application Programming Interface Frameworks For Serving Model Predictions šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Exploring Parametric Accelerated Failure Time Models for Estimating Lifetimes in Survival Data šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Implementing Shapley Additive Explanations for Interpreting Feature Contributions in Penalized Cox Regression šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Modelling Right-Censored Survival Time and Status Responses for Prediction šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Exploring Regularization Approaches for Controlling Model Complexity Through Weight Penalization for Neural Network Classification šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Comparing Optimization Algorithms in Parameter Updates and Loss Function Minimization for Neural Network Classification šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Exploring Activation Functions And Backpropagation Gradient Updates for Neural Network Classification šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Comparing Batch, Stochastic and Mini-Batch Approaches to Gradient Descent in Estimating Regression Coefficients šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Implementing Backpropagation In Updating Weights for Neural Network Classification šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Implementing Gradient Descent Algorithm in Estimating Regression Coefficients šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Data Quality Assessment, Preprocessing and Exploration for a Classification Modelling Problem šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Exploring Penalized Models for Predicting Numeric Responses šŸ”µ Notebook
Repository
Python Badge
Jupyter Badge
Data Quality Assessment, Preprocessing and Exploration for a Regression Modelling Problem šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Exploring Boosting, Bagging and Stacking Algorithms for Ensemble Learning šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Discovering Latent Variables in High-Dimensional Data using Exploratory Factor Analysis šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Exploring Penalized Models for Handling High-Dimensional Survival Data šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Exploring and Visualizing Extracted Dimensions from Principal Component Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Sample Size and Power Calculations for Tests Comparing Proportions in Clinical Research šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Sample Size and Power Calculations for Tests Comparing Means in Clinical Research šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Comparing Oversampling and Undersampling Algorithms for Class Imbalance Treatment šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Exploring Performance Evaluation Metrics for Survival Prediction šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Exploring Robust Logistic Regression Models for Handling Quasi-Complete Separation šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Estimating Outlier Scores Using Density and Distance-Based Anomaly Detection Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Estimating Outlier Scores Using Isolation Forest-Based Anomaly Detection Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Identifying Multivariate Outliers Using Density-Based Clustering Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Exploring Dichotomization Thresholding Strategies for Optimal Classification šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Implementing Gradient Descent Algorithm in Estimating Regression Coefficients šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Formulating Segmented Groups Using Clustering Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Extracting Information Using Dimensionality Reduction Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Remedial Procedures for Skewed Data with Extreme Outliers šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Selecting Informative Predictors Using Simulated Annealing and Genetic Algorithms šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Selecting Informative Predictors Using Univariate Filters šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Selecting Informative Predictors Using Recursive Feature Elimination šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Evaluating Model-Independent Feature Importance for Predictors with Dichotomous Categorical Responses šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Evaluating Model-Independent Feature Importance for Predictors with Numeric Responses šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Cost-Sensitive Learning for Severe Class Imbalance šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Remedial Procedures in Handling Imbalanced Data for Classification šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Evaluating Hyperparameter Tuning Strategies and Resampling Distributions šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Modelling Multiclass Categorical Responses for Prediction šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Modelling Dichotomous Categorical Responses for Prediction šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Modelling Numeric Responses for Prediction šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Resampling Procedures for Model Hyperparameter Tuning and Internal Validation šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Clinical Research Prediction Model Development and Evaluation for Prognosis šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Missing Data Pattern Analysis, Imputation Method Evaluation and Post-Imputation Diagnostics šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Survival Analysis and Descriptive Modelling for a Three-Group Right-Censored Data with Time-Independent Variables Using Cox Proportional Hazards Model šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Survival Analysis and Descriptive Modelling for a Two-Group Right-Censored Data with Time-Independent Variables Using Cox Proportional Hazards Model šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Treatment Comparison Tests Between a Single Two-Level Factor Variable and a Single Numeric Response Variable šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Data Quality Assessment, Preprocessing and Exploration for a Regression Modelling Problem šŸ”µ Notebook
Repository
R Badge
RStudio Badge
Data Quality Assessment, Preprocessing and Exploration for a Classification Modelling Problem šŸ”µ Notebook
Repository

🧮 Visual Analytics Projects

Tools Project Title Status Link
Tableau Badge Dashboard Development with Slice-and-Dice Exploration Features šŸ”µ Dashboard
Tableau Badge Dashboard Development with Dynamic Filtering Features šŸ”µ Dashboard
Tableau Badge Dashboard Development with Longitudinal Change Tracking Features šŸ”µ Dashboard
Tableau Badge Dashboard Development with What-If Scenario Analysis Features šŸ”µ Dashboard
Tableau Badge Dashboard Development with Period-To-Date Performance Tracking Features šŸ”µ Dashboard

šŸ“š Scientific Research Papers

Tools Project Title Status Link
R Badge
RStudio Badge
Genomic Imprinting Biomarkers for Cervical Cancer Risk Stratification šŸ”µ Publication
R Badge
RStudio Badge
High Diagnostic Accuracy of Epigenetic Imprinting Biomarkers in Thyroid Nodules šŸ”µ Publication
R Badge
RStudio Badge
Epigenetic Imprinting Alterations as Effective Diagnostic Biomarkers for Early-Stage Lung Cancer and Small Pulmonary Nodules šŸ”µ Publication
R Badge
RStudio Badge
Novel Visualized Quantitative Epigenetic Imprinted Gene Biomarkers Diagnose the Malignancy of Ten Cancer Types šŸ”µ Publication
SPSS Badge
New Thyroid Imaging Reporting and Data System (TIRADS) Based on Ultrasonography Features for Follicular Thyroid Neoplasms: A Multicenter Study šŸ”µ Publication
R Badge
RStudio Badge
Advancing Malignancy Risk Stratification for Early-Stage Cancers in Lung Nodules by Combined Imaging and Electrical Impedance Analysis šŸ”µ Abstract
R Badge
RStudio Badge
Intronic Noncoding RNA Expression of DCN is Related to Cancer-Associated Fibroblasts and NSCLC Patients’ Prognosis šŸ”µ Abstract
R Badge
RStudio Badge
Epigenetic Imprinted Genes as Biomarkers for the Proactive Detection and Accurate Presurgical Diagnosis of Small Lung Nodules šŸ”µ Abstract
R Badge
RStudio Badge
Effect of Epigenetic Imprinting Biomarkers in Urine Exfoliated Cells (UEC) on the Diagnostic Accuracy of Low-Grade Bladder Cancer šŸ”µ Abstract
R Badge
RStudio Badge
Epigenetic Imprinted Gene Biomarkers Significantly Improve the Accuracy of Presurgical Bronchoscopy Diagnosis of Lung Cancer šŸ”µ Abstract
R Badge
RStudio Badge
Quantitative Chromogenic Imprinted Gene In Situ Hybridization (QCIGISH) Technique Could Diagnose Lung Cancer Accurately šŸ”µ Abstract

šŸ’» GitHub Stats


Pinned Loading

  1. Portfolio_Project_62 Portfolio_Project_62 Public

    Data science project which demonstrates experiment tracking in Python by implementing key features such as parameter logging, metric tracking, artifact storage, run comparison, and experiment organ…

    Jupyter Notebook 1

  2. Portfolio_Project_61 Portfolio_Project_61 Public

    Data science project which investigates multiple algorithms for identifying unusual patterns in categorical datasets under two distinct evaluation strategies involving the presence and absence of o…

    Jupyter Notebook

  3. Portfolio_Project_60 Portfolio_Project_60 Public

    Data science project which explores containerization and cloud deployment of machine learning applications in Python.

    Jupyter Notebook

  4. Portfolio_Project_59 Portfolio_Project_59 Public

    Data science project which explores boosting, bagging, stacking and blending ensemble learning to refine predictions by integrating diverse learning approaches in Python.

    Jupyter Notebook

  5. Portfolio_Project_58 Portfolio_Project_58 Public

    Data science project which explores different RESTful approaches to model deployment using Python.

    Jupyter Notebook

  6. Portfolio_Project_56 Portfolio_Project_56 Public

    Data science model deployment project aimed at developing convolutional neural network models for both detecting and classifying brain tumors from magnetic resonance images and serving a web applic…

    Jupyter Notebook