Thanks to visit codestin.com
Credit goes to github.com

Skip to content

WiDS-ADHD-Prediction This project was developed for the 2024 Women in Data Science (WiDS) Datathon and focuses on improving the diagnosis of ADHD in women using biomedical, cognitive, and demographic data. It includes multiple predictive modeling pipelines (XGBoost, PCA + Random Forest + RNN) with fairness-aware evaluation and interpretability

itszoetom/WiDSdatathon

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

WiDS 2025: ADHD Prediction in Women

This repository contains our submission for the WiDS 2025 Datathon, focused on using biomedical, cognitive, and demographic data to improve diagnosis of ADHD in women — a population that is historically underdiagnosed. Our goal was to develop interpretable and performant models while surfacing gender- and age-related disparities in mental health diagnostics.

Motivation

ADHD is significantly underdiagnosed in adult women due to non-traditional symptom presentations, social masking, and systemic bias. This project explores whether machine learning models trained on behavioral and biomedical features can help reduce diagnostic gaps, and which features are most informative for identifying ADHD in women.

Folder Structure

model/
├── mk_XGB_model.ipynb          # XGBoost baseline model
├── mk_combo_model.ipynb        # Model stacking: XGB + NN + RF combo
├── mk_nn_model.ipynb           # Basic neural network model
├── pca_age.ipynb               # PCA with age feature engineering
├── pca_nn_model.ipynb          # Multi-output NN model (F1: 0.67)
├── pca_rf_nn.ipynb             # Main model: PCA + RF + NN hybrid
├── submissionXGB.csv           # Submission from XGB model
├── xg_boost.ipynb              # XGBoost tuning and evaluation

About

WiDS-ADHD-Prediction This project was developed for the 2024 Women in Data Science (WiDS) Datathon and focuses on improving the diagnosis of ADHD in women using biomedical, cognitive, and demographic data. It includes multiple predictive modeling pipelines (XGBoost, PCA + Random Forest + RNN) with fairness-aware evaluation and interpretability

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •