FedScore-Python

FedScore is a framework for developing scoring systems across multiple sites in a privacy-preserving way. This repo contains the Python code for the proposed method. The R implementation is available here.

See also this tiny paper for the results of FedScore-Python applied to real-world heterogeneous electronic health records (EHR) datasets.

Introduction

Cross-institutional collaboration has gained popularity in recent years as a way to accelerate medical research and facilitate quality improvement. Federated learning (FL) avoids data sharing by training algorithms collectively without exchanging patient-level data. However, most FL applications to medical imaging data use black-box models from computer vision. Interpretable models, by contrast, have seen fewer FL applications despite their popularity in clinical research.

As a type of interpretable risk scoring model, scoring systems have been employed in practically every diagnostic area of medicine. However, scoring systems have usually been created using single-source data, which limits their application at other sites if the development data has insufficient sample size or is not representative. Although it is possible to develop scoring systems on pooled data, pooling is time-consuming and difficult to achieve due to privacy restrictions.

To fill this gap, we propose FedScore, a first-of-its-kind framework for building federated scoring systems across multiple sites.

The figure below provides a high-level overview of the FedScore algorithm:

System requirements

Python 3.9
To install the required Python packages, run:

pip install -r requirements.txt

Test locally

In the FedScore directory, open three terminals. Start the server in the first terminal:

python server.py

Then start the two clients, one in each of the remaining terminals:

python client1.py
python client2.py
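
For convenience, the three-terminal procedure above could also be driven from a single script. The following is a minimal sketch, not part of the repository: the script names are taken from this README, while `build_commands`, `run_local_test`, the `workdir` default (which assumes the script is run from the repository root), and the startup delay are hypothetical choices made for illustration.

```python
import subprocess
import sys
import time

# Script names come from the README; everything else here is an assumption.
SERVER_SCRIPT = "server.py"
CLIENT_SCRIPTS = ["client1.py", "client2.py"]


def build_commands(python=sys.executable):
    """Return the server command and the list of client commands."""
    server_cmd = [python, SERVER_SCRIPT]
    client_cmds = [[python, script] for script in CLIENT_SCRIPTS]
    return server_cmd, client_cmds


def run_local_test(workdir="FedScore", startup_delay=5.0):
    """Start the server, wait briefly, start both clients, then wait for all."""
    server_cmd, client_cmds = build_commands()
    server = subprocess.Popen(server_cmd, cwd=workdir)
    try:
        # Assumed grace period so the server is listening before clients connect.
        time.sleep(startup_delay)
        clients = [subprocess.Popen(cmd, cwd=workdir) for cmd in client_cmds]
        for client in clients:
            client.wait()
        server.wait()
    finally:
        # Make sure the server does not outlive this script.
        if server.poll() is None:
            server.terminate()
```

Calling `run_local_test()` would then stand in for the three terminals. The output of all processes is interleaved in one console, so separate terminals remain easier to read when debugging.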

Note that the current implementation is a proof of concept that assumes the server and clients run on the same machine. It has not been adapted or tested for real-world commercial use. In addition, it supports only data with binary outcomes.
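
Because only binary outcomes are supported, it may be worth checking the outcome column before starting a client. The helper below is a hypothetical sketch, not a function from this repository. It assumes the outcome is available as a plain sequence of labels.

```python
def check_binary_outcome(labels):
    """Return the set of distinct labels, or raise if there are not exactly two."""
    distinct = set(labels)
    if len(distinct) != 2:
        raise ValueError(
            f"expected a binary outcome, found {len(distinct)} distinct value(s)"
        )
    return distinct
```

For example, `check_binary_outcome([0, 1, 1, 0])` passes, while a sequence with three distinct labels raises a `ValueError`.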

Supported FL Algorithms

We implemented FedScore using the Flower framework, so all common FL algorithms supported by Flower can be adopted for FedScore. It is also feasible to implement a custom FL strategy.

To apply a different FL algorithm, write a script for the server and place it in the Flower directory. Then edit FedScore/server.py (line 14) to use the new FL strategy.
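
To make concrete what a server-side FL strategy does, here is a dependency-free sketch of federated averaging (FedAvg), one of the strategies Flower provides. It is neither Flower's API nor FedScore's code: the function name `fedavg` and the flat-list representation of model parameters are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple


def fedavg(updates: Sequence[Tuple[List[float], int]]) -> List[float]:
    """Average client parameter vectors, weighted by each client's sample count.

    Each update is a (parameters, num_examples) pair from one client.
    """
    if not updates:
        raise ValueError("need at least one client update")
    total = sum(num_examples for _, num_examples in updates)
    if total <= 0:
        raise ValueError("total number of examples must be positive")

    dim = len(updates[0][0])
    aggregated = [0.0] * dim
    for params, num_examples in updates:
        if len(params) != dim:
            raise ValueError("all clients must send vectors of the same length")
        # Each client's contribution is proportional to its share of the data.
        for i, value in enumerate(params):
            aggregated[i] += value * num_examples / total
    return aggregated
```

With two clients holding 100 and 300 examples, `fedavg([([1.0, 2.0], 100), ([3.0, 4.0], 300)])` weights the second client three times as heavily as the first. A custom strategy would replace this aggregation rule with a different one while keeping the same round-based exchange between server and clients.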
