tf-shell

Train models with strong privacy, even when you can't trust anyone.

tf-shell is a TensorFlow extension that uses Homomorphic Encryption (HE) to train models with centralized label differential privacy (Label DP) guarantees, without requiring a trusted third party.

It's built for the "vertically partitioned" scenario where one party has the features and another party has the labels. This library implements the protocols from the Hadal research paper to securely train a model without the feature-holder ever seeing the plain-text labels.

This is not an officially supported Google product.

Getting Started

pip install tf-shell

See ./examples/ for how to use the library.

The Problem: Centralized DP without a Trusted Party

When training with high privacy requirements (e.g., $\epsilon \le 1$), centralized DP (like DP-SGD) provides much higher model accuracy than local DP (like randomized response). However, centralized DP traditionally requires a trusted third-party "curator" that can see all the data (or at least the secret labels) to compute and noise the gradients.

This is a problem when data is vertically partitioned:
- Party F has the features and wants to train the model.
- Party L has the secret labels and cannot share them.

How can Party F train its model using Party L's labels without a trusted intermediary?
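For intuition about the local-DP baseline mentioned above, here is a minimal sketch of randomized response on binary labels. It is not part of tf-shell, and the function names are made up for illustration. Each label is reported truthfully with probability $e^\epsilon / (1 + e^\epsilon)$ and flipped otherwise, so at $\epsilon = 1$ roughly 27% of the labels reach the trainer flipped.

```python
import math
import random


def keep_probability(epsilon: float) -> float:
    """Probability that randomized response reports the true binary label."""
    return math.exp(epsilon) / (1.0 + math.exp(epsilon))


def randomized_response(label: int, epsilon: float, rng: random.Random) -> int:
    """Report a binary label truthfully with keep_probability, else flip it."""
    if rng.random() < keep_probability(epsilon):
        return label
    return 1 - label


rng = random.Random(0)  # fixed seed so the sketch is repeatable
labels = [rng.randint(0, 1) for _ in range(10_000)]
noisy = [randomized_response(y, 1.0, rng) for y in labels]
flipped = sum(a != b for a, b in zip(labels, noisy)) / len(labels)
print(f"keep probability at epsilon=1: {keep_probability(1.0):.3f}")
print(f"observed fraction of flipped labels: {flipped:.3f}")
```

Label noise of this size is applied before training even starts, which is the accuracy cost that a centralized mechanism avoids by noising the aggregated gradients instead.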

The Solution: HE-based Backpropagation

tf-shell uses Homomorphic Encryption (via Google's SHELL library) to cryptographically "simulate" the trusted curator. The core technical idea is based on the Features-And-Model-vs-Labels (FAML) data partitioning:

1. Forward Pass (Plaintext): Party F (with features and the model) computes the entire forward pass in plaintext, right up to the final layer's logits.
2. Encrypted Labels: Party L encrypts its batch of labels using HE and sends the single ciphertext to Party F.
3. Backward Pass (Encrypted): The gradient of the loss (e.g., CCE with Softmax) is often a simple affine function of the labels (like $\hat{y} - y$). Party F can compute this step homomorphically using its plaintext logits and Party L's encrypted labels.
4. Model Update: Party F finishes the backpropagation, adds the required DP noise, and updates its model weights.
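The encrypted backward pass can be sketched in a few lines. This is an illustration only and does not use the tf-shell API: `MockCiphertext` is a hypothetical stand-in that holds its value in the clear and merely restricts Party F to the operations an additively homomorphic scheme would offer (ciphertext addition, plaintext addition, plaintext scaling). It shows that the final-layer gradient for softmax with CCE, $x_i(\hat{y}_k - y_k)$, needs nothing beyond those operations on the labels.

```python
import math


class MockCiphertext:
    """Stand-in for an encrypted number. NOT encryption: it only limits the
    available operations to those of an additively homomorphic scheme."""

    def __init__(self, value: float):
        self._value = value  # would be hidden from Party F in a real scheme

    def __add__(self, other: "MockCiphertext") -> "MockCiphertext":
        return MockCiphertext(self._value + other._value)

    def add_plain(self, plaintext: float) -> "MockCiphertext":
        return MockCiphertext(self._value + plaintext)

    def scale(self, plaintext: float) -> "MockCiphertext":
        return MockCiphertext(self._value * plaintext)

    def decrypt(self) -> float:
        return self._value


def softmax(logits):
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Party F: features x, final-layer weights W (inputs x classes), plaintext forward pass.
x = [0.5, -1.0, 2.0]
W = [[0.1, -0.2], [0.0, 0.3], [-0.4, 0.2]]
num_classes = 2
logits = [sum(x[i] * W[i][k] for i in range(len(x))) for k in range(num_classes)]
y_hat = softmax(logits)

# Party L: one-hot label, handed to Party F only in "encrypted" form.
y_plain = [0.0, 1.0]
y_enc = [MockCiphertext(v) for v in y_plain]

# Party F: d(loss)/d(logit_k) = y_hat_k - y_k, an affine function of the label.
dlogits_enc = [y_enc[k].scale(-1.0).add_plain(y_hat[k]) for k in range(num_classes)]
# d(loss)/d(W[i][k]) = x_i * (y_hat_k - y_k): scale each ciphertext by plaintext x_i.
grad_enc = [[dlogits_enc[k].scale(x[i]) for k in range(num_classes)] for i in range(len(x))]

# Check against the ordinary plaintext gradient.
grad_plain = [[x[i] * (y_hat[k] - y_plain[k]) for k in range(num_classes)] for i in range(len(x))]
grad_dec = [[c.decrypt() for c in row] for row in grad_enc]
print(grad_dec)
```

The `decrypt` call exists only to check the arithmetic. How and where decryption and noising happen in the real protocol is not covered by this sketch.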

The result is a model trained with the high utility of centralized DP, but Party F never sees Party L's individual labels.
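The noising in the model-update step can be pictured with the standard Gaussian mechanism from DP-SGD: clip each per-example gradient to an L2 norm bound, sum the clipped gradients, then add Gaussian noise scaled to that bound. Below is a minimal plaintext sketch under those assumptions. The parameter names `clip_norm` and `noise_multiplier` are illustrative, not tf-shell options, and the noise distribution and the way it is applied in tf-shell may differ.

```python
import math
import random


def clip(gradient, clip_norm):
    """Scale a gradient vector down so its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in gradient))
    factor = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * factor for g in gradient]


def noisy_sum(per_example_grads, clip_norm, noise_multiplier, rng):
    """Sum clipped gradients, then add Gaussian noise with
    standard deviation noise_multiplier * clip_norm to each coordinate."""
    clipped = [clip(g, clip_norm) for g in per_example_grads]
    total = [sum(column) for column in zip(*clipped)]
    sigma = noise_multiplier * clip_norm
    return [t + rng.gauss(0.0, sigma) for t in total]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
grads = [[3.0, 4.0], [0.3, -0.4], [-6.0, 8.0]]  # made-up per-example gradients
print(noisy_sum(grads, clip_norm=1.0, noise_multiplier=1.1, rng=rng))
```

Clipping bounds how much any single example can move the sum, which is what lets a fixed noise scale cover every example in the batch.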

What's Included?

The library is split into two packages:

- tf_shell: The base package. It integrates TensorFlow with the SHELL library, providing a ShellTensor type for basic HE-enabled computations.
- tf_shell_ml: The machine learning library. It implements two different protocols for the encrypted backpropagation step:
  - POSTSCALE: A novel protocol that is highly efficient for models with a low number of output classes (e.g., binary classification).
  - HE-DP-SGD: A more direct HE implementation of backpropagation, which is better suited for models with many output classes.
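For intuition about why the number of output classes can matter, consider the chain rule for a model with logits $z_1, \dots, z_K$ and softmax CCE loss $\mathcal{L}$. This is the general textbook identity, offered as a hedged illustration; it is not a description of how either protocol is implemented in tf_shell_ml.

$$
\frac{\partial \mathcal{L}}{\partial W} = \sum_{k=1}^{K} (\hat{y}_k - y_k)\,\frac{\partial z_k}{\partial W}
$$

The Jacobian terms $\partial z_k / \partial W$ depend only on the features and the model, so they involve no secret data. The labels enter only through the $K$ scalars $\hat{y}_k - y_k$. Evaluating the sum this way requires one Jacobian per output class, so the label-independent work grows with $K$ and is smallest for binary classification.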

Building

Build From Source

Install Bazel and Python 3, or use the devcontainer.

Run the tests.

bazel test //tf_shell/...
bazel test //tf_shell_ml/...  # Large tests, requires 128GB of memory.

Build the code.

bazel build //:wheel
bazel run //:wheel_rename

(Optional) Install the wheel, e.g. to try out the ./examples/. You may first need to copy the wheel out of the devcontainer's filesystem.
```
cp -f bazel-bin/*.whl ./  # Run in devcontainer if using.
```
Then install.
```
pip install --force-reinstall tf_shell-*.whl  # Run in target environment.
```

Note that the CPython API is not compatible across minor Python versions (e.g. 3.10, 3.11), so the wheel must be rebuilt for each Python version.
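To check which build a target environment needs, the snippet below prints the CPython tag of the running interpreter. It assumes the environment runs CPython and that the built wheel's filename follows the standard wheel naming convention, where the tag looks like `cp310` or `cp311`; the helper name `cpython_tag` is made up for illustration.

```python
import sys


def cpython_tag(version_info=sys.version_info) -> str:
    """Return the CPython tag (e.g. 'cp310') used in wheel filenames."""
    return f"cp{version_info[0]}{version_info[1]}"


print(f"Running interpreter: {sys.version.split()[0]} -> wheel tag {cpython_tag()}")
```

Run it in the target environment and compare the tag with the wheel filename before installing.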

Code Formatters and Counters

bazel run //:bazel_formatter
bazel run //:python_formatter
bazel run //:clang_formatter

cloc ./ --fullpath --not-match-d='/(bazel-.*|.*\.venv)/'

Update Python Dependencies

Update requirements.in and run the following to update the requirements files for each python version.

for ver in 3_9 3_10 3_11 3_12; do
  rm requirements_${ver}.txt
  touch requirements_${ver}.txt
  bazel run //:requirements_${ver}.update
done

bazel clean --expunge

If updating the TensorFlow dependency, other dependencies may also need to change, e.g. abseil (see MODULE.bazel). A mismatch usually shows up as a missing-symbol error in the tests when importing the TensorFlow DSO. In that case, c++filt will decode the mangled symbol name, and nm --defined-only .../libtensorflow_framework.so | grep ... may help find what the symbol changed to and which dependency is causing the error.

Contributing

See CONTRIBUTING.md for details.

License

Apache 2.0; see LICENSE for details.

Disclaimer

Convolutions on AMD-based platforms may fail due to known limitations of TensorFlow. This results in the following error when running tests:

CPU implementation of Conv3D currently only supports dilated rates of 1.

This project is not an official Google project. It is not supported by Google and Google specifically disclaims all warranties as to its quality, merchantability, or fitness for a particular purpose.
