A toolkit for binning / categorisation optimisation with respect to signal significance in HEP analyses, using gradient-descent methods. gatohep relies on TensorFlow together with TensorFlow Probability.
The categorisation can be performed directly in a multidimensional discriminant space, e.g. the output of a multiclassifier with softmax activation. Bins are defined either by learnable multidimensional Gaussians forming a Gaussian Mixture Model (GMM) or, when working in 1D, by bin boundaries approximated as steep sigmoid functions with learnable positions.
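The GMM-based categorisation can be pictured as soft, differentiable bin assignments given by the responsibilities of the mixture components. The following is a minimal NumPy sketch of that idea in 1D, for illustration only; it is not the gatohep API, and all names here are hypothetical:

```python
import numpy as np

def gmm_responsibilities(x, means, sigmas, weights):
    """Soft bin assignment: responsibility of each Gaussian component
    for each event. Rows sum to 1, so assignments stay differentiable
    with respect to the component parameters (illustrative sketch only)."""
    x = np.asarray(x, dtype=float)
    # log N(x | mu_k, sigma_k), shape (n_events, n_components)
    log_pdf = (
        -0.5 * ((x[:, None] - means[None, :]) / sigmas[None, :]) ** 2
        - np.log(sigmas[None, :])
        - 0.5 * np.log(2.0 * np.pi)
    )
    log_post = log_pdf + np.log(weights)[None, :]
    log_post -= log_post.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(log_post)
    return post / post.sum(axis=1, keepdims=True)

# three events shared softly among three "bins"
resp = gmm_responsibilities(
    x=np.array([-1.0, 0.1, 2.0]),
    means=np.array([-1.0, 0.0, 2.0]),
    sigmas=np.array([0.5, 0.5, 0.5]),
    weights=np.array([1 / 3, 1 / 3, 1 / 3]),
)
```

Because the responsibilities are smooth functions of the means and widths, bin yields built from them can be optimised with ordinary gradient descent; a hard categorisation is recovered at the end by taking the argmax per event.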
See the full documentation at https://gato-hep.readthedocs.io/.
git clone https://github.com/FloMau/gato-hep.git
cd gato-hep
python3 -m venv gato_env # or use conda
source gato_env/bin/activate
pip install -e .

Dependencies are declared in pyproject.toml. Note: the only tricky part is finding matching versions of tensorflow, tensorflow-probability and ml-dtypes. The requirements mentioned here should work, but other combinations may work as well.
python examples/1D_example/run_toy_example.py
python examples/three_class_softmax_example/run_example.py

Each script writes plots and a significance comparison table.
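A common figure of merit for such significance tables in HEP is the per-bin Asimov significance combined in quadrature. The exact definition used by the example scripts may differ; this is a hedged, self-contained sketch of that standard formula:

```python
import math

def asimov_z(s, b):
    """Asimov significance Z_A = sqrt(2*((s+b)*ln(1 + s/b) - s)) for one bin,
    with expected signal yield s and background yield b."""
    if b <= 0.0:
        raise ValueError("background yield must be positive")
    return math.sqrt(2.0 * ((s + b) * math.log(1.0 + s / b) - s))

def combined_z(bins):
    """Combine per-bin significances in quadrature over (s, b) pairs."""
    return math.sqrt(sum(asimov_z(s, b) ** 2 for s, b in bins))

# two categories: a fairly pure one and a diluted one
z = combined_z([(10.0, 20.0), (5.0, 200.0)])
```

In the limit s << b the per-bin formula reduces to the familiar s / sqrt(b), which is why finer, purer categories tend to increase the combined significance.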
# standard GMM model for ND optimisation
from gatohep.models import gato_gmm_model
# more to be included here later on
# see ./examples for a full workflow!

gato-hep/                              project root
│
├─ pyproject.toml                      metadata + dependencies
├─ src/gatohep/                        installable Python package
│  ├─ __init__.py
│  ├─ models.py                        trainable model class
│  ├─ losses.py                        custom loss / penalty terms
│  ├─ utils.py                         misc helpers
│  ├─ plotting_utils.py                helper plots (stacked hists, bin boundaries, ...)
│  └─ data_generation.py               toy data generators (1D / 3-class softmax)
│
└─ examples/                           runnable demos
   ├─ 1D_example/run_example.py
   └─ three_class_softmax_example/run_example.py
git checkout -b feature/xyz

- Put code under src/gatohep/ and add tests under tests/.
- Update version in pyproject.toml.
- Run black, isort and pytest, then open a PR.