Himalaya [1] implements machine learning linear models in Python, focusing
on computational efficiency for large numbers of targets.
Use himalaya if you need a library that:
- estimates linear models on large numbers of targets,
- runs on CPU and GPU hardware,
- provides estimators compatible with
scikit-learn's API.
Himalaya is stable (with particular care for backward compatibility) and
open for public use (give it a star!).
import numpy as np
n_samples, n_features, n_targets = 10, 5, 4
np.random.seed(0)
X = np.random.randn(n_samples, n_features)
Y = np.random.randn(n_samples, n_targets)
from himalaya.ridge import RidgeCV
model = RidgeCV(alphas=[1, 10, 100])
model.fit(X, Y)
print(model.best_alphas_) # [ 10. 100. 10. 100.]- The model
RidgeCVuses the same API asscikit-learnestimators, with methods such asfit,predict,score, etc. - The model is able to efficiently fit a large number of targets (routinely used with 100k targets).
- The model selects the best hyperparameter
alphafor each target independently.
Check more examples of use of himalaya in the gallery of examples.
Himalaya was designed primarily for functional magnetic resonance imaging
(fMRI) encoding models. In depth tutorials about using himalaya for fMRI
encoding models can be found at gallantlab/voxelwise_tutorials.
Himalaya implements the following models:
- Ridge, RidgeCV
- KernelRidge, KernelRidgeCV
- GroupRidgeCV, MultipleKernelRidgeCV, WeightedKernelRidge
- SparseGroupLassoCV
See the model descriptions in the documentation website.
Himalaya can be used seamlessly with different backends.
The available backends are numpy (default), cupy, torch, and
torch_cuda.
To change the backend, call:
from himalaya.backend import set_backend
backend = set_backend("torch")and give torch arrays inputs to the himalaya solvers. For convenience,
estimators implementing scikit-learn's API can cast arrays to the correct
input type.
To run himalaya on a graphics processing unit (GPU), you can use either
the cupy or the torch_cuda backend:
from himalaya.backend import set_backend
backend = set_backend("cupy") # or "torch_cuda"
data = backend.asarray(data)- Python 3
- Numpy
- Scikit-learn
Optional (GPU backends):
- PyTorch (1.9+ preferred)
- Cupy
You may install the latest version of himalaya using the package manager
pip, which will automatically download himalaya from the Python Package
Index (PyPI):
pip install himalayaTo install himalaya from the latest source (main branch), you may
call:
pip install git+https://github.com/gallantlab/himalaya.gitDevelopers can also install himalaya in editable mode via:
git clone https://github.com/gallantlab/himalaya
cd himalaya
pip install --editable .If you use himalaya in your work, please give it a star, and cite our
publication:
| [1] | Dupré La Tour, T., Eickenberg, M., Nunez-Elizalde, A.O., & Gallant, J. L. (2022). Feature-space selection with banded ridge regression. NeuroImage. |