Use the function check_scalar for parameters validation

## Background / Objective

Use the function [`check_scalar`](https://scikit-learn.org/dev/modules/generated/sklearn.utils.check_scalar.html?highlight=check_scalar#sklearn.utils.check_scalar) for parameter validation. For a given parameter, the function checks that the value has an acceptable data type and that it lies within the allowed range of values ([interval](https://www.basic-mathematics.com/interval-notation.html)).

- References Issue [#20724](https://github.com/scikit-learn/scikit-learn/issues/20724): "Use check_scalar for parameters validation" (with notes by @glemaitre, @jjerphan, @genvalen)
- References PR [#20723](https://github.com/scikit-learn/scikit-learn/pull/20723).  "MNT use check_scalar to validate scalar in AffinityPropagation". This is an example PR by @glemaitre. 

A helper function exists in scikit-learn which validates a scalar value: `sklearn.utils.check_scalar` ([documentation](https://scikit-learn.org/stable/modules/generated/sklearn.utils.check_scalar.html)).
It is used to validate the scalar parameters of classes (and possibly functions). Most classes in scikit-learn do not currently use this helper function. We want to refactor the code so that it does. Using this standard helper will make error types and messages consistent.

If a scalar argument isn't being checked, we want to validate it using the `check_scalar` function. In some cases the argument is already checked, but without `check_scalar`; in those cases, we refactor the code. (Refactoring means making changes to the code that result in the same output as before.)

The function `check_scalar` is defined in [`scikit-learn/sklearn/utils/validation.py`](https://github.com/scikit-learn/scikit-learn/blob/6077d52b706d118c0d9fb1e69c254bc67e15b078/sklearn/utils/validation.py).
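
As a rough illustration of the two checks described above (data type and range), here is a minimal sketch. It assumes scikit-learn 1.0 or later; the helper `describe_check` and the parameter name `alpha` are hypothetical and exist only for this example.

```python
import numbers

from sklearn.utils import check_scalar


def describe_check(value, **kwargs):
    """Return "ok", or the name of the exception raised by check_scalar.

    `describe_check` is a hypothetical helper for illustration only.
    """
    try:
        check_scalar(value, name="alpha", target_type=numbers.Real, **kwargs)
    except (TypeError, ValueError) as exc:
        return type(exc).__name__
    return "ok"


# Valid: a real number inside the interval [0, 1].
print(describe_check(0.5, min_val=0, max_val=1))
# Wrong type: a string is not a real number, so a TypeError is raised.
print(describe_check("0.5", min_val=0, max_val=1))
# Right type, but outside the interval, so a ValueError is raised.
print(describe_check(1.5, min_val=0, max_val=1))
```

A wrong type and an out-of-range value raise different exception types, which is part of what makes the error handling consistent across estimators.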


## Prerequisites
This is an **Intermediate-level** issue for second-time contributors. It requires the following experience:
- You have already set up your working virtual environment.
- You have submitted at least one other pull request to this library. (You are familiar with using git and submitting pull requests.)
- You are familiar with the scikit-learn code base.
- You have experience using [`pytest`](https://docs.pytest.org/en/6.2.x/).
- To find the range of possible values for an estimator's parameters, check whether validation code has already been written in the scikit-learn library; it may contain that information.
- Sometimes validation code is not available in the scikit-learn library.  It is helpful to be familiar with the acceptable range of values (minimum and maximum) for the arguments for the estimator you are working on. If you are not familiar with an estimator, you can reference other sources outside of scikit-learn documentation to get that information. 


## Steps
- [ ] Make sure you have activated your virtual environment. 
- [ ] Make sure you have created a separate branch from `main` before editing files for your new contribution. Refer to our [contributing guidelines](https://scikit-learn.org/dev/developers/contributing.html#how-to-contribute) for more information.
- [ ] Find a class whose constructor has scalar numeric parameters. Some are listed below in the "Classes to Update" section.
- [ ] Work on one estimator at a time and submit each in a separate pull request. 
- [ ] Identify the scalar numeric parameters (those of type `int`, `float`) for that class. 
    - Examples of scalar parameters are: `alpha`, `damping`, `max_iter`, `convergence_iter`, `tol`, and `verbose`.
    - You can infer whether a parameter is a scalar by looking at its type in the documentation.
    - Example PR: [`AffinityPropagation` scalar parameters]( https://github.com/scikit-learn/scikit-learn/pull/20723/files#diff-62083de22888eadb572404f8f7255a19a74370eeaf2a893858b066d90ada979eL273-L285) 
- [ ] For each of the scalar numeric parameters, determine the acceptable range of values. Look at minimum and maximum values. Sometimes that information is included in the parameter definition in the documentation. Sometimes you may need to reference other sources. If minimum and maximum values are missing, we should add them.
- [ ] Add tests. Note: the tests _must_ fail before adding validation. Example PR by @glemaitre [added a parametrised test for parameters](https://github.com/scikit-learn/scikit-learn/pull/20723/files#diff-35c6902baaa6b79819df8746c45a68f5d9057003fcd4189ac1d44213ac1eced2R76-R95).
- [ ] If any of the class's scalar numeric parameters are not yet checked with `check_scalar`, those are the ones to add validation for.
- [ ] Validation belongs in the `fit` method. Here, "validation" means adding `check_scalar` calls to the class. **Add `check_scalar` calls where needed**. Generally, this is not done in the constructor but rather just before calling the core of the method. For instance, in the case of #20723, [@glemaitre added `check_scalar` calls just before the call to `affinity_propagation` which is the core of the method.](https://github.com/scikit-learn/scikit-learn/pull/20723/files#diff-62083de22888eadb572404f8f7255a19a74370eeaf2a893858b066d90ada979eR460-R475)

### Notes
- [ ] The pull request can be named:  "MAINT Use check_scalar to validate scalar in: [EstimatorName]"
- [ ] Work on one estimator at a time and submit each in a separate pull request. 
- [ ] Within an estimator there may be multiple scalar arguments. (For one estimator, validation for multiple arguments - should be submitted in one pull request.)
- [ ] Include explicit parameter names (even if they are not required), as a best practice. In this function, `name` is *not* a keyword-only parameter, so passing it by keyword is not required. You should still include the keyword in the function call for readability.
```python
import numbers

from sklearn.utils import check_scalar

# Inside the estimator's `fit` method:
check_scalar(
    self.learning_rate,
    name="learning_rate",
    target_type=numbers.Real,
    min_val=0,
    max_val=None,  # default
    include_boundaries="both",  # default
)
```

### Tests
Suggestion: You may want to write the test before writing the validation code. Writing the test first shows you where validation already exists. If it does exist, it tells you the range of acceptable values, and the test lets you confirm it.

Generally speaking, this is how to connect the `.py` file with its associated test. Check to see if the test exists in the `test_*.py` file. If it does not, we will need to create a test.  
- Where the class is:  `sklearn/cluster/_affinity_propagation.py`
- Where the related class test file is:  `sklearn/cluster/tests/test_affinity_propagation.py`
- The name of the test:  `def test_affinity_propagation_params_validation(....)`

The point of a test is that if an incorrect parameter value is given, the program gives an error message.  We want to test for values that are outside of the acceptable range. We want to make sure the program is catching that.
To run an individual validation test, here are examples of the code to run at the terminal:    
- `pytest sklearn/cluster/tests/test_affinity_propagation.py::test_affinity_propagation_params_validation`
- `pytest sklearn/linear_model/_glm/tests/test_glm.py::test_glm_max_iter_argument`
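
A parametrised validation test might look roughly like the sketch below. It is not the test from the referenced PR: `ToyEstimator` is a hypothetical stand-in for the class under test, and the pattern passed to `match` only checks that the parameter name appears in the message, because the exact wording is assumed to vary between scikit-learn versions.

```python
import numbers

import pytest
from sklearn.utils import check_scalar


class ToyEstimator:
    """Hypothetical estimator; stands in for the class under test."""

    def __init__(self, max_iter=100):
        self.max_iter = max_iter

    def fit(self, X, y=None):
        check_scalar(
            self.max_iter,
            name="max_iter",
            target_type=numbers.Integral,
            min_val=1,
        )
        return self


@pytest.mark.parametrize(
    "params, err_type",
    [
        ({"max_iter": 0}, ValueError),  # below the minimum
        ({"max_iter": -1}, ValueError),  # below the minimum
        ({"max_iter": 1.5}, TypeError),  # not an integer
        ({"max_iter": "100"}, TypeError),  # not a number
    ],
)
def test_toy_estimator_params_validation(params, err_type):
    """Invalid scalar parameters must raise in fit, not in __init__."""
    est = ToyEstimator(**params)  # construction must not raise
    with pytest.raises(err_type, match="max_iter"):
        est.fit([[0.0], [1.0]])
```

In a real PR, the test would import the actual estimator and match the full expected error message.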

## Consistency Checks for Reviewers
1. PR prefix should be `MAINT` (not `MNT`)
1. `check_scalar` call should explicitly include `name` (e.g., `name="n_estimators",`, not `"n_estimators",`)
1. Interval ranges should use the text `must be` (not `should be`)
1. Ensure error messages in tests are present

## Examples for Reference
- [x] `sklearn/cluster/_affinity_propagation.py`  (@glemaitre) [#20723](https://github.com/scikit-learn/scikit-learn/pull/20723) 
- [x] `sklearn/linear_model/_ridge.py`  (@ArturoAmorQ) [#21341](https://github.com/scikit-learn/scikit-learn/pull/21341) 

## Classes Updated
- [x] `sklearn/neighbors/_nca.py`
- [x] `sklearn/decomposition/_pca.py`
- [x] `sklearn/feature_extraction/text.py`  (@AlekLefebvre)  [#20752](https://github.com/scikit-learn/scikit-learn/pull/20752) 
- [x] `sklearn/preprocessing/_discretization.py`
- [x] `sklearn/cluster/_affinity_propagation.py`  (@glemaitre) [#20723](https://github.com/scikit-learn/scikit-learn/pull/20723) 
- [x] `sklearn/cluster/_birch.py`  (@SanjayMarreddi) [#20816](https://github.com/scikit-learn/scikit-learn/pull/20816) 
- [x] `sklearn/cluster/_dbscan.py`  (@SanjayMarreddi) [#20816](https://github.com/scikit-learn/scikit-learn/pull/20816) 
- [x] `sklearn/ensemble/_weight_boosting.py` (AdaBoostClassifier)  (@genvalen) [#21442](https://github.com/scikit-learn/scikit-learn/pull/21442) 
- [x] `sklearn/linear_model/_ridge.py`  (Ridge) (@ArturoAmorQ) [#21341](https://github.com/scikit-learn/scikit-learn/pull/21341) 
- [x] `sklearn/linear_model/_ridge.py`  (RidgeCV) (@ArturoAmorQ) [#21606](https://github.com/scikit-learn/scikit-learn/pull/21606) 
- [x] `sklearn/ensemble/_weight_boosting.py` (AdaBoostRegressor)  (@genvalen) [#21605](https://github.com/scikit-learn/scikit-learn/pull/21605) 
- [x] `sklearn/ensemble/_voting.py` (VotingClassifier, VotingRegressor) (@genvalen) [#22204](https://github.com/scikit-learn/scikit-learn/pull/22204)
- [x] `sklearn/linear_model/_glm/glm.py` (GeneralizedLinearRegressor) (@reshamas)  [#21946](https://github.com/scikit-learn/scikit-learn/pull/21946)
    - [x] `sklearn/linear_model/_glm/glm.py` (PoissonRegressor)  (@reshamas)
    - [x] `sklearn/linear_model/_glm/glm.py` (GammaRegressor)  (@reshamas)
    - [x] `sklearn/linear_model/_glm/glm.py` (TweedieRegressor)  (@reshamas)
- [x] `sklearn/tree/_classes.py` (BaseDecisionTree)   (@genvalen) [#21990](https://github.com/scikit-learn/scikit-learn/pull/21990)
- [x] `sklearn/cluster/_bicluster.py` (SpectralBiclustering)  (@creatornadiran) [#20817](https://github.com/scikit-learn/scikit-learn/pull/20817) 
- [x] `sklearn/cluster/_bicluster.py` (SpectralCoclustering)  (@creatornadiran) [#20817](https://github.com/scikit-learn/scikit-learn/pull/20817) 
- [x] `sklearn/cluster/_spectral.py` (SpectralClustering)  (@hvassard) [#21881](https://github.com/scikit-learn/scikit-learn/pull/21881) 
- [x] `sklearn/ensemble/_gb.py` (BaseGradientBoosting)   (@genvalen) [#21632](https://github.com/scikit-learn/scikit-learn/pull/21632)
- [x] `sklearn/linear_model/_coordinate_descent.py` (LassoCV) (@ArturoAmorQ)  [#22305](https://github.com/scikit-learn/scikit-learn/pull/22305)

## Classes to Update

- [ ] `sklearn/linear_model/_coordinate_descent.py` (Lasso) (@ArturoAmorQ)
- [ ] `sklearn/linear_model/_stochastic_gradient.py` (SGDClassifier) (@reshamas)  
    - add valid intervals: [#22115](https://github.com/scikit-learn/scikit-learn/pull/22115)
- [ ] `sklearn/linear_model/_bayes` (BayesianRidge) (@matiasrvazquez)  
- [ ] `sklearn/linear_model/_bayes` (ARDRegression) (@matiasrvazquez) 
- [ ] `sklearn/ensemble/_stacking.py` (StackingClassifier) (@genvalen)
- [ ] `sklearn/ensemble/_stacking.py` (StackingRegressor) (@genvalen)
