Test docstrings for parameters are equal #9388


Closed
jnothman opened this issue Jul 17, 2017 · 6 comments · Fixed by #28678

Comments

@jnothman
Member

We recently merged a test that checks consistency between parameters in function signatures and their docstrings using sklearn.utils.testing.check_docstring_parameters. I would like to have a function in sklearn.utils.testing which similarly makes use of numpydoc to check that parts of docstrings are identical among a set of objects. I would expect this helper to eventually be contributed back to numpydoc.

It might look something like:

def assert_consistent_docs(objects,
                           include_params=None, exclude_params=None,
                           include_attribs=None, exclude_attribs=None,
                           include_returns=None, exclude_returns=None):
    """Check that types and descriptions of parameters, etc. are identical
    across objects.

    ``objects`` may be either ``NumpyDocString`` instances or objects
    (classes, functions, descriptors) with docstrings that can be parsed
    by numpydoc.

    By default this asserts that any Parameters/Returns/Attributes
    entries having the same name among the ``objects`` docstrings also
    have the same type specification and description (ignoring
    whitespace).

    The ``include_*`` and ``exclude_*`` parameters are mutually
    exclusive, and specify a whitelist or blacklist, respectively, of
    parameter, attribute or return value names. Either may be '*' to
    include/exclude all names from that section.
    """
    ...  # implementation goes here

Then we could call it with:

assert_consistent_docs([sklearn.metrics.precision_recall_fscore_support,
                        sklearn.metrics.precision_score,
                        sklearn.metrics.recall_score,
                        sklearn.metrics.f1_score,
                        sklearn.metrics.fbeta_score],
                       exclude_returns='*')

This will ensure (by making a test fail when the condition is not met) that all these related scoring functions have identical parameter descriptions (whitespace excepted) wherever they have identical parameters. (I've not actually checked whether this is or should be true of all these metric functions.) Most importantly, having such an assertion means we can rest assured that when we change the documentation of some parameter or return value, we will be forced to do so consistently.
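A minimal sketch of the comparison step, assuming the docstring sections have already been parsed into ``entry name -> (type, description)`` mappings. The helper name and input shape here are hypothetical simplifications; the proposed function would obtain these entries by parsing docstrings with numpydoc:

```python
import re


def _normalize(text):
    # Collapse whitespace so that only wording differences are flagged.
    return re.sub(r"\s+", " ", text).strip()


def assert_consistent_entries(parsed_objects, section="Parameters"):
    """Assert that same-named entries share a type spec and description.

    ``parsed_objects`` maps an object name to a dict of
    ``entry name -> (type specification, description)`` for one
    docstring section.
    """
    seen = {}  # entry name -> (first owner, normalized (type, desc))
    for owner, entries in parsed_objects.items():
        for name, (type_spec, desc) in entries.items():
            norm = (_normalize(type_spec), _normalize(desc))
            if name in seen:
                first_owner, first_norm = seen[name]
                if norm != first_norm:
                    raise AssertionError(
                        "%s entry %r differs between %s and %s"
                        % (section, name, first_owner, owner))
            else:
                seen[name] = (owner, norm)
```

With this, entries that match up to whitespace pass silently, and any divergence in a shared name's type or description fails the test with the two offending objects named.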

@amanp10
Contributor

amanp10 commented Dec 6, 2017

I would like to work on this. Will try to get back with a PR in a few days.

@jnothman
Member Author

jnothman commented Dec 6, 2017

Go for it! No promises it's easy

@amanp10
Contributor

amanp10 commented Dec 13, 2017

I need a clarification: what if include_params lists a few names and exclude_params is set to '*', or vice versa? Which takes priority?

Also, include_params defaults to None; does that mean we don't consider any parameter? exclude_params is None at the same time, which seems conflicting. Maybe we should change the default of either include or exclude.

@jnothman
Member Author

jnothman commented Dec 13, 2017 via email

@amanp10
Contributor

amanp10 commented Dec 13, 2017

Regarding my second question: by default, include_params=None would mean no parameters are selected. Shouldn't it be include_params='*'?

@jnothman
Member Author

I don't think so, as we want to force the user to be explicit about either inclusion or exclusion.
