Inappropriate Language in scikit-learn: Boston house prices dataset Description #19657

@dringel

Dear scikit-learn community,

I've been using scikit-learn and the Boston housing prices dataset in my data science for business course. I recognize that this is a very old dataset (1978). However, I believe the description deserves an update because it uses inappropriate language that can be hurtful.

Specifically, when printing the description with print(boston.DESCR), one of the attributes (B) is described as follows:

  • B 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town

I suggest correcting the term "blacks" to "black people" or "the black population" in the dataset description.

Thank you!
daniel
