-
-
Notifications
You must be signed in to change notification settings - Fork 25.9k
Label docstrings with versionadded / versionchanged #15426
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
@rth, please feel free to add a Sprint tag if you think this is suitable... Btw, these classes appear to lack a versionadded... not sure though that we want to add to all of them (and I've not filtered private classes out): ARDRegression, AdaBoostClassifier, AdaBoostRegressor, AdamOptimizer, AdditiveChi2Sampler, AffinityPropagation, AgglomerationTransform, BaggingRegressor, BaseBagging, BaseCrossValidator, BaseDecisionTree, BaseDiscreteNB, BaseEnsemble, BaseForest, BaseGradientBoosting, BaseHistGradientBoosting, BaseLabelPropagation, BaseLibSVM, BaseLoss, BaseMixture, BaseNB, BaseRandomProjection, BaseSGD, BaseSVC, BaseSearchCV, BaseShuffleSplit, BaseSpectral, BaseWeightBoosting, BernoulliNB, BernoulliRBM, Binarizer, BinaryCrossEntropy, BinomialDeviance, BinomialDeviance, Birch, Bunch, CCA, CalibratedClassifierCV, CategoricalCrossEntropy, CategoricalNB, ChangedBehaviorWarning, CheckingClassifier, ClassificationLossFunction, ClassificationLossFunction, ClassifierChain, ComplementNB, ConvergenceWarning, CountVectorizer, DataConversionWarning, DataDimensionalityWarning, DictVectorizer, DummyRegressor, ElasticNet, ElasticNetCV, EllipticEnvelope, EmpiricalCovariance, ExponentialLoss, ExponentialLoss, FactorAnalysis, FastICA, FeatureHasher, FeatureUnion, FitFailedWarning, ForestClassifier, ForestRegressor, GaussianNB, GaussianRandomProjection, GenericUnivariateSelect, GraphicalLasso, GraphicalLassoCV, GridSearchCV, GroupKFold, HistGradientBoostingClassifier, HistGradientBoostingRegressor, HuberLossFunction, HuberLossFunction, IncrementalPCA, IsotonicRegression, IterativeImputer, KBinsDiscretizer, KFold, KMeans, KNeighborsClassifier, KNeighborsRegressor, KernelCenterer, KernelDensity, KernelRidge, KeyValTuple, KeyValTupleParam, LabelBinarizer, LabelEncoder, LabelPropagation, LabelSpreading, Lars, LarsCV, Lasso, LassoCV, LassoLars, LassoLarsCV, LassoLarsIC, LeastAbsoluteDeviation, LeastAbsoluteError, LeastAbsoluteError, LeastSquares, LeastSquaresError, LeastSquaresError, LeaveOneGroupOut, LeaveOneOut, LeavePGroupsOut, LeavePOut, LedoitWolf, LinearClassifierMixin, LinearModel, LinearModelCV, LinearRegression, LinearSVC, LinearSVR, LocalOutlierFactor, LocallyLinearEmbedding, LossFunction, LossFunction, MDS, MinCovDet, MiniBatchKMeans, MissingIndicator, Module_six_moves_urllib, Module_six_moves_urllib_error, Module_six_moves_urllib_parse, Module_six_moves_urllib_request, Module_six_moves_urllib_response, Module_six_moves_urllib_robotparser, MultiLabelBinarizer, MultiOutputClassifier, MultiOutputRegressor, MultiTaskElasticNet, MultiTaskElasticNetCV, MultiTaskLasso, MultiTaskLassoCV, MultinomialDeviance, MultinomialDeviance, MultinomialNB, NearestCentroid, NearestNeighbors, NeighborhoodComponentsAnalysis, NeighborsBase, NoSampleWeightWrapper, NonBLASDotWarning, Normalizer, NotFittedError, NuSVR, Nystroem, OAS, OPTICS, OneClassSVM, OneHotEncoder, OneVsOneClassifier, OneVsRestClassifier, OrdinalEncoder, OrthogonalMatchingPursuit, OrthogonalMatchingPursuitCV, OutputCodeClassifier, PLSCanonical, PLSRegression, PLSSVD, PatchExtractor, Pipeline, PowerTransformer, PredefinedSplit, QuantileLossFunction, QuantileLossFunction, QuantileTransformer, RBFSampler, RFE, RFECV, RadiusNeighborsClassifier, RadiusNeighborsRegressor, RandomizedSearchCV, RegressionLossFunction, RegressionLossFunction, RegressorChain, RepeatedKFold, RepeatedStratifiedKFold, RidgeCV, RidgeClassifierCV, SGDOptimizer, SVR, ScaledLogOddsEstimator, SelectFdr, SelectFpr, SelectFwe, SelectKBest, SelectPercentile, SelectorMixin, ShrunkCovariance, ShuffleSplit, SkewedChi2Sampler, SkipTestWarning, SparseCodingMixin, SparseRandomProjection, SpectralBiclustering, SpectralClustering, SpectralCoclustering, SpectralEmbedding, StratifiedKFold, StratifiedShuffleSplit, TfidfTransformer, TfidfVectorizer, TheilSenRegressor, TimeSeriesSplit, TransformedTargetRegressor, TruncatedSVD, UndefinedMetricWarning, VarianceThreshold |
Thanks @jnothman ! |
I'll work on AdaBoostClassifier, AdaBoostRegressor. |
LIST OF STUFF I SUBMITTED PULL REQUESTS FOR: Ongoing Progress: |
@lotusea worked on KernelRidge, AgglomerativeClustering |
@geoninja is working on 'OrdinalEncoder' and 'OneHotEncoder' and LabelEncoder' |
@Drajan and I are working on PLSCanonical, PLSRegression, PLSSVD, PatchExtractor, Pipeline, PowerTransformer, PredefinedSplit. |
Going to work on BaggingRegressor, BaseBagging, BaseCrossValidator, BaseDecisionTree, BaseDiscreteNB, BaseEnsemble, BaseForest, BaseGradientBoosting, BaseHistGradientBoosting, BaseLabelPropagation, BaseLibSVM, BaseLoss, BaseMixture, BaseNB, BaseRandomProjection, BaseSGD, BaseSVC, BaseSearchCV, BaseShuffleSplit, BaseSpectral, BaseWeightBoosting. |
…n, PLSSVD, PatchExtractor, Pipeline, PowerTransformer, PredefinedSplit (Issue scikit-learn#15426)
Please see ongoing list of items which versioning is being added during the WiMLDS sprint: I think we should merge all our changes into one branch and submit a single PR to the reviewers from the account of someone who can commit to addressing any comments the reviewers have. |
Claiming MultinomialNB |
Working on DataConversionWarning, DataDimensionalityWarning, DictVectorizer , DummyRegressor |
LocalOutlierFactor |
But we should label one version at a time rather than one module at a time,
I now reckon
|
@cmarmo I updated the list on the top. |
Thanks! |
Could @Schindst and I work on the LinearClassifierMixin, LinearModel, LinearRegression? |
I think it is better to work version (perhaps split by subpackage) by
version than class by class.
|
ok, we'll take v0.21 - with @Schindst |
We will work on v0.20.0 with @ellenkoenig |
We (@shravaniCD and @hhnnhh and @brigitteunger ) are going to take care of those: Was already fixed here: natashaborders/scikit-learn@3ca653a |
As a reminder, people who take on this issue, please take on a version (the changelog for v0.18 for instance), and tackle everything mentioned there, instead of working on classes or files or modules. This makes the review process much easier. |
We (@shravaniCD and @hhnnhh and @brigitteunger) are going to pick v0.18 |
I'm starting to work on v0.19 |
Towards scikit-learn#15426 @adrinjalali #wimlds #scikitlearnsprint
@jnothman or @adrinjalali do you know which versions still need documentation or are all the major ones done/in progress? I can start another one or pick one up but I’m not sure how far back it makes sense to go and the original list isn’t up-to-date (or relevant?!) anymore.. |
I think we've covered most of the relevant ones. I personally wouldn't worry about anything older than 0.18 |
* added v0.19.1 and wip v0.19 * finished adding vchanged strings for v0.19 Towards #15426 @adrinjalali #wimlds #scikitlearnsprint * fixing linter issues * caught line issues with flake8 * caught the last line issue * added lines and cleaned gtiignore * Update sklearn/multiclass.py Co-Authored-By: Thomas J Fan <[email protected]> * Update sklearn/multiclass.py Co-Authored-By: Thomas J Fan <[email protected]> Co-authored-by: Thomas J Fan <[email protected]>
The issue has been addressed until version 0.18. I'm closing it. Feel free to open a new one for specific sklearn features. |
* added v0.19.1 and wip v0.19 * finished adding vchanged strings for v0.19 Towards scikit-learn#15426 @adrinjalali #wimlds #scikitlearnsprint * fixing linter issues * caught line issues with flake8 * caught the last line issue * added lines and cleaned gtiignore * Update sklearn/multiclass.py Co-Authored-By: Thomas J Fan <[email protected]> * Update sklearn/multiclass.py Co-Authored-By: Thomas J Fan <[email protected]> Co-authored-by: Thomas J Fan <[email protected]>
Uh oh!
There was an error while loading. Please reload this page.
We should be using Sphinx's versionadded and versionchanged directives to indicate when classes, functions, methods and parameters are added or modified in their semantics. We are not very good at ensuring this labelling is done.
Towards the upcoming release of v0.22, it would be useful if some contributors (or sprinters) scoured the change log and identified whether any added/changed parameters/classes needed a
versionadded
orversionchanged
label, and to produce pull requests adding them when otherwise omitted.I also note past omissions, for example KBinsDiscretizer and IterativeImputer do not mention their versionadded, while ColumnTransformer does. Recording for each estimator when it was first released would be helpful for users when looking at the documentation.
Suggested procedure
Check the changelog in doc/whats_new/ one what's a new file at a time.
**Make a single PR for an estimator and a specific version. Do not include multiple estimators and multiple versions at once (it makes it difficult to review).
Focus on Public API: if a method has been made private, check the tickbox and move to another.
should be included only for new estimators/parameters
should be included when the default value of a parameter change.
Classes lacking label
The list may be incomplete.
The text was updated successfully, but these errors were encountered: