v2.3.1 -- Better handling of some edge-cases
This minor release primarily just improves the user experience when encountering various edge-cases in:
- find_label_issues method
- find_overlapping_issues method
- cleanlab.multiannotator module
This release is non-breaking when upgrading from v2.3.0. Two noteworthy updates in the cleanlab.multiannotator module include a:
- better tie-breaking algorithm inside of
get_majority_vote_label()to avoid diminishing the frequency of rarer classes (this only plays a role whenpred_probsare not provided). - better user-experience for
get_active_learning_scores()to support scoring only unlabeled data or only labeled data. More of the arguments can now beNone.
What's Changed
- Readme updates by @jwmueller in #645, #650, #656
- describe activelab in the documentation by @jwmueller in #648
- Added clipping to address issue #639 by @ulya-tkch in #647
- Fix for not specifying labels in find_overlapping_issues by @huiwengoh in #652
- Bug fixes + improvements to multiannotator module by @huiwengoh in #654
- FAQ question/answer on handling label errors in train vs test data by @jwmueller in #655
Full Changelog: v2.3.0...v2.3.1