-
Notifications
You must be signed in to change notification settings - Fork 553
Add BRIGHT SPLADEv3 cached regressions and fix ONNX #2928
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
lintool
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor nits - please fix averages, per section + overall.
| | Robotics | 0.1091 | 0.1578 | | ||
| | Stack Overflow | 0.1626 | 0.1290 | | ||
| | Sustainable Living | 0.1613 | 0.1497 | | ||
| | **StackExchange** average | **0.1705** | **0.1704** | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The averages change also, right?
| queryTokens.add(CLS); | ||
| queryTokens.addAll(tokenizer.tokenize(query)); | ||
| if (queryTokens.size() > MAX_SEQ_LEN - 2) { | ||
| queryTokens = queryTokens.subList(0, MAX_SEQ_LEN - 2); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
in terms of code alignment, imo a ternary operator, i.e., x = y ? a : b; would look prettier, but don't feel too strongly.
Co-authored-by: Jimmy Lin <[email protected]>
…nto bright-splade
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## master #2928 +/- ##
============================================
+ Coverage 71.82% 71.83% +0.01%
Complexity 1321 1321
============================================
Files 186 186
Lines 12280 12294 +14
Branches 1413 1414 +1
============================================
+ Hits 8820 8832 +12
- Misses 2932 2933 +1
- Partials 528 529 +1 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
For the record, I reran all SPLADE-v3 conditions for BEIR, did not appear to have any score changes. Finding all the conditions: $ grep splade-v3 src/main/python/regressions-batch04.txt | grep beirRunning: |
Bug in Anserini's SPLADE encoder logic that showed up from BRIGHT's really long queries: queries longer than the max sequence length of 512 have the CLS ending token truncated, resulting in the model going haywire. This is fixed in this PR, but we should probably rerun everything SPLADE ONNX that might have queries that fell in this e.g. ArguAna.
Fixing this resulted in some updates to BRIGHT SPLADEv3 ONNX numbers, updated here. Also fixed a typo in the regression docs.
Added the entire regression pipeline for BRIGHT SPLADEv3 cached, including topic bindings.