|
3 | 3 | poetry install
|
4 | 4 | <!-- poetry run pip install --upgrade --force-reinstall --no-deps "apache-beam[gcp]" "multiprocess==0.70.14" -->
|
5 | 5 | poetry run pip install --upgrade --force-reinstall --no-deps "apache-beam[gcp]" "multiprocess==0.70.14" "dill==0.3.1.1"
|
| 6 | + |
| 7 | + |
| 8 | + |
| 9 | +| base models | batch size | lr | JSICK (val) | JSICK (test) | JSTS (train) | JSTS (val) | |
| 10 | +| ------------------------------------------------------------------------------------------------------------------------- | :--------: | :---: | :---------: | :----------: | :----------: | :--------: | |
| 11 | +| [cl-tohoku/bert-base-japanese-v2](https://huggingface.co/cl-tohoku/bert-base-japanese-v2) | | | | | | | |
| 12 | +| [cl-tohoku/bert-base-japanese-char-v2](https://huggingface.co/cl-tohoku/bert-base-japanese-char-v2) | | | | | | | |
| 13 | +| [cl-tohoku/bert-base-japanese](https://huggingface.co/cl-tohoku/bert-base-japanese) | | | | | | | |
| 14 | +| [cl-tohoku/bert-base-japanese-whole-word-masking](https://huggingface.co/cl-tohoku/bert-base-japanese-whole-word-masking) | | | | | | | |
| 15 | +| [cl-tohoku/bert-base-japanese-char](https://huggingface.co/cl-tohoku/bert-base-japanese-char) | | | | | | | |
| 16 | +| [ku-nlp/roberta-base-japanese-char-wwm](https://huggingface.co/ku-nlp/roberta-base-japanese-char-wwm) | | | | | | | |
| 17 | +| [studio-ousia/luke-japanese-base-lite](https://huggingface.co/studio-ousia/luke-japanese-base-lite) | | | | | | | |
| 18 | +| | | | | | | | |
| 19 | +| [ku-nlp/deberta-v2-base-japanese](https://huggingface.co/ku-nlp/deberta-v2-base-japanese) | | | | | | | |
| 20 | +| [nlp-waseda/roberta-base-japanese](https://huggingface.co/nlp-waseda/roberta-base-japanese) | | | | | | | |
| 21 | +| [megagonlabs/roberta-long-japanese](https://huggingface.co/megagonlabs/roberta-long-japanese) | | | | | | | |
| 22 | +| | | | | | | | |
| 23 | +| [bert-base-multilingual-cased](https://huggingface.co/bert-base-multilingual-cased) | | | | | | | |
| 24 | +| [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) | | | | | | | |
| 25 | +| [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) | | | | | | | |
| 26 | +| [studio-ousia/mluke-base-lite](https://huggingface.co/studio-ousia/mluke-base-lite) | | | | | | | |
| 27 | + |
| 28 | + |
| 29 | + |
| 30 | +| large models | batch size | lr | JSICK (val) | JSICK (test) | JSTS (train) | JSTS (val) | |
| 31 | +| ------------------------------------------------------------------------------------------------------- | :--------: | :---: | :---------: | :----------: | :----------: | :--------: | |
| 32 | +| [cl-tohoku/bert-large-japanese](https://huggingface.co/cl-tohoku/bert-large-japanese) | | | | | | | |
| 33 | +| [ku-nlp/roberta-large-japanese-char-wwm](https://huggingface.co/ku-nlp/roberta-large-japanese-char-wwm) | | | | | | | |
| 34 | +| [studio-ousia/luke-japanese-large-lite](https://huggingface.co/studio-ousia/luke-japanese-large-lite) | | | | | | | |
| 35 | +| | | | | | | | |
| 36 | +| [nlp-waseda/roberta-large-japanese](https://huggingface.co/nlp-waseda/roberta-large-japanese) | | | | | | | |
| 37 | +| [ku-nlp/deberta-v2-large-japanese](https://huggingface.co/ku-nlp/deberta-v2-large-japanese) | | | | | | | |
| 38 | +| | | | | | | | |
| 39 | +| [xlm-roberta-large](https://huggingface.co/xlm-roberta-large) | | | | | | | |
| 40 | +| [studio-ousia/mluke-large-lite](https://huggingface.co/studio-ousia/mluke-large-lite) | | | | | | | |
0 commit comments