Stars
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
A book series (2 published editions) on the JS language.
Experiments in positive-unlabeled learning
Japanese morphological analysis engine written in pure Python
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
Distributed transactional key-value database, originally created to complement TiDB
Gradually-Warmup Learning Rate Scheduler for PyTorch
消息推送平台🔥 推送下发【邮件】【短信】【微信服务号】【微信小程序】【企业微信】【钉钉】等消息类型。
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
UTF-8 re-encoded dictionary CSVs of MeCab's ipadic dictionary, needed for recompiling with custom utf8-encoded CSVs
Neologism dictionary based on the language resources on the Web for mecab-ipadic
BERT with SentencePiece for Japanese text.
Basic text analytics and natural language processing in Python
Python implementation of stacked generalization classifier. Plays nice with sklearn.
A Benchmarking Study of Embedding-based Entity Alignment for Knowledge Graphs, VLDB 2020
A sample web application built on MyBatis 3, Spring Boot and Thymeleaf 3.
Curated list of project-based tutorials