- Seika-cho, Kyoto, Japan
- https://sites.google.com/view/shigashiyama
Stars
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.
- BLEURT is a metric for Natural Language Generation based on transfer learning.
- A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.
- Transcription texts created on Minna de Honkoku (https://honkoku.org), a crowdsourced transcription platform for historical Japanese documents.
- This repository includes scripts for MQM error analysis, along with annotation results for English-to-Chinese translation.
- 🔥 Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation 🔥. Our toolkit integrates 40 pre-retrieved benchmark datasets and supports 7+ retrieval techniques.
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.
- Official inference framework for 1-bit LLMs
- Materials and source code for the NLP2025 tutorial "A Practical Introduction to Geographic Information and Language Processing"
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)
- NLP2024 Tutorial 3, "Learning by Building a Japanese Large Language Model": environment setup instructions and experimental source code
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch
- ReFinED is an efficient and accurate entity linking (EL) system.
- Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?
- SikuBERT: a pre-trained language model for the Siku Quanshu (Complete Library in Four Sections)
- MultiLexNorm 2021 competition system from ÚFAL
- Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
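
As a note on the last entry: word error rate is the word-level edit distance between a reference transcript and a hypothesis, normalized by the number of reference words. A minimal sketch in plain Python (the function name and example strings are illustrative, not taken from the repository itself):

```python
# Minimal WER sketch: word-level Levenshtein distance divided by the
# number of words in the reference (WER = (S + D + I) / N).

def wer(reference: str, hypothesis: str) -> float:
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:  # avoid division by zero on an empty reference
        return float(bool(hyp))
    # dp[i][j] = edit distance between ref[:i] and hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i  # delete all remaining reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j  # insert all remaining hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,         # deletion
                dp[i][j - 1] + 1,         # insertion
                dp[i - 1][j - 1] + cost,  # substitution or match
            )
    return dp[len(ref)][len(hyp)] / len(ref)

print(wer("the cat sat on the mat", "the cat sat on mat"))  # 1 deletion / 6 words ≈ 0.167
```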