Instructions

Converts wiktionary data from https://kaikki.org/ to yomitan-compatible dictionaries. Converted dictionaries can be found in the Releases section.

Instructions

(examples use German (de) to English (en))

Basic Run

Create a .env file based on .env.example.
If your language is not in languages.json, add it.
Run ./auto.sh German English.
Dictionaries should be in data/language/de/en.

Contributing

The auto.sh script can also be run with flags:

k: keep files (by default, the script deletes the downloaded files after running),
d: redownload (by default, the script skips downloading if the file already exists),
t: force_tidy (run tidy script again, even if its output already exists. useful when the tidy script is updated),
y: force_ymt (run yomitan script again, even if its output already exists. useful when the yomitan script is updated),
F: force = force_tidy + force_ymt,
S: run for all source languages (./auto.sh German English S is like ./auto.sh * English),
T: run for all target languages (./auto.sh German English T is like ./auto.sh German *).

Most often, you will want to run ./auto.sh German English kty to recreate the dictionaries, then load them in yomitan and test them.

After a run, data/language/de/en should contain files with skipped tags for IPA and terms. Adding some to tag_bank_ipa.json or tag_bank_term.json is an easy way to improve the conversion for your language pair.

Tests

Test inputs are in data/test/kaikki. Each line is a line from the corresponding kaikki file (from data/kaikki, after downloading).

To fix something in the conversion of a word, add its line from data/kaikki to the corresponding test file in data/test/kaikki. Then run npm run test-write to add it to the expected test output, and commit the changes (e.g. add baseline test for "word"). Now when you modify tidy-up or make-yomitan, you can run npm run test-write to see the changes you made.

If you are making a change that shouldn't change the output, just run npm run test to check if anything broke.

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.github/workflows		.github/workflows
data		data
tools		tools
util		util
.env.example		.env.example
.gitignore		.gitignore
1-create-folders.js		1-create-folders.js
2-extract-language.py		2-extract-language.py
3-tidy-up.js		3-tidy-up.js
3-tidy-up.test.js		3-tidy-up.test.js
4-make-yomitan.js		4-make-yomitan.js
4-make-yomitan.test.js		4-make-yomitan.test.js
README.md		README.md
auto.sh		auto.sh
languages.json		languages.json
merge-ipa.js		merge-ipa.js
package-lock.json		package-lock.json
package.json		package.json
write-tests.js		write-tests.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Instructions

Basic Run

Contributing

Tests

About

Uh oh!

Releases

Packages

Languages

martholomew/kaikki-to-yomitan

Folders and files

Latest commit

History

Repository files navigation

Instructions

Basic Run

Contributing

Tests

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages