ZNOB

ZNOB is a multimodal benchmark measuring frontier LLMs' capabilities in passing Ukrainian national exams.

🚀 Installation

conda create -n znob python=3.9 -y # up to 3.13
conda activate znob
git clone https://github.com/Inc44/ZNOB.git
cd ZNOB
pip install -r requirements.txt

🧾 Configuration

Set environment variable:

setx /M OPENROUTER_API_KEY your_api_key

For Linux/macOS:

echo 'export OPENROUTER_API_KEY="your_api_key"' >> ~/.bashrc # or ~/.zshrc

Or create a .env file or modify /etc/environment:

OPENROUTER_API_KEY=your_api_key

Check by restarting the terminal and using:

echo %OPENROUTER_API_KEY%

For Linux/macOS:

echo $OPENROUTER_API_KEY

📖 Usage Examples

Prepare Dataset

python -m znob.cli -d your_zno_dataset -u your_zno_source

Test LLM

python -m znob.cli -d your_zno_dataset --model google/gemini-2.5-flash

Reset Outputs

python -m znob.cli -d your_zno_dataset -r responses,combined_responses,summary # or questions or all

🎨 Command-Line Arguments

Argument	Description
`-u, --url <url>`	Dataset source.
`-d, --dataset <path>`	Dataset to test.
`-m, --model <name>`	AI model to test.
`-r, --reset <name>`	Reset outputs.
`--no-text`	Send only image, no text.
`--no-image`	Send only text, no image.
`--necessary-image-only`	Send image only if necessary.

🎯 Motivation

LLMs have made significant progress since November 2024, so I decided to measure their progress and also verify the claims of the Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains research paper.

🐛 Bugs

Not yet found.

⛔ Known Limitations

Not yet known.

🚧 TODO

Not yet planned.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
znob		znob
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ZNOB

🚀 Installation

🧾 Configuration

📖 Usage Examples

Prepare Dataset

Test LLM

Reset Outputs

🎨 Command-Line Arguments

🎯 Motivation

🐛 Bugs

⛔ Known Limitations

🚧 TODO

About

Uh oh!

Languages

Inc44/ZNOB

Folders and files

Latest commit

History

Repository files navigation

ZNOB

🚀 Installation

🧾 Configuration

📖 Usage Examples

Prepare Dataset

Test LLM

Reset Outputs

🎨 Command-Line Arguments

🎯 Motivation

🐛 Bugs

⛔ Known Limitations

🚧 TODO

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages