Thanks to visit codestin.com
Credit goes to github.com

Skip to content
/ ZNOB Public

ZNOB is a multimodal benchmark measuring frontier LLMs' capabilities in passing Ukrainian national exams.

Notifications You must be signed in to change notification settings

Inc44/ZNOB

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

15 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

ZNOB

ZNOB is a multimodal benchmark measuring frontier LLMs' capabilities in passing Ukrainian national exams.

πŸš€ Installation

conda create -n znob python=3.9 -y # up to 3.13
conda activate znob
git clone https://github.com/Inc44/ZNOB.git
cd ZNOB
pip install -r requirements.txt

🧾 Configuration

Set environment variable:

setx /M OPENROUTER_API_KEY your_api_key

For Linux/macOS:

echo 'export OPENROUTER_API_KEY="your_api_key"' >> ~/.bashrc # or ~/.zshrc

Or create a .env file or modify /etc/environment:

OPENROUTER_API_KEY=your_api_key

Check by restarting the terminal and using:

echo %OPENROUTER_API_KEY%

For Linux/macOS:

echo $OPENROUTER_API_KEY

πŸ“– Usage Examples

Prepare Dataset

python -m znob.cli -d your_zno_dataset -u your_zno_source

Test LLM

python -m znob.cli -d your_zno_dataset --model google/gemini-2.5-flash

Reset Outputs

python -m znob.cli -d your_zno_dataset -r responses,combined_responses,summary # or questions or all

🎨 Command-Line Arguments

Argument Description
-u, --url <url> Dataset source.
-d, --dataset <path> Dataset to test.
-m, --model <name> AI model to test.
-r, --reset <name> Reset outputs.
--no-text Send only image, no text.
--no-image Send only text, no image.
--necessary-image-only Send image only if necessary.

🎯 Motivation

LLMs have made significant progress since November 2024, so I decided to measure their progress and also verify the claims of the Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains research paper.

πŸ› Bugs

Not yet found.

β›” Known Limitations

Not yet known.

🚧 TODO

Not yet planned.

About

ZNOB is a multimodal benchmark measuring frontier LLMs' capabilities in passing Ukrainian national exams.

Topics

Resources

Stars

Watchers

Forks

Languages