Package requirements
- The notebook was created and executed entirely in Google Colab on a T4 GPU, so you can run the pip installs directly from the notebook's own cells. To run it locally instead, use the included requirements.txt: `pip install -r requirements.txt` installs the necessary libraries.
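For reference, a minimal sketch of both install routes (assuming requirements.txt sits in your working directory):

```python
# In a Colab/Jupyter cell, the %pip magic installs into the running kernel:
%pip install -r requirements.txt

# Locally, run the equivalent in a terminal instead:
#   pip install -r requirements.txt
```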
Configuration
- The notebook can be executed sequentially. I'd recommend creating a dedicated folder for this notebook. After you mount Google Drive, copy the folder's path, append /models, and paste the result into the "model_dir" variable in section 4.2 so that the checkpoint files of System A and System B can be saved there. The same applies when running locally; see the sketch below.
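A minimal sketch of that setup on the Colab route (the folder name `your_project_folder` is just an example, not the notebook's actual value):

```python
from google.colab import drive

# Mount Google Drive so the notebook can read and write your dedicated folder.
drive.mount('/content/drive')

# Section 4.2: point model_dir at your folder with /models appended, so the
# System A and System B checkpoints are saved there.
model_dir = '/content/drive/MyDrive/your_project_folder/models'
```

Running locally, skip the mount and set model_dir to a local path ending in /models instead.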
Running from checkpoint
- To resume from a checkpoint, pass the path of the checkpoint folder as a parameter to the trainer() function and it will resume from that checkpoint; the comments in the notebook walk you through this. Note that you must execute the tokenizer_A/B and model_A/B cells before you call trainer(CHECKPOINT_PATH).
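A hedged sketch of the resume flow (the checkpoint path below is illustrative; trainer, tokenizer_A/B, and model_A/B are the notebook's own names):

```python
# Run the notebook cells that define tokenizer_A, model_A, tokenizer_B, and
# model_B first -- trainer() expects them to already exist in scope.

# Path to a previously saved checkpoint folder (example path, adjust to yours).
CHECKPOINT_PATH = '/content/drive/MyDrive/your_project_folder/models/checkpoint-500'

# Passing the path makes training resume from that checkpoint
# instead of starting fresh.
trainer(CHECKPOINT_PATH)
```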