init release #1

MHashemzadeh · 2024-07-19T05:27:58Z

This pull request releases the code for our paper "SUB-GOAL DISTILLATION: A METHOD TO IMPROVE SMALL LANGUAGE AGENTS", accepted at CoLLAs 2024. The codebase modifies and extends the existing SwiftSage repository.

data/data_convert.py

data/data_utils.py

requirement.txt

train/README.md

train/run.sh

hnekoeiq

I added some high-level comments. I will take another look once they are addressed and then approve. Great job!

hnekoeiq · 2024-07-24T20:53:18Z

README.md

+
+## 2- Train models
+
+Codes for train models are in `/train`. Three models which are small LM are required to fine-tune: 1- action generator (executor), 2- sub-goal generator (contoroller), 3- first sub-goal generator. 


It's a minor grammatical error, but this should read "training the models" and "required to be fine-tuned".
Overall, passing the text through ChatGPT or Grammarly could be a good idea.

hnekoeiq · 2024-07-24T20:54:38Z

README.md

+
+# Acknowledgements
+
+We thank SwiftSage implementation, which this repo is based upon.


You probably wanted to say "We thank the authors of the SwiftSage repository".

hnekoeiq · 2024-07-24T20:56:52Z

data/data_convert.py

+task_id_to_actions = {}
+task_id = args.task_id
+
+for i in range(1):    


Why is there a for loop here if it is only range(1)?

At some point we wanted to iterate here, which is why the loop was there. I have removed it.

hnekoeiq · 2024-07-24T21:03:15Z

data/data_utils.py

+
+
+def downsampling(task_idx_real, curr_task_seq):
+    # Downsampling Task 26 and 29


This part of the code may have been written by others, but docstrings and comments should be added for all nontrivial functions. For example, why is downsampling done only for tasks 26 and 29?

Because only these two tasks have a large number of variations. In ScienceWorld, the number of variations differs from task to task, so this is fine.
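For illustration, the rationale above could be recorded in a docstring along these lines. This is only a minimal sketch: the function name `downsample_variations`, the `max_variations` cap, and the `seed` parameter are assumptions made for the example and are not taken from `data/data_utils.py`.

```python
import random

# Task IDs named in the review thread; everything else here is hypothetical.
DOWNSAMPLED_TASKS = {26, 29}


def downsample_variations(task_idx, variations, max_variations=50, seed=0):
    """Cap the number of variations for tasks that have far more than the rest.

    Only tasks in DOWNSAMPLED_TASKS are reduced, because they have many more
    variations than the other tasks. All other tasks are returned unchanged.
    The cap of 50 is an illustrative value, not the one used in the paper.
    """
    variations = list(variations)
    if task_idx not in DOWNSAMPLED_TASKS or len(variations) <= max_variations:
        return variations
    # A seeded generator keeps the selected subset reproducible across runs.
    rng = random.Random(seed)
    return sorted(rng.sample(variations, max_variations))
```

A docstring like this answers the "why only 26 and 29" question at the point where a reader will look for it.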

hnekoeiq · 2024-07-24T21:07:01Z

subgoals/sg_generating.py

+
+
+    return steps_cp, subgoals_list_insteps
+


I recommend removing the extra blank lines.

hnekoeiq · 2024-07-24T21:07:57Z

subgoals/sg_generating.py

+    return actions_list, subgoal_action_index_list
+
+def check_action_sg(actions_list, gold_path, sg_act_idx_list, task_desc, chatgpt_answer):
+    print(f'len goal path {len(gold_path)} ---- len action seq {len(actions_list)}')


Again, adding a short docstring would be helpful.

I have added them.

hnekoeiq · 2024-07-24T21:12:09Z

evaluation/agent.py

+
+
+def creat_semi_random_sg(first_subgoal, PossibleObjects, locations):
+    ### this function chnage the objects or locations in the subgoals to generates semi-random subgoals.
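As a rough illustration of the idea described in that comment, a semi-random sub-goal can be built by swapping any known object or location word for a random alternative. The sketch below is an assumption about the general technique, not the implementation in `evaluation/agent.py`; the name `make_semi_random_subgoal` and the word-level matching are hypothetical.

```python
import random


def make_semi_random_subgoal(subgoal, possible_objects, locations, rng=None):
    """Replace object and location words in `subgoal` with random alternatives.

    Words that are neither a known object nor a known location are kept, so
    the sentence structure of the original sub-goal is preserved.
    """
    rng = rng or random.Random()
    words = []
    for word in subgoal.split():
        if word in possible_objects:
            words.append(rng.choice(possible_objects))
        elif word in locations:
            words.append(rng.choice(locations))
        else:
            words.append(word)
    return " ".join(words)
```

Passing a seeded `random.Random` as `rng` makes the perturbation reproducible, which helps when comparing evaluation runs.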


hnekoeiq

LGTM.

maryam.hashemzadeh added 4 commits July 18, 2024 22:18

add files

65fa614

add readme

59b7fea

add all readme

8b8f5ce

add all main readme

eefeafb

MHashemzadeh requested review from Megh-Thakkar and hnekoeiq July 19, 2024 19:54