feat: add SaveStrategy to allow flexibility in saving localized evaluation outputs #281

keerthanvasist · 2024-05-28T00:43:39Z

Add SaveStrategy to allow flexibility in saving localized evaluation outputs

Description of changes:
Introduces a new class SaveStrategy that allows users define their own saving strategy for localized evaluation outputs. Due to the distributed nature of the computations. If the dataset is large, and all of the data is pulled to the head node, it might lead to OOM errors. In order to avoid that, the data is pulled in batches, and save function is called on each batch at a time. In order to allow this mechanism, while allowing more flexbility in the way outputs are saved, this class works as a ContextManager.

This PR looks big, but is in essence a small change that reflects in every evaluation algorithm. The main change is in save_strategy.py and eval_algorithms/common.py (which are new files).

Incidentally, this PR also updates ray version from 2.9.1 to 2.23.0

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

review-notebook-app · 2024-05-28T00:43:44Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

danielezhu

Most comments are nits about docstring issues, but I also left some suggestions on the save strategy source code and unit tests. Thanks!

src/fmeval/eval_algorithms/classification_accuracy.py

src/fmeval/eval_algorithms/save_strategy.py

test/unit/eval_algorithms/test_save_strategy.py

danielezhu · 2024-05-28T18:13:37Z

test/unit/eval_algorithms/test_save_strategy.py

+                },
+            )
+        ] * 3
+        with patch.object(s3_client, "upload_part", return_value={"ETag": 1}) as upload_part, patch.object(


Can we use a side effect for upload_part where it returns {"ETag": "etag_1"}, {"ETag": "etag_2"}, {"ETag": "etag_3"}? Later, we can validate that self._parts_info contains these values.

danielezhu · 2024-05-28T18:19:33Z

test/unit/eval_algorithms/test_save_strategy.py

+                for _ in range(num_of_save_times):
+                    save_strategy.save(records)
+            assert upload_part.call_count == 3
+            assert complete_multipart_upload.call_count == 1


Following up from the comment above, at this point, self._parts_info[PARTS] should look like

[ {PART_NUMBER: 1, E_TAG: "etag_1"}, {PART_NUMBER: 1, E_TAG: "etag_2"}, {PART_NUMBER: 1, E_TAG: "etag_3"} ]

so we can use

complete_multipart_upload.assert_called_once_with( Bucket=# mocked value, Key=# mocked value, UploadId="1234", MultipartUpload={PARTS: # the list above}, )

…ation outputs

* feat: add SaveStrategy to allow flexibility in saving localized evaluation outputs (#281) * feat: modify GeneratePrompt transform to take placeholder_dict (#288) * feat: modify GeneratePrompt transform to take placeholder_dict * fix: unit test * fix: requested changes --------- Co-authored-by: keerthanvasist <[email protected]> Co-authored-by: Xiaoyi Cheng <[email protected]>

keerthanvasist force-pushed the saver branch 4 times, most recently from 4de8c13 to 605814c Compare May 28, 2024 14:51

danielezhu suggested changes May 28, 2024

View reviewed changes

feat: add SaveStrategy to allow flexibility in saving localized evalu…

71ba525

…ation outputs

keerthanvasist force-pushed the saver branch from 605814c to 71ba525 Compare May 28, 2024 19:46

danielezhu approved these changes May 28, 2024

View reviewed changes

oyangz approved these changes May 28, 2024

View reviewed changes

keerthanvasist merged commit 3ce289f into aws:main May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: add SaveStrategy to allow flexibility in saving localized evaluation outputs #281

feat: add SaveStrategy to allow flexibility in saving localized evaluation outputs #281

Uh oh!

keerthanvasist commented May 28, 2024 •

edited

Loading

Uh oh!

review-notebook-app bot commented May 28, 2024

Uh oh!

danielezhu left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

danielezhu May 28, 2024

Uh oh!

danielezhu May 28, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

feat: add SaveStrategy to allow flexibility in saving localized evaluation outputs #281

feat: add SaveStrategy to allow flexibility in saving localized evaluation outputs #281

Uh oh!

Conversation

keerthanvasist commented May 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented May 28, 2024

Uh oh!

danielezhu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

danielezhu May 28, 2024

Choose a reason for hiding this comment

Uh oh!

danielezhu May 28, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

keerthanvasist commented May 28, 2024 •

edited

Loading