chore: remove XSUM dataset from example notebook and integration tests #192

danielezhu · 2024-02-15T17:50:19Z

Description of changes:
This PR is a follow-up of #191, where the last traces of the XSUM dataset are removed from the codebase. The integration tests that used XSUM now use Gigaword, and have had their expected values updated.

This PR also updates all of the integration tests so that ray.shutdown() is called in between the tests for each evaluation algorithm. This is used to clean up resources in between tests, and has reduced the mask disk usage during testing from ~18 GB to ~6 GB.

Lastly, this PR moves the initialization of the SummarizationAccuracy object in test_summarization_accuracy.py from the top of the file into the test method. This is required because code at the top level of every file gets run at the very start of testing, before any tests are executed. This means that the BertscoreHelperModel actor created by the SummarizationAccuracy object also gets created right from the beginning. When we call ray.shutdown() the first time, it will clean up the BertscoreHelperModel resource, meaning that by the time we execute the summarization accuracy integ test, said actor will not exist as expected, and the test will fail.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

review-notebook-app · 2024-02-15T17:50:25Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

… the test method for test_summarization_accuracy.py

chore: remove XSUM dataset from example notebook and integration tests

f5b34cd

xiaoyi-cheng previously approved these changes Feb 15, 2024

View reviewed changes

oyangz previously approved these changes Feb 15, 2024

View reviewed changes

Add ray.shutdown() to the end of every integration test file

f7c0619

danielezhu dismissed stale reviews from oyangz and xiaoyi-cheng via f7c0619 February 15, 2024 21:55

Daniel Zhu added 2 commits February 15, 2024 15:45

fix: move the initialization of the SummarizationAccuracy object into…

0cfc3a6

… the test method for test_summarization_accuracy.py

Move ray.shutdown() into a class-level pytest fixture

ebc7686

malhotra18 approved these changes Feb 16, 2024

View reviewed changes

danielezhu mentioned this pull request Feb 16, 2024

fix: change prompt_template with summarize instruction #182

Closed

xiaoyi-cheng approved these changes Feb 16, 2024

View reviewed changes

danielezhu merged commit c18b93d into aws:main Feb 16, 2024

danielezhu deleted the remove_xsum_from_integ_test branch February 16, 2024 08:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

chore: remove XSUM dataset from example notebook and integration tests #192

chore: remove XSUM dataset from example notebook and integration tests #192

Uh oh!

danielezhu commented Feb 15, 2024 •

edited

Loading

Uh oh!

review-notebook-app bot commented Feb 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

chore: remove XSUM dataset from example notebook and integration tests #192

chore: remove XSUM dataset from example notebook and integration tests #192

Uh oh!

Conversation

danielezhu commented Feb 15, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Feb 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

danielezhu commented Feb 15, 2024 •

edited

Loading