fix: update GetModelResponse transform to work with any ModelRunner #228

danielezhu · 2024-03-23T21:41:14Z

Description of changes:
Currently, GetModelResponse is a transform that augments the input record with the entire model response payload from calling the predict method of a ModelRunner. The response payload is of the form (model_output, log_prob), where both model_output and log_prob are optional, and whether they are non-null depends on the particular ModelRunner being used.

When initializing a GetModelResponse, one must provide a tuple representing the keys that will be associated with the data in a response payload. Algorithms like SummarizationAccuracy that currently use GetModelResponse only care about obtaining model outputs (not the log probabilities), and thus provide only a key for the model output when constructing GetModelResponse instances.

The bug is that GetModelResponse currently requires the response payload to conform to the format of the response payload keys, instead of the other way around. For example, if a GetModelResponse is initialized with the response key tuple (model_output_key, ), then its __call__ method will raise an error if the model's predict method returns a payload that contains log probabilities, because the following length check fails:

assert_condition(
    len(model_response) == len(response_key_tuple),
    f"The number of elements in model response {model_response} "
    f"does not match number of response keys in {response_key_tuple}.",
)
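The failure mode can be reproduced in isolation. The sketch below is a minimal illustration, not the library's code: `assert_condition` is a stand-in that raises `ValueError` (the real helper may raise a different exception type), and `predict` is a hypothetical function that mimics a ModelRunner returning both a model output and a log probability.

```python
from typing import Optional, Tuple


def assert_condition(condition: bool, message: str) -> None:
    # Stand-in for the library helper (assumption: the real one may
    # raise a different exception type).
    if not condition:
        raise ValueError(message)


def predict(prompt: str) -> Tuple[Optional[str], Optional[float]]:
    # Hypothetical predict call that returns BOTH a model output
    # and a log probability.
    return "some model output", -0.5


# The caller only cares about the model output, so it supplies one key.
response_key_tuple = ("model_output",)
model_response = predict("Summarize this text.")

error_message = None
try:
    # Same length check as in the snippet above: 2 elements vs. 1 key.
    assert_condition(
        len(model_response) == len(response_key_tuple),
        f"The number of elements in model response {model_response} "
        f"does not match number of response keys in {response_key_tuple}.",
    )
except ValueError as err:
    error_message = str(err)

print(error_message)
```

The check rejects a perfectly valid payload only because the caller asked for fewer fields than the model returned.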

This PR renames GetModelResponse to GetModelOutputs and fixes the bug above. Now, GetModelOutputs is responsible solely for extracting the model_output portion of the (model_output, log_probability) predict response, and more importantly, the __call__ logic does not make any assumptions about the format of the response payload.
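As a rough sketch of the new behavior, assuming the `input_to_output_keys` mapping and the `model_runner.predict` call that appear in the reviewed diff: the class below is a simplified illustration, not the merged implementation. `StubModelRunner` is hypothetical, and the line that writes `model_output` into the record is an assumption about how the result is stored.

```python
from typing import Any, Dict, List, Optional, Tuple


class StubModelRunner:
    # Hypothetical runner used only for this illustration.
    def predict(self, prompt: str) -> Tuple[Optional[str], Optional[float]]:
        return f"echo: {prompt}", -1.23


class GetModelOutputs:
    """Simplified sketch: keeps only the model_output part of predict()."""

    def __init__(self, input_to_output_keys: Dict[str, List[str]], model_runner: Any):
        self.input_to_output_keys = input_to_output_keys
        self.model_runner = model_runner

    def __call__(self, record: Dict[str, Any]) -> Dict[str, Any]:
        for input_key, output_keys in self.input_to_output_keys.items():
            for output_key in output_keys:
                # The log probability is discarded, so it no longer matters
                # whether the runner returns one or not.
                model_output, _ = self.model_runner.predict(record[input_key])
                # Assumption: the output is stored under the output key.
                record[output_key] = model_output
        return record


transform = GetModelOutputs({"prompt": ["model_output"]}, StubModelRunner())
result = transform({"prompt": "hi"})
print(result)
```

Because the transform unpacks the payload itself rather than matching it against a caller-supplied key tuple, the same code works whether the log probability is a float or None.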

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

franluca · 2024-03-24T19:04:06Z

src/fmeval/transforms/common.py

-                    record[model_response_key] = model_response_item
+        for input_key, output_keys in self.input_to_output_keys.items():
+            for output_key in output_keys:
+                model_output, _ = self.model_runner.predict(record[input_key])


Great! The next step is to also refactor the ModelRunner interface to separate the two calls
(and to add a transform for input log probabilities, which are used in Stereotyping).

fix: update GetModelResponse transform to work with any ModelRunner

6d0786d

danielezhu requested review from franluca and oyangz March 23, 2024 22:52

franluca approved these changes Mar 24, 2024

oyangz approved these changes Mar 25, 2024

danielezhu merged commit 5a11f14 into aws:main Mar 25, 2024

danielezhu deleted the fix_get_model_response branch March 25, 2024 07:09
