
sabrenner
Collaborator

What does this PR do?

Ensures that bedrock-runtime calls that invoke Anthropic models capture the last user message on the LLM span's input, and capture the text blocks of the assistant's response.
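A minimal sketch of the extraction described above, assuming the Anthropic Messages request/response shape (`messages[].role`, `messages[].content` as a string or an array of `{ type: 'text', text }` blocks). The helper names `textFromContent`, `lastUserMessage`, and `assistantText` are hypothetical and are not taken from the PR; joining text blocks without a separator is also an assumption.

```javascript
// Hypothetical helpers for illustration only; not the tracer's actual code.

// Content may be a plain string or an array of content blocks.
// Only blocks of type "text" contribute to the result.
function textFromContent (content) {
  if (typeof content === 'string') return content
  if (!Array.isArray(content)) return ''
  return content
    .filter(block => block?.type === 'text' && typeof block.text === 'string')
    .map(block => block.text)
    .join('') // assumption: blocks are concatenated without a separator
}

// Walk the messages backwards and return the text of the last "user" message.
function lastUserMessage (requestBody) {
  const messages = requestBody?.messages ?? [] // tolerate a missing messages array
  for (let i = messages.length - 1; i >= 0; i--) {
    if (messages[i]?.role === 'user') return textFromContent(messages[i].content)
  }
  return ''
}

// Collect the text blocks of the assistant's response.
function assistantText (responseBody) {
  return textFromContent(responseBody?.content ?? [])
}

const exampleRequest = {
  messages: [
    { role: 'user', content: 'Hello' },
    { role: 'assistant', content: 'Hi, how can I help?' },
    { role: 'user', content: [{ type: 'text', text: 'What is 2 + 2?' }] }
  ]
}

const exampleResponse = {
  content: [
    { type: 'text', text: '2 + 2 ' },
    { type: 'text', text: 'is 4.' }
  ]
}

console.log(lastUserMessage(exampleRequest))
console.log(assistantText(exampleResponse))
```

The nullish coalescing on `messages` and `content` keeps the helpers from throwing when a request or response omits those fields.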

Additionally, does the same for the responses of newer Cohere models.
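As a rough illustration of handling more than one response shape, the sketch below assumes that newer Cohere models return a top-level `text` field while older ones return `generations[].text`. Both shapes are assumptions made for this example, and `cohereOutputText` is a hypothetical name; the PR itself does not spell out the fields involved.

```javascript
// Hypothetical helper for illustration only; field names are assumptions.
function cohereOutputText (responseBody) {
  // Assumed newer shape: { text: '...' }
  if (typeof responseBody?.text === 'string') return responseBody.text
  // Assumed older shape: { generations: [{ text: '...' }] }
  return responseBody?.generations?.[0]?.text ?? ''
}

const newerShape = { text: 'Paris is the capital of France.' }
const olderShape = { generations: [{ text: 'Paris.' }] }

console.log(cohereOutputText(newerShape))
console.log(cohereOutputText(olderShape))
```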

Lastly, replaces nock mocking in these tests with cassettes. The nock mocks could be flaky and were hard to keep in line with the values the bedrock-runtime AWS service actually returns.
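For readers unfamiliar with cassettes, the idea is record and replay: the first run performs the real call and stores the response, and later runs replay the stored response. The sketch below illustrates only that idea. `withCassette` and `fakeBedrockCall` are hypothetical, a `Map` stands in for cassette files on disk, and none of this reflects how the test agent used by the PR implements cassettes.

```javascript
// Hypothetical record/replay helper for illustration only.
function withCassette (cassette, request, callLive) {
  const key = JSON.stringify(request) // identical requests map to the same recording
  if (cassette.has(key)) return cassette.get(key) // replay a recorded response
  const response = callLive(request) // no recording yet: make the real call
  cassette.set(key, response) // record it for later runs
  return response
}

// Stand-in for a real service call; counts how often it is actually invoked.
let liveCalls = 0
function fakeBedrockCall (request) {
  liveCalls++
  return { content: [{ type: 'text', text: 'recorded reply' }] }
}

const cassette = new Map()
const cassetteRequest = { modelId: 'example-model', body: { prompt: 'hi' } }

const firstResponse = withCassette(cassette, cassetteRequest, fakeBedrockCall)
const secondResponse = withCassette(cassette, cassetteRequest, fakeBedrockCall)

console.log(liveCalls)
```

Because the recording comes from a real response, assertions in the tests run against the shape the service actually returned rather than a hand-written mock.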

Motivation

Drive-by fixes while generating/using cassettes for real-world, accurate response shapes from bedrock-runtime. Evidently, it paid off 😆


github-actions bot commented Sep 4, 2025

Overall package size

Self size: 12 MB
Deduped: 112.08 MB
No deduping: 112.48 MB

Dependency sizes

| name | version | self size | total size |
|------|---------|-----------|------------|
| @datadog/libdatadog | 0.7.0 | 35.02 MB | 35.02 MB |
| @datadog/native-appsec | 10.2.1 | 20.64 MB | 20.65 MB |
| @datadog/native-iast-taint-tracking | 4.0.0 | 11.72 MB | 11.73 MB |
| @datadog/pprof | 5.10.0 | 9.91 MB | 10.3 MB |
| @opentelemetry/core | 1.30.1 | 908.66 kB | 7.16 MB |
| protobufjs | 7.5.4 | 2.95 MB | 5.6 MB |
| @datadog/wasm-js-rewriter | 4.0.1 | 2.85 MB | 3.58 MB |
| @datadog/native-metrics | 3.1.1 | 1.02 MB | 1.43 MB |
| @opentelemetry/api | 1.8.0 | 1.21 MB | 1.21 MB |
| jsonpath-plus | 10.3.0 | 617.18 kB | 1.08 MB |
| import-in-the-middle | 1.14.2 | 122.36 kB | 850.93 kB |
| lru-cache | 10.4.3 | 804.3 kB | 804.3 kB |
| opentracing | 0.14.7 | 194.81 kB | 194.81 kB |
| source-map | 0.7.6 | 185.63 kB | 185.63 kB |
| pprof-format | 2.2.1 | 163.06 kB | 163.06 kB |
| @datadog/sketches-js | 2.1.1 | 109.9 kB | 109.9 kB |
| lodash.sortby | 4.7.0 | 75.76 kB | 75.76 kB |
| ignore | 7.0.5 | 63.38 kB | 63.38 kB |
| istanbul-lib-coverage | 3.2.2 | 34.37 kB | 34.37 kB |
| rfdc | 1.4.1 | 27.15 kB | 27.15 kB |
| dc-polyfill | 0.1.10 | 26.73 kB | 26.73 kB |
| @isaacs/ttlcache | 1.4.1 | 25.2 kB | 25.2 kB |
| tlhunter-sorted-set | 0.1.0 | 24.94 kB | 24.94 kB |
| shell-quote | 1.8.3 | 23.74 kB | 23.74 kB |
| limiter | 1.1.5 | 23.17 kB | 23.17 kB |
| retry | 0.13.1 | 18.85 kB | 18.85 kB |
| semifies | 1.0.0 | 15.84 kB | 15.84 kB |
| jest-docblock | 29.7.0 | 8.99 kB | 12.76 kB |
| crypto-randomuuid | 1.0.0 | 11.18 kB | 11.18 kB |
| ttl-set | 1.0.0 | 4.61 kB | 9.69 kB |
| mutexify | 1.4.0 | 5.71 kB | 8.74 kB |
| path-to-regexp | 0.1.12 | 6.6 kB | 6.6 kB |
| koalas | 1.0.2 | 6.47 kB | 6.47 kB |
| module-details-from-path | 1.0.4 | 3.96 kB | 3.96 kB |

🤖 This report was automatically generated by heaviest-objects-in-the-universe


codecov bot commented Sep 4, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.32%. Comparing base (e6dcda4) to head (380a448).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #6383   +/-   ##
=======================================
  Coverage   84.31%   84.32%           
=======================================
  Files         477      477           
  Lines       20086    20086           
=======================================
+ Hits        16936    16937    +1     
+ Misses       3150     3149    -1     



pr-commenter bot commented Sep 8, 2025

Benchmarks

Benchmark execution time: 2025-09-09 17:00:18

Comparing candidate commit 380a448 in PR branch sabrenner/bedrock-runtime-tests-use-cassettes with baseline commit e6dcda4 in branch master.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 1683 metrics, 81 unstable metrics.

@sabrenner sabrenner marked this pull request as ready for review September 9, 2025 14:32
@sabrenner sabrenner requested review from a team as code owners September 9, 2025 14:32
@sabrenner sabrenner merged commit f12c439 into master Sep 9, 2025
698 checks passed
@sabrenner sabrenner deleted the sabrenner/bedrock-runtime-tests-use-cassettes branch September 9, 2025 17:30
dd-octo-sts bot pushed a commit that referenced this pull request Sep 9, 2025
…rom bedrock-runtime calls (#6383)

* utils fixes

* update testagent version

* update fixtures

* generate cassettes

* test changes

* use fixed testagent version

* add in break line

* add in nullish coalesce for anthropic input messages
@dd-octo-sts dd-octo-sts bot mentioned this pull request Sep 9, 2025
crysmags pushed a commit that referenced this pull request Sep 9, 2025
…rom bedrock-runtime calls (#6383)
