fix(llmobs): properly parse "newer" anthropic models, cohere models from bedrock-runtime calls #6383

sabrenner · 2025-09-04T13:02:06Z

What does this PR do?

Makes sure that bedrock runtime calls to invoke anthropic models properly capture the last user message on the LLM span's input, and properly capture the text blocks for the assistant's response.

Additionally, does the same thing for newer cohere response models.

Lastly, adds cassettes to these tests instead of nock mocking, which could be flaky and are hard to get right with the actual return values from the bedrock-runtime AWS service.

Motivation

Drive-by fixes while generating/using cassettes for real-world, accurate response shapes from bedrock-runtime. Evidently, it paid off 😆

github-actions · 2025-09-04T13:02:41Z

Overall package size

Self size: 12 MB
Deduped: 112.08 MB
No deduping: 112.48 MB

Dependency sizes

| name | version | self size | total size | |------|---------|-----------|------------| | @datadog/libdatadog | 0.7.0 | 35.02 MB | 35.02 MB | | @datadog/native-appsec | 10.2.1 | 20.64 MB | 20.65 MB | | @datadog/native-iast-taint-tracking | 4.0.0 | 11.72 MB | 11.73 MB | | @datadog/pprof | 5.10.0 | 9.91 MB | 10.3 MB | | @opentelemetry/core | 1.30.1 | 908.66 kB | 7.16 MB | | protobufjs | 7.5.4 | 2.95 MB | 5.6 MB | | @datadog/wasm-js-rewriter | 4.0.1 | 2.85 MB | 3.58 MB | | @datadog/native-metrics | 3.1.1 | 1.02 MB | 1.43 MB | | @opentelemetry/api | 1.8.0 | 1.21 MB | 1.21 MB | | jsonpath-plus | 10.3.0 | 617.18 kB | 1.08 MB | | import-in-the-middle | 1.14.2 | 122.36 kB | 850.93 kB | | lru-cache | 10.4.3 | 804.3 kB | 804.3 kB | | opentracing | 0.14.7 | 194.81 kB | 194.81 kB | | source-map | 0.7.6 | 185.63 kB | 185.63 kB | | pprof-format | 2.2.1 | 163.06 kB | 163.06 kB | | @datadog/sketches-js | 2.1.1 | 109.9 kB | 109.9 kB | | lodash.sortby | 4.7.0 | 75.76 kB | 75.76 kB | | ignore | 7.0.5 | 63.38 kB | 63.38 kB | | istanbul-lib-coverage | 3.2.2 | 34.37 kB | 34.37 kB | | rfdc | 1.4.1 | 27.15 kB | 27.15 kB | | dc-polyfill | 0.1.10 | 26.73 kB | 26.73 kB | | @isaacs/ttlcache | 1.4.1 | 25.2 kB | 25.2 kB | | tlhunter-sorted-set | 0.1.0 | 24.94 kB | 24.94 kB | | shell-quote | 1.8.3 | 23.74 kB | 23.74 kB | | limiter | 1.1.5 | 23.17 kB | 23.17 kB | | retry | 0.13.1 | 18.85 kB | 18.85 kB | | semifies | 1.0.0 | 15.84 kB | 15.84 kB | | jest-docblock | 29.7.0 | 8.99 kB | 12.76 kB | | crypto-randomuuid | 1.0.0 | 11.18 kB | 11.18 kB | | ttl-set | 1.0.0 | 4.61 kB | 9.69 kB | | mutexify | 1.4.0 | 5.71 kB | 8.74 kB | | path-to-regexp | 0.1.12 | 6.6 kB | 6.6 kB | | koalas | 1.0.2 | 6.47 kB | 6.47 kB | | module-details-from-path | 1.0.4 | 3.96 kB | 3.96 kB |

_{🤖 This report was automatically generated by heaviest-objects-in-the-universe}

codecov · 2025-09-04T13:02:56Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 84.32%. Comparing base (e6dcda4) to head (380a448).
⚠️ Report is 1 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #6383   +/-   ##
=======================================
  Coverage   84.31%   84.32%           
=======================================
  Files         477      477           
  Lines       20086    20086           
=======================================
+ Hits        16936    16937    +1     
+ Misses       3150     3149    -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

…r/bedrock-runtime-tests-use-cassettes

pr-commenter · 2025-09-08T20:16:51Z

Benchmarks

Benchmark execution time: 2025-09-09 17:00:18

Comparing candidate commit 380a448 in PR branch sabrenner/bedrock-runtime-tests-use-cassettes with baseline commit e6dcda4 in branch master.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 1683 metrics, 81 unstable metrics.

…b.com:DataDog/dd-trace-js into sabrenner/bedrock-runtime-tests-use-cassettes

…rom bedrock-runtime calls (#6383) * utils fixes * update testagent version * update fixtures * generate cassettes * test changes * use fixed testagent version * add in break line * add in nullish coalesce for anthropic input messages

sabrenner added 5 commits September 4, 2025 08:43

utils fixes

9a0b8fd

update testagent version

486aecf

update fixtures

c9729ab

generate cassettes

27ec239

test changes

22d63c0

sabrenner added the semver-patch label Sep 4, 2025

sabrenner added 2 commits September 8, 2025 16:08

use fixed testagent version

efeca3c

Merge branch 'master' of github.com:DataDog/dd-trace-js into sabrenne…

1202560

…r/bedrock-runtime-tests-use-cassettes

sabrenner and others added 4 commits September 9, 2025 09:17

Merge branch 'master' into sabrenner/bedrock-runtime-tests-use-cassettes

6cb7e4f

Merge branch 'master' into sabrenner/bedrock-runtime-tests-use-cassettes

7b9442c

add in break line

cc67d74

Merge branch 'sabrenner/bedrock-runtime-tests-use-cassettes' of githu…

975d38f

…b.com:DataDog/dd-trace-js into sabrenner/bedrock-runtime-tests-use-cassettes

sabrenner marked this pull request as ready for review September 9, 2025 14:32

sabrenner requested review from a team as code owners September 9, 2025 14:32

add in nullish coalesce for anthropic input messages

380a448

Kyle-Verhoog approved these changes Sep 9, 2025

View reviewed changes

sabrenner merged commit f12c439 into master Sep 9, 2025
698 checks passed

sabrenner deleted the sabrenner/bedrock-runtime-tests-use-cassettes branch September 9, 2025 17:30

dd-octo-sts bot mentioned this pull request Sep 9, 2025

v5.66.0 proposal #6378

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(llmobs): properly parse "newer" anthropic models, cohere models from bedrock-runtime calls #6383

fix(llmobs): properly parse "newer" anthropic models, cohere models from bedrock-runtime calls #6383

sabrenner commented Sep 4, 2025

Uh oh!

github-actions bot commented Sep 4, 2025 •

edited

Loading

Uh oh!

codecov bot commented Sep 4, 2025 •

edited

Loading

Uh oh!

pr-commenter bot commented Sep 8, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

fix(llmobs): properly parse "newer" anthropic models, cohere models from bedrock-runtime calls #6383

fix(llmobs): properly parse "newer" anthropic models, cohere models from bedrock-runtime calls #6383

Conversation

sabrenner commented Sep 4, 2025

What does this PR do?

Motivation

Uh oh!

github-actions bot commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overall package size

Uh oh!

codecov bot commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

pr-commenter bot commented Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks

Uh oh!

Uh oh!

Uh oh!

github-actions bot commented Sep 4, 2025 •

edited

Loading

codecov bot commented Sep 4, 2025 •

edited

Loading

pr-commenter bot commented Sep 8, 2025 •

edited

Loading