@tyler6204
Member

Summary

  • Skip image understanding API call entirely when the primary model supports vision natively
  • Saves API cost and latency - no point describing an image when the model will see it directly

Changes

  • model-catalog.ts: Add input field to track model capabilities, plus modelSupportsVision() and findModelInCatalog() helpers
  • runner.ts: Check if primary model has vision before running image understanding; skip if it does
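
The two catalog helpers named above could look roughly like the sketch below. This is not the code from the PR: the entry shape, the `provider` field, the `"text" | "image"` values for `input`, and both function signatures are assumptions made for illustration. The PR only states that an `input` field tracks model capabilities.

```typescript
// Hypothetical capability values; the PR only says `input` tracks capabilities.
type ModelInput = "text" | "image";

// Assumed catalog entry shape (not taken from model-catalog.ts).
interface ModelCatalogEntry {
  id: string;
  provider: string;
  input?: ModelInput[];
}

// Look up an entry by provider and model id, ignoring case and
// surrounding whitespace. Returns undefined when nothing matches.
function findModelInCatalog(
  catalog: readonly ModelCatalogEntry[],
  provider: string,
  modelId: string,
): ModelCatalogEntry | undefined {
  const wantedProvider = provider.trim().toLowerCase();
  const wantedId = modelId.trim().toLowerCase();
  return catalog.find(
    (entry) =>
      entry.provider.toLowerCase() === wantedProvider &&
      entry.id.toLowerCase() === wantedId,
  );
}

// A model counts as vision-capable only if its entry lists "image".
// Unknown models and entries without `input` are treated as text-only,
// so image understanding still runs for them.
function modelSupportsVision(entry: ModelCatalogEntry | undefined): boolean {
  return entry?.input?.includes("image") ?? false;
}

// Placeholder catalog with made-up ids, for demonstration only.
const exampleCatalog: ModelCatalogEntry[] = [
  { id: "vision-model", provider: "example", input: ["text", "image"] },
  { id: "text-model", provider: "example", input: ["text"] },
  { id: "legacy-model", provider: "example" },
];
```

Defaulting to `false` for missing or unknown entries is a deliberate choice in this sketch: a wrong "no vision" answer only costs an extra description call, while a wrong "has vision" answer would leave a text-only model with no information about the image.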

Test plan

  • Send image to vision-capable model (Claude Opus 4.5) → no image description API call, image passed natively
  • Send image to non-vision model → media understanding runs as before
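
The two cases in the test plan can be expressed as a small per-attachment decision. The sketch below is a guess at the logic, not the code in runner.ts: the `Attachment` and `UnderstandingPlan` types and the `planMediaUnderstanding` name are hypothetical, and the caller is assumed to have already determined whether the primary model supports vision.

```typescript
// Hypothetical attachment shape, for illustration only.
type AttachmentKind = "image" | "audio" | "video";

interface Attachment {
  kind: AttachmentKind;
  path: string;
}

// Result of the decision: which attachments still need an
// understanding call and which go to the model unchanged.
interface UnderstandingPlan {
  understand: Attachment[];
  passNatively: Attachment[];
}

// Decide per attachment whether the understanding call can be skipped.
// Only images are skipped, and only when the primary model has vision;
// every other attachment is handled as before.
function planMediaUnderstanding(
  attachments: readonly Attachment[],
  primaryModelHasVision: boolean,
): UnderstandingPlan {
  const plan: UnderstandingPlan = { understand: [], passNatively: [] };
  for (const attachment of attachments) {
    if (attachment.kind === "image" && primaryModelHasVision) {
      plan.passNatively.push(attachment);
    } else {
      plan.understand.push(attachment);
    }
  }
  return plan;
}
```

With a vision-capable primary model, an image lands in `passNatively` and triggers no description call. With a non-vision model, the same image lands in `understand`, so media understanding runs as before.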

🤖 Generated with Claude Code

When the primary model supports vision natively (e.g., Claude Opus 4.5),
skip the image understanding call entirely. The image will be injected
directly into the model context instead, saving an API call and avoiding
redundant descriptions.

Co-Authored-By: Claude Opus 4.5 <[email protected]>
steipete added a commit that referenced this pull request Jan 25, 2026
@steipete
Contributor

Rebased onto main, added attachment-scoped skip decision for vision models + regression test + changelog.
Ran pnpm lint && pnpm build && pnpm test.
Landed commits:

Thanks @tyler6204!

@steipete steipete closed this Jan 25, 2026
mcinteerj pushed a commit to mcinteerj/moltbot that referenced this pull request Jan 25, 2026
rodrigouroz pushed a commit to rodrigouroz/moltbot that referenced this pull request Jan 26, 2026