-
Notifications
You must be signed in to change notification settings - Fork 8.2k
Add sample_images to enable browseruse to learn how to operate a plat… #2910
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
2 issues found across 3 files
Since this is your first cubic review, here's how it works:
- cubic automatically reviews your code and comments on bugs and improvements
- Teach cubic by replying to its comments. cubic learns from your replies and gets better over time
- Ask questions if you need clarification on any suggestion
React with 👍 or 👎 to teach cubic. You can also tag @cubic-dev-ai
to give feedback, ask questions, or re-run the review.
fdb468a
to
f68c086
Compare
Co-authored-by: cubic-dev-ai[bot] <191113872+cubic-dev-ai[bot]@users.noreply.github.com>
Awesome - can you add an example with a dummy image into |
|
||
# Add sample images | ||
content_parts.extend(self.sample_images) | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The example supplementation has been completed, and the corresponding pull request is: #2949. Please kindly review and provide any suggestions for further optimization. |
…form.
When setting up task in browseruse, add a parameter for annotating images, enabling browseruse to recognize some interactive components on web pages.
Summary by cubic
Add optional sample_images to Agent and message flow to include annotated example images in the vision prompt, helping the agent recognize interactive UI components. When provided, these images are inserted before screenshots; behavior is unchanged if omitted.