vladsavelyev · 2024-10-09T19:15:41Z

Ideas to handle large reports

Define token limits for each provider based on their documented context windows
Prioritize content:
- Always include the tools summary for context
- Always include general statistics if available
- Add detailed sections until approaching the limit
Use a conservative token estimation (4 chars per token) and leave a 10% buffer
Log a warning when content needs to be truncated
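
A minimal sketch of the budgeting idea above, not the code in this PR. The provider names and limits in `TOKEN_LIMITS` are placeholders rather than documented context windows, and `build_prompt` and `estimate_tokens` are hypothetical helper names.

```python
import logging

logger = logging.getLogger(__name__)

# Placeholder values for illustration only, NOT real documented context windows.
TOKEN_LIMITS = {"provider_a": 100_000, "provider_b": 8_000}
CHARS_PER_TOKEN = 4   # conservative estimate from the list above
BUFFER_PERCENT = 10   # share of the limit left unused


def estimate_tokens(text: str) -> int:
    """Rough estimate: one token per 4 characters, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def build_prompt(provider, tools_summary, general_stats, sections):
    """Return (prompt, number_of_sections_dropped)."""
    budget = TOKEN_LIMITS[provider] * (100 - BUFFER_PERCENT) // 100

    # The tools summary and general statistics are always included.
    parts = [tools_summary]
    if general_stats:
        parts.append(general_stats)
    used = sum(estimate_tokens(p) for p in parts)

    # Add detailed sections in order and stop at the first one that does not fit.
    dropped = 0
    for i, section in enumerate(sections):
        cost = estimate_tokens(section)
        if used + cost > budget:
            dropped = len(sections) - i
            break
        parts.append(section)
        used += cost

    if dropped:
        logger.warning(
            "Report truncated for %s: dropped %d of %d sections (about %d of %d tokens used)",
            provider, dropped, len(sections), used, budget,
        )
    return "\n\n".join(parts), dropped
```

The sketch stops at the first section that overflows the budget instead of looking for smaller sections further down, which keeps the order of the report intact.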

Additional recommendations that could be implemented:

Add configuration options to control content prioritization:

config.ai_summary_prioritize_sections = ["fastqc", "alignment_metrics"]  # Priority sections
config.ai_summary_exclude_sections = ["software_versions"]  # Sections to skip
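
As a rough illustration of how the two proposed options could be applied, assuming they are plain lists of section IDs. `select_sections` is a hypothetical helper, and the option names are the ones proposed in this comment, not confirmed settings.

```python
def select_sections(section_ids, prioritize=(), exclude=()):
    """Drop excluded sections, then move prioritized ones to the front."""
    excluded = set(exclude)
    kept = [s for s in section_ids if s not in excluded]

    # Rank prioritized sections by their position in the config list.
    rank = {name: i for i, name in enumerate(prioritize)}

    # sorted() is stable, so all other sections keep their report order.
    return sorted(kept, key=lambda s: rank.get(s, len(rank)))


ordered = select_sections(
    ["software_versions", "samtools", "alignment_metrics", "fastqc"],
    prioritize=["fastqc", "alignment_metrics"],
    exclude=["software_versions"],
)
# ordered == ["fastqc", "alignment_metrics", "samtools"]
```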

Implement smarter content selection:

- Skip redundant information
- Summarize large tables/plots
- Focus on sections with warning/error flags
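
Two of these ideas could look roughly like this. This is a sketch under assumed data shapes: a table as a dict of sample name to numeric metrics, and sections as dicts with a hypothetical `status` key. Neither shape is taken from the PR.

```python
from statistics import mean


def summarize_table(rows):
    """Collapse a per-sample table into per-metric min/max/mean/count."""
    columns = {}
    for values in rows.values():
        for metric, value in values.items():
            columns.setdefault(metric, []).append(value)
    return {
        metric: {"min": min(v), "max": max(v), "mean": mean(v), "n": len(v)}
        for metric, v in columns.items()
    }


def flagged_first(sections):
    """Move sections flagged 'warn' or 'fail' to the front, keeping relative order."""
    # False sorts before True, and sorted() is stable.
    return sorted(sections, key=lambda s: s.get("status") not in ("warn", "fail"))
```

Sending the per-metric summary instead of every row trades per-sample detail for a prompt whose size no longer grows with the number of samples.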

Consider splitting analysis into multiple API calls for very large reports:

- First call for high-level overview
- Additional calls for detailed section analysis
- Combine results intelligently
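
The staged approach could be sketched as follows. `call_llm` stands in for whatever function sends a prompt to the chosen provider and returns its text reply. It is an assumption for illustration, as are the prompt wordings.

```python
from typing import Callable, Dict


def summarize_in_stages(
    overview_text: str,
    sections: Dict[str, str],
    call_llm: Callable[[str], str],
) -> str:
    """One overview call, one call per section, then one combining call."""
    overview = call_llm("Give a high-level overview of this report:\n" + overview_text)

    # Each section is analyzed on its own, so each prompt stays small.
    details = {
        name: call_llm(f"Analyze the section '{name}':\n{text}")
        for name, text in sections.items()
    }

    notes = "\n\n".join(
        [f"Overview:\n{overview}"] + [f"{name}:\n{d}" for name, d in details.items()]
    )
    return call_llm("Combine these notes into one summary without repeating points:\n" + notes)
```

This makes `len(sections) + 2` calls per report, so cost and latency grow with the number of sections.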

Add provider-specific optimizations:

- Use different prompts/strategies based on model capabilities
- Leverage features like function calling for structured output

ewels

Let's merge this sucka!

vladsavelyev added this to the v1.26 milestone Oct 9, 2024

vladsavelyev added 29 commits October 30, 2024 15:04

Merge branch 'ai' into ai-per-plot

92b092a

Fix for Safari

cee8ce6

Generate lists with 4 spaces

c4849f5

Merge branch 'ai' into ai-per-plot

f5c4aab

Pass JSON schema as a parameter

7a148ec

Typo

edf5bc1

Typo

a20815d

Add pyright config

b5ad693

Add markdownToHtml into JS env

a3e8714

Move AI JS stuff into ai.js

86dc0b2

Add continue-in-chat to per-section blocks

c83fa44

Merge branch 'ai' into ai-per-plot

108f05b

Format plot data for LLM. Support gpt

3b83772

Parametrize model name and add disclaimer. Smaller models do not work

a25d0a6

Pass MultiQC version and tags. Fix number formatting

f05fe30

Add showdown.js to convert markdown to html

44ff807

Show Seqera AI model in disclaimer

dc16094

Move dotenv

00851d7

Make Seqera AI token optional

fb3439f

Fix Continue with AI handler

e686b26

Fix for plot summaries too

d0b9df8

Highlight only sample prefixes/suffixes in sample tag. Do not show continue button if not seqera provider

d95b996

Fix typing

ac55c12

Default to anthropic

99ddc6e

Support short and full summary, default to short and anthropic provider

cd5ddf6

Switch to generalized endpoint

be83e4d

Use streaming endpoint

6ed48e8

Fix missing name in feature counts section

489699f

Show AI button even if section name is missing

6618eca

vladsavelyev and others added 11 commits January 17, 2025 11:38

Debug log messages about the use of env vars for API keys

980a371

Only save prompt to file if debug or dev

2c3cb42

Merge branch 'main' into ai

c7030bc

Preference to SEQERA_API_TOKEN

7290f5d

Lint

f2f46d2

Focus api_key field when its not entered

32acaec

More red

e880198

Minor tweaks to logging and error handling

065c5e7

Tweak debug log message about saved prompt location

2d2a27f

AI: Wrap Seqera AI prompt in <details> (#3050)

96843a2

* Seqera AI: Tweak prompt to hide stuff in a <details> collapsed element
* Use same chat title as in history title
* Remove unused import

Typos

1d13150

ewels and others added 13 commits January 19, 2025 21:00

Add :::details to the js prompt as well

3b40129

Tags

a4d5f10

Typo

0515717

Address review comments from @jason-seqera

38caa5b

Apply suggestions from code review

d2a4354

Co-authored-by: Jason Boxman <[email protected]>

Move creation date to report.creation_date, and format

8c87557

Add reties with exp backoff for Seqera AI

b70eaa4

Fix typing

1859851

Clean up report state

58b0332

AI docs: Tweaks and improvements

ec3f4f0

gitignore seqeradocs

8a56113

Support bustools 0.44.1 (#3053)

ceec185

Update docs screenshots

e17244a

ewels approved these changes Jan 21, 2025

vladsavelyev merged commit ebb52af into main Jan 22, 2025

vladsavelyev deleted the ai branch January 22, 2025 09:25

vladsavelyev restored the ai branch January 22, 2025 09:25

ewels deleted the ai branch September 17, 2025 20:10

AI tooltips #2915

vladsavelyev merged 336 commits into main from ai

4 participants