feat: add evaluator scoring #101

olzhik11 · 2025-07-10T12:33:30Z

Important

Adds evaluator scoring feature with EvaluatorsResource in LaminarClient, including tests for scoring methods and error handling.

Behavior:
- Adds EvaluatorsResource class in src/client/resources/evaluators.ts to handle evaluator scoring.
- Implements score() method in EvaluatorsResource to create scores using traceId or spanId.
- Throws error if neither traceId nor spanId is provided.
Client:
- Adds EvaluatorsResource to LaminarClient in src/client/index.ts.
- Provides evaluators getter in LaminarClient.
Tests:
- Adds evaluators-client.test.ts to test EvaluatorsResource methods.
- Tests scoring with both traceId and spanId, including error handling for missing IDs.
Misc:
- Updates version in package.json from 0.6.14 to 0.6.15.
- Removes unnecessary eslint-disable comments in src/cli.ts and src/evaluations.ts.
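
The behavior summarized above can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: only `EvaluatorsResource`, `LaminarClient`, the `evaluators` getter, `score()`, `traceId`, and `spanId` come from the description. The `PostFn` transport, the `name` and `score` payload fields, and the constructor shapes are assumptions made so the sketch runs without a network.

```typescript
// Sketch only: PostFn, the payload field names, and the constructors are assumed.
type ScoreOptions =
  | { name: string; score: number; traceId: string }
  | { name: string; score: number; spanId: string };

// Assumed transport; the real client presumably issues an HTTP POST here.
type PostFn = (path: string, body: Record<string, unknown>) => Promise<void>;

class EvaluatorsResource {
  private readonly post: PostFn;

  constructor(post: PostFn) {
    this.post = post;
  }

  // Creates a score attached to either a trace or a span.
  async score(options: ScoreOptions): Promise<void> {
    const body: Record<string, unknown> = { name: options.name, score: options.score };
    if ("traceId" in options && options.traceId) {
      body.traceId = options.traceId;
    } else if ("spanId" in options && options.spanId) {
      body.spanId = options.spanId;
    } else {
      // Reachable from plain JavaScript or via casts, where the types are bypassed.
      throw new Error("Either 'traceId' or 'spanId' must be provided.");
    }
    await this.post("/v1/evaluators/score", body);
  }
}

class LaminarClient {
  private readonly evaluatorsResource: EvaluatorsResource;

  constructor(post: PostFn) {
    this.evaluatorsResource = new EvaluatorsResource(post);
  }

  // Getter exposing the resource as `client.evaluators`.
  get evaluators(): EvaluatorsResource {
    return this.evaluatorsResource;
  }
}
```

Under these assumptions a caller would write something like `await client.evaluators.score({ name: "quality", score: 0.9, traceId })`, and a call with neither ID rejects with the error shown.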

This description was created by Ellipsis for 5d42e6d. You can customize this summary. It will automatically update as commits are pushed.

greptile-apps

Your free trial has ended. If you'd like to continue receiving code reviews, you can add a payment method here.

ellipsis-dev

Important

Looks good to me! 👍

Reviewed everything up to d461670 in 1 minute and 39 seconds. Click for details.

Reviewed 409 lines of code in 4 files
Skipped 0 files when reviewing.
Skipped posting 3 draft comments. View those below.
Modify your settings and rules to customize what types of comments Ellipsis leaves. And don't forget to react with 👍 or 👎 to teach Ellipsis.

1. src/client/resources/evaluators.ts:93

Draft comment:
If both 'traceId' and 'spanId' are provided, the code will only use 'traceId'. Consider explicitly handling or validating this case to avoid silent ambiguity.
Reason this comment was not posted:
Decided after close inspection that this draft comment was likely wrong and/or not actionable: usefulness confidence = 10% vs. threshold = 50% The comment suggests handling a case that's impossible due to TypeScript's type system. The ScoreOptions type explicitly makes traceId and spanId mutually exclusive through its union type definition. TypeScript would catch any attempt to provide both at compile time. The comment shows a misunderstanding of the type system. Could there be runtime scenarios where the type checking is bypassed? Could there be cases where the input comes from external sources? The function takes ScoreOptions as its parameter which is enforced by TypeScript. Even if called from JavaScript, the type guard checks ('traceId' in options) provide runtime safety. The comment should be deleted because it suggests handling an impossible case that's already prevented by the type system and runtime checks.

2. src/client/resources/evaluators.ts:107

Draft comment:
Add a newline at the end of the file to conform with standard file formatting.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

3. test/evaluators-client.test.ts:223

Draft comment:
Consider adding tests for the scenario where both 'traceId' and 'spanId' are supplied to ensure the API behaves as expected in that case.
Reason this comment was not posted:
Confidence changes required: 50% <= threshold 50% None
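
The both-IDs scenario raised in draft comments 1 and 3 can be illustrated with a small sketch. It shows one common way to make two keys mutually exclusive at the type level (`?: never`), plus a runtime check for callers that bypass the types. The type names and `resolveTarget` are hypothetical, and rejecting the both-IDs case is a variation suggested by the draft comment, not the PR's behavior (which, per the comment, uses `traceId`).

```typescript
// Each variant forbids the other key, so passing both fails to compile.
type TraceTarget = { traceId: string; spanId?: never };
type SpanTarget = { spanId: string; traceId?: never };
type ScoreTarget = TraceTarget | SpanTarget;

// Hypothetical helper: decides which ID a score should be attached to.
function resolveTarget(target: ScoreTarget): { kind: "trace" | "span"; id: string } {
  // Widen the type so the runtime checks also cover untyped JavaScript callers.
  const raw = target as { traceId?: unknown; spanId?: unknown };
  const hasTrace = typeof raw.traceId === "string";
  const hasSpan = typeof raw.spanId === "string";

  if (hasTrace && hasSpan) {
    // Explicit rejection instead of silently preferring traceId.
    throw new Error("Provide either 'traceId' or 'spanId', not both.");
  }
  if (hasTrace) return { kind: "trace", id: raw.traceId as string };
  if (hasSpan) return { kind: "span", id: raw.spanId as string };
  throw new Error("Either 'traceId' or 'spanId' must be provided.");
}
```

With these types, `resolveTarget({ traceId: "t", spanId: "s" })` is a compile-time error in TypeScript; the runtime branch only matters when the input arrives through a cast or from plain JavaScript.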

Workflow ID: wflow_XZI0GHn0hRoxo8Qp

You can customize Ellipsis by changing your verbosity settings, reacting with 👍 or 👎, replying to comments, or adding code review rules.

dinmukhamedm

Nice! A couple of small nits, and let's run pnpm lint:fix; I think most or all of your linting issues are auto-fixable. Later, let's add a pre-commit hook or an auto-fix step.

src/client/resources/evaluators.ts

test/evaluators-client.test.ts

ellipsis-dev

Important

Looks good to me! 👍

Reviewed 1150b4f in 48 seconds. Click for details.

Reviewed 151 lines of code in 2 files
Skipped 0 files when reviewing.
Skipped posting 3 draft comments. View those below.

1. src/client/resources/evaluators.ts:4

Draft comment:
Enum value change from 'SDK' to 'Code' is correctly applied. Ensure any external docs or comments are updated accordingly.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

2. src/client/resources/evaluators.ts:97

Draft comment:
Endpoint URL updated from '/v1/evaluators/score' to '/v1/evaluator-scores'. Consider extracting this endpoint into a constant for easier maintenance.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

3. test/evaluators-client.test.ts:38

Draft comment:
Test adjustments correctly reflect the endpoint and source changes. All assertions now expect 'Code' and new URL.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None
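
The suggestion in draft comment 2 (extracting the endpoint into a constant) could look roughly like this. The constant names, the `source` payload field, and `buildScoreRequest` are assumptions for illustration; only the path `/v1/evaluator-scores` and the `'Code'` value are taken from the review. A plain string constant stands in for the enum member here.

```typescript
// Hypothetical constants; the names are assumed, the values come from the review.
const EVALUATOR_SCORES_PATH = "/v1/evaluator-scores";
const SCORE_SOURCE_CODE = "Code"; // replaces the earlier 'SDK' value

interface ScoreRequest {
  url: string;
  payload: { name: string; score: number; source: string };
}

// Builds the request in one place, so a path change touches a single line.
function buildScoreRequest(baseUrl: string, name: string, score: number): ScoreRequest {
  const trimmedBase = baseUrl.replace(/\/+$/, ""); // avoid a double slash when joining
  return {
    url: `${trimmedBase}${EVALUATOR_SCORES_PATH}`,
    payload: { name, score, source: SCORE_SOURCE_CODE },
  };
}
```

Tests can then import the same constant rather than repeating the literal, which keeps the client and its mocks in step if the path changes again.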

Workflow ID: wflow_UKBzVWPgLCv385Mf


ellipsis-dev

Important

Looks good to me! 👍

Reviewed 527138c in 1 minute and 29 seconds. Click for details.

Reviewed 202 lines of code in 4 files
Skipped 0 files when reviewing.
Skipped posting 4 draft comments. View those below.

1. src/cli.ts:41

Draft comment:
Removed eslint-disable comments for global vars. Ensure declaring globals with 'var' is desired.
Reason this comment was not posted:
Comment looked like it was already resolved.

2. src/client/resources/evaluators.ts:4

Draft comment:
Added trailing comma in enum member for 'Code' improves diff readability. Verify that this formatting change is intentional.
Reason this comment was not posted:
Comment did not seem useful. Confidence is useful = 40% <= threshold 50% The comment is asking the PR author to verify if the formatting change is intentional, which violates the rule against asking for confirmation of intention. However, it also provides a suggestion about improving diff readability, which is useful. The comment could be rephrased to focus on the suggestion rather than asking for verification.

3. src/evaluations.ts:23

Draft comment:
Removed eslint-disable comments in global declarations. Confirm that using 'var' for globals is acceptable in your codebase.
Reason this comment was not posted:
Comment was on unchanged code.

4. test/evaluators-client.test.ts:52

Draft comment:
Refactored assertions to use deepStrictEqual for payload verification. This improves test conciseness—ensure no extra keys are unintentionally included.
Reason this comment was not posted:
Comment looked like it was already resolved.
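
The point in draft comment 4 about extra keys can be shown with a short sketch using Node's built-in `assert` module. The payload shape below is hypothetical; what matters is that `deepStrictEqual` compares the full set of keys, so an unexpected field makes the assertion fail.

```typescript
import { deepStrictEqual, throws } from "node:assert";

// Hypothetical payload, as it might be captured from a mocked request body.
const expectedPayload = { name: "quality", score: 0.9, source: "Code", traceId: "trace-123" };
const sentPayload = { traceId: "trace-123", source: "Code", score: 0.9, name: "quality" };

// Passes: same keys and values; key order does not matter.
deepStrictEqual(sentPayload, expectedPayload);

// An extra key in the actual payload makes the comparison throw,
// which is what guards against unintentionally included fields.
throws(() => deepStrictEqual({ ...sentPayload, spanId: "span-456" }, expectedPayload));
```

A single `deepStrictEqual` call replaces a series of per-field assertions, at the cost of the test needing to list every field the payload is expected to carry.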

Workflow ID: wflow_1X5Zxdd5YHpyaijN


ellipsis-dev

Important

Looks good to me! 👍

Reviewed 5d42e6d in 1 minute and 1 second. Click for details.

Reviewed 71 lines of code in 2 files
Skipped 0 files when reviewing.
Skipped posting 2 draft comments. View those below.

1. src/client/resources/evaluators.ts:97

Draft comment:
Updated endpoint URL. Consider extracting '/v1/evaluators/score' as a constant for reuse.
Reason this comment was not posted:
Confidence changes required: 33% <= threshold 50% None

2. test/evaluators-client.test.ts:38

Draft comment:
Test mocks updated to use '/v1/evaluators/score'. Consistent with client change.
Reason this comment was not posted:
Confidence changes required: 0% <= threshold 50% None

Workflow ID: wflow_s469vpW8s4m98AX5


olzhik11 added 2 commits July 10, 2025 13:32

feat: add evaluator scoring

e0b7525

feat: bump version

d461670

olzhik11 self-assigned this Jul 10, 2025

greptile-apps bot reviewed Jul 10, 2025


ellipsis-dev bot reviewed Jul 10, 2025


dinmukhamedm reviewed Jul 10, 2025


src/client/resources/evaluators.ts (resolved)

test/evaluators-client.test.ts (outdated, resolved)

feat: update tests, endpoints, enum

1150b4f

ellipsis-dev bot reviewed Jul 10, 2025


feat: refactor + linter

527138c

ellipsis-dev bot reviewed Jul 10, 2025


feat: update endpoint

5d42e6d

ellipsis-dev bot reviewed Jul 10, 2025


Merge branch 'main' into feat/LAM-719

8f37f27

olzhik11 merged commit 211a9ff into main Jul 11, 2025
3 checks passed

dinmukhamedm deleted the feat/LAM-719 branch July 28, 2025 12:07
