Migrate getEvaluationStatistics #5346
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
/internal/evaluations/statisticsthat returns aggregated statistics (mean, confidence intervals) for specified evaluation runs and metricsImportant
Adds
/internal/evaluations/statisticsendpoint to fetch evaluation statistics with mean and confidence intervals for specified runs and metrics, updating server and client-side code accordingly./internal/evaluations/statisticsendpoint ininternal.rsto return aggregated statistics (mean, confidence intervals) for specified evaluation runs and metrics.get_evaluation_statistics_handleringet_statistics.rsto handle requests to the new endpoint.TensorZeroClientintensorzero.tsto includegetEvaluationStatistics()method for fetching statistics.get_evaluation_statisticsmethod toEvaluationQueriestrait inevaluation_queries.rs.evaluation_queries.rsto fetch raw statistics and compute confidence intervals.EvaluationStatisticsRowstruct inevaluation_queries.rsfor deserializing statistics data.EvaluationStatisticsandGetEvaluationStatisticsResponsetypes inEvaluationStatistics.tsandGetEvaluationStatisticsResponse.ts.index.ts.get_evaluation_statisticsinevaluation_queries.rs.evaluations.rsandevaluation_queries.rs.This description was created by
for 9120b40. You can customize this summary. It will automatically update as commits are pushed.