Make similar buckets for api and etcd request duration histogram #94134

tkashem · 2020-08-20T17:05:39Z

What type of PR is this?
/kind bug

What this PR does / why we need it:
etcd_request_duration_seconds uses the default buckets provided by prometheus client library.
DefBuckets = []float64{.005, .01, .025, .05, .1, .25, .5, 1, 2.5, 5, 10}
The maximum bucket size is 10s. On the other hand, apiserver_request_duration_seconds uses more fine grained bucket sizes and the maximum bucket size is 60s

The left panel shows latency for Deployment-DELETE api (metric=apiserver_request_duration_seconds), this is taking about 40s to complete. On the other hand, etcd latency (metric=etcd_request_duration_seconds) for the same object apps.Deployment-delete is capped at 10s. Now the difference in latency is hard to account for. It cloud be latency from ectd but we can't answer this question by looking at the metrics.

If the etcd metric has similar bucket sizes, we could account for the difference in latency.

This PR makes the bucket sizes for both metrics similar. Also, no existing bucket for etcd_request_duration_seconds was dropped.

Does this PR introduce a user-facing change?:

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

NONE

Make similar buckets for the apiserver_request_duration_seconds and the etcd_request_duration_seconds histogram so that the result is more comparable side by side. etcd_request_duration_seconds uses the default buckets provided by prometheus client library: DefBuckets = []float64{.005, .01, .025, .05, .1, .25, .5, 1, 2.5, 5, 10} apiserver_request_duration_seconds on the other hand uses more fine grained buckets, and the maximum bucket size is 60s. Both histograms should use similar bucket sizes so they are more comparable side by side.

k8s-ci-robot · 2020-08-20T17:05:48Z

Hi @tkashem. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

tkashem · 2020-08-20T17:15:21Z

/assign @brancz

tkashem · 2020-08-20T17:23:27Z

/assign @wojtek-t

fedebongio · 2020-08-20T20:06:46Z

/assign @logicalhan

hexfusion · 2020-08-20T20:42:50Z

+1 this makes a lot of sense to me, thanks for doing this @tkashem

logicalhan

/lgtm

/cc @brancz
(since we're changing buckets)

wojtek-t · 2020-08-24T07:55:56Z

I'm fine with it, but I will wait with approving for @brancz comment.

/assign @brancz

sttts · 2020-08-27T09:56:44Z

/ok-to-test

s-urbaniak · 2020-08-27T10:02:06Z

hey 👋 afaik @brancz is currently still vacationing :-)

metalmatze · 2020-08-27T10:25:42Z

I like this a lot. We have latencies on our dashboards for the API server and etcd too but didn't try to correlate those just yet. If we can make this change, this will become a lot easier indeed. 💯

squat · 2020-08-27T10:29:07Z

Looks great! This would be very handy (:

s-urbaniak · 2020-08-27T10:30:03Z

/approve

also from my side 👍 (not sure if that approval works or not)

brancz · 2020-09-02T13:17:36Z

Looks good from instrumentation side.

/lgtm

brancz · 2020-09-02T13:18:42Z

/assign @lavalamp

wojtek-t · 2020-09-02T13:31:57Z

/approve

k8s-ci-robot · 2020-09-02T13:32:31Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: s-urbaniak, tkashem, wojtek-t

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~staging/src/k8s.io/apiserver/pkg/storage/OWNERS~~ [wojtek-t]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

lavalamp · 2020-09-02T18:41:22Z

/lgtm
/milestone v1.20

k8s-ci-robot requested review from hongchaodeng and wojtek-t August 20, 2020 17:06

k8s-ci-robot assigned logicalhan Aug 20, 2020

logicalhan reviewed Aug 20, 2020

View reviewed changes

k8s-ci-robot requested a review from brancz August 20, 2020 20:49

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 20, 2020

k8s-ci-robot assigned brancz Aug 24, 2020

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels Aug 27, 2020

k8s-ci-robot assigned lavalamp Sep 2, 2020

wojtek-t self-assigned this Sep 2, 2020

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 2, 2020

k8s-ci-robot added this to the v1.20 milestone Sep 2, 2020

k8s-ci-robot merged commit ab3ed8c into kubernetes:master Sep 2, 2020

tkashem mentioned this pull request Sep 28, 2020

REQUEST: New membership for tkashem kubernetes/org#2229

Closed

6 tasks

This was referenced Sep 15, 2021

[release-4.6] Rebase 1.19.14 openshift/kubernetes#960

Closed

[release-4.6] Bug 2008266: Rebase 1.19.14 openshift/kubernetes#962

Merged

tkashem mentioned this pull request Dec 15, 2021

Update etcdRequestLatency metrics bucket size #107042

Merged

Make similar buckets for api and etcd request duration histogram #94134

Make similar buckets for api and etcd request duration histogram #94134

Uh oh!

Conversation

tkashem commented Aug 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

k8s-ci-robot commented Aug 20, 2020

Uh oh!

tkashem commented Aug 20, 2020

Uh oh!

tkashem commented Aug 20, 2020

Uh oh!

fedebongio commented Aug 20, 2020

Uh oh!

hexfusion commented Aug 20, 2020

Uh oh!

logicalhan left a comment

Choose a reason for hiding this comment

Uh oh!

wojtek-t commented Aug 24, 2020

Uh oh!

sttts commented Aug 27, 2020

Uh oh!

s-urbaniak commented Aug 27, 2020

Uh oh!

metalmatze commented Aug 27, 2020

Uh oh!

squat commented Aug 27, 2020

Uh oh!

s-urbaniak commented Aug 27, 2020

Uh oh!

brancz commented Sep 2, 2020

Uh oh!

brancz commented Sep 2, 2020

Uh oh!

wojtek-t commented Sep 2, 2020

Uh oh!

k8s-ci-robot commented Sep 2, 2020

Uh oh!

lavalamp commented Sep 2, 2020

Uh oh!

Uh oh!

tkashem commented Aug 20, 2020 •

edited

Loading