[FIX] Add retry if first dashboard retrieval fails #9

lmarques03 · 2022-12-23T16:33:33Z

Occasionally when retrieving the dashboard information, an error would occur with the message:
"msg="status: 404, body: {\"message\":\"Dashboard not found\"}"

After some investigation it was concluded that the error occurred when the request to Grafana API coincided with the restart of Grafana pods.
This PR adds a one-time retry if the GET request fails.

nicolastakashi · 2022-12-28T16:20:27Z

Hey, @lmarques03 thanks for the contribution!
Can you elaborate a little bit more about this issue, looking at the description that you provided, seems to me that the dashboard doesn't exist, could it be an issue with the dashboard sync instead of the Grafana API?

lmarques03 · 2022-12-30T10:17:57Z

Hello, @nicolastakashi.
It certainly could be, however from our side, every time the error was triggered it coincided with the restart of Grafana pods. As such, the error is only triggered very occasionally, which indicates that most of the times the dashboard sync is done correctly. Also, the error does not persist, and on the next API call everything works fine, and the dashboard is found.

Apart from all of this, I was not able to replicate the error in my local environment, even when importing the same dashboards as in prd.

nicolastakashi · 2023-01-04T09:43:39Z

Hey, @lmarques03 thanks for the clarification.
Well since it's a transient issue in my view we can just check the error code and don't treat 404 as an error, this is a recurrent task, the next time the dashboards will be available and everything is ok.

WDT?

lmarques03 · 2023-01-04T13:26:02Z

@nicolastakashi That sounds like a good idea. Would you prefer to mantain the retry (and do it only when the error is 404), or simply ignore the error altogether, and wait for the next iteration?

nicolastakashi · 2023-01-10T08:32:57Z

Hi @lmarques03 I would say we can just handle the 404 as an warn instead of an error, because the retry is out of the box implemented by the pooling interval, wdyt?

lmarques03 · 2023-01-12T10:36:57Z

Hello @nicolastakashi. I changed the code to handle 404 as a warning. It should be fine now. Please review and let me know what you think.

nicolastakashi

Since you're using continue you don't need the else on the if statement

lmarques03 · 2023-01-25T10:49:08Z

Yeah that was dumb. I fixed it now.
If the dashboard retrieval results in an error, then it always continues to the next iteration. But it does not consider the 404 response as an error.

internal/grafana/client.go

lmarques03 added 2 commits December 23, 2022 16:23

[FIX] Add retry if first dashboard retrieval fails

fb9ed6e

[FIX] Format code

a49bc3e

lmarques03 added 2 commits January 12, 2023 10:34

[FEAT] Added warning if dashboard retrieval returns 404

fc6f33b

[FIX] Small code fix

041cde0

lmarques03 added 2 commits January 12, 2023 10:52

[FIX] Small fix in warning handling

66bf153

[FIX] Small fix in error handling

600b23c

nicolastakashi reviewed Jan 24, 2023

View reviewed changes

[FIX] Small fix to error handling

e2ef90d

nicolastakashi requested changes Jan 25, 2023

View reviewed changes

internal/grafana/client.go Show resolved Hide resolved

[FIX] Final fix to error handling

4bef673

nicolastakashi approved these changes Jan 27, 2023

View reviewed changes

nicolastakashi merged commit 0f7ea5b into nicolastakashi:main Jan 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FIX] Add retry if first dashboard retrieval fails #9

[FIX] Add retry if first dashboard retrieval fails #9

Uh oh!

lmarques03 commented Dec 23, 2022

Uh oh!

nicolastakashi commented Dec 28, 2022

Uh oh!

lmarques03 commented Dec 30, 2022

Uh oh!

nicolastakashi commented Jan 4, 2023

Uh oh!

lmarques03 commented Jan 4, 2023

Uh oh!

nicolastakashi commented Jan 10, 2023

Uh oh!

lmarques03 commented Jan 12, 2023

Uh oh!

nicolastakashi left a comment •

edited

Loading

Uh oh!

lmarques03 commented Jan 25, 2023

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[FIX] Add retry if first dashboard retrieval fails #9

[FIX] Add retry if first dashboard retrieval fails #9

Uh oh!

Conversation

lmarques03 commented Dec 23, 2022

Uh oh!

nicolastakashi commented Dec 28, 2022

Uh oh!

lmarques03 commented Dec 30, 2022

Uh oh!

nicolastakashi commented Jan 4, 2023

Uh oh!

lmarques03 commented Jan 4, 2023

Uh oh!

nicolastakashi commented Jan 10, 2023

Uh oh!

lmarques03 commented Jan 12, 2023

Uh oh!

nicolastakashi left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lmarques03 commented Jan 25, 2023

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

nicolastakashi left a comment •

edited

Loading