ames: consolidate dead flows to a single behn timer #6738
Conversation
This is a nice, simple approach, and should dramatically reduce the number of retry timers we have to maintain (and therefore the events logged for retries). At some point, I think we should develop "offline" heuristics for a peer and back off even further.
A couple of things:
```hoon
=^  moz  u.cached-state
?.  ?=(%15 -.u.cached-state)  [~ u.cached-state]
~>  %slog.0^leaf/"ames: init dead flow consolidation timer"
:-  [[/ames]~ %pass /dead-flow %b %wait `@da`(add now ~m2)]~
```
Do we need to duplicate this timer initialization somewhere else to catch fresh boot?
I see no better way of doing this than state and +on-born. See a75a083 for how I decided to implement it. Note that the recork timer a few lines above this suffers from the same problem: it never gets initialized on new ships.
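To make the consolidation idea concrete, here is a minimal Python sketch (not the actual Hoon) of the single-timer approach discussed above: instead of one behn timer per dead flow, one periodic timer sweeps every flow whose retransmit timeout has reached the dead-flow interval. All names and data shapes here are illustrative, not Ames internals.

```python
# Hypothetical sketch of dead-flow consolidation: one timer fires every
# two minutes and wakes every flow that has backed off to that interval,
# instead of each flow holding its own timer.
from dataclasses import dataclass, field

DEAD_FLOW_INTERVAL = 120  # seconds, i.e. ~m2

@dataclass
class Flow:
    bone: int
    rto: int  # current retransmit timeout, in seconds

@dataclass
class Peer:
    flows: list[Flow] = field(default_factory=list)

def on_dead_flow_timer(peers: list[Peer]) -> list[int]:
    """One timer event: retry every flow already in the dead (~m2) regime."""
    woken = []
    for peer in peers:
        for flow in peer.flows:
            if flow.rto == DEAD_FLOW_INTERVAL:  # flow is "dead"
                woken.append(flow.bone)         # re-send its oldest packet
    return woken

peers = [Peer([Flow(1, 120), Flow(2, 30)]), Peer([Flow(3, 120)])]
print(on_dead_flow_timer(peers))  # [1, 3]
```

Flows still backing off (rto below the cap, like bone 2 above) keep their own timers; only flows that have already hit the two-minute ceiling are swept by the shared timer.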
```hoon
::  set new timer if non-null and not at max-backoff
::
=?  peer-core  ?=(^ new-wake)
  ?:  =(~m2 rto.metrics.state)
```
Worth noting here (and above) that the ~m2 literal is important: we don't want to actually use max-backoff and accidentally consolidate the :ping app flow.
Added a comment in a3e7595
Force-pushed from fb47490 to b7354eb.
pkg/arvo/sys/vane/ames.hoon
Outdated
```hoon
|=  [[=ship =ship-state] core=_event-core]
^+  event-core
=/  peer-state=(unit peer-state)  (get-peer-state:core ship)
?~  peer-state  core
%-  ~(rep by snd.u.peer-state)
|=  [[=bone =message-pump-state] cor=_core]
?.  =(~m2 rto.metrics.packet-pump-state.message-pump-state)
  cor
abet:(on-wake:(abed-peer:pe:cor ship u.peer-state) bone error)
```
Style nit: there are a couple extraneous layers of indentation here.
Fixed
Force-pushed from 2178c75 to 27fe522.
Force-pushed from a3e7595 to 82d4e2a.
this will do
As discussed with @yosoyubik and @joemfb out of band.
I tested this with 50,000 dead flows. Without consolidation, these flows resulted in a constant 30% CPU usage. Consolidating the timers reduced CPU usage to almost zero, with a 100% spike for a few seconds every two minutes.
The retry interval is the normal ~m2; we can make it configurable later.
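The event-rate arithmetic behind those numbers can be checked directly. With 50,000 dead flows each holding its own two-minute timer, behn fires hundreds of events per second on average; a single consolidated timer fires once per interval (the spike in the test above is that one sweep visiting every flow at once):

```python
flows = 50_000
interval_s = 120  # ~m2

per_flow_events_per_sec = flows / interval_s
consolidated_events_per_sec = 1 / interval_s

print(per_flow_events_per_sec)      # 416.6666666666667 timer fires/second
print(consolidated_events_per_sec)  # one fire every two minutes
```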