
Conversation

@nishant111
Contributor

Fix frr scheduling loop to not lose any scheduling precision in ppoll

fd_poll does not honor a timer_wait of less than 1000 usec, which causes ppoll to spin until timer_wait ticks down to 0, causing high CPU usage.
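For illustration, a minimal sketch of the kind of truncation being described, assuming an fd_poll-style helper that receives the wait as a struct timeval (the names and shape are assumptions, not the literal lib/event.c code):

```c
/*
 * Sketch of the problem: the wait is first truncated to whole
 * milliseconds, so any timer_wait below 1000 usec becomes a 0 timeout
 * and the event loop spins until the timer actually expires.
 */
#include <poll.h>
#include <stddef.h>
#include <sys/time.h>

static int fd_poll_sketch(struct pollfd *pfds, nfds_t nfds,
			  const struct timeval *timer_wait)
{
	int timeout = -1;

	if (timer_wait != NULL)
		/* e.g. 0 sec + 500 usec -> timeout == 0: return immediately */
		timeout = (timer_wait->tv_sec * 1000)
			  + (timer_wait->tv_usec / 1000);

	/* In the ppoll() path the timespec is presumably rebuilt from this
	 * same truncated millisecond value, hence the same precision loss. */
	return poll(pfds, nfds, timeout);
}
```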

@nishant111 nishant111 force-pushed the nishant/frrEventSchedulingFix branch 3 times, most recently from 5b82a10 to 5e68313 on October 9, 2025 15:48
@nishant111 nishant111 changed the title from "lib: Fix frr scheduling loop to not lose any scheduling precision in ppoll" to "lib: Set the correct timeout attribute in ppoll" on Oct 9, 2025
@eqvinox
Contributor

eqvinox commented Oct 14, 2025

I'm in favor of this (consider it a bugfix really), and think we should defer on trying to be clever (e.g. introduce some minimum sleep time like #19598 does.) The entire thing here is probably a holdover from using poll(), which only has millisecond precision, and someone might've gone "oh but I need this 0.5ms timer to work, I'll just busy loop".

For reference, context switches on modern OSes take single digit µs times, cf. https://eli.thegreenplace.net/2018/measuring-context-switching-and-memory-overheads-for-linux-threads/ (might be even less 7 years later). And note that when calling ppoll(), we're done doing things and likely won't have anything valuable in CPU caches. With that in mind, it feels entirely reasonable to pass small timeouts into ppoll(). The kernel people aren't stupid either, there's probably some minimum timeout below which it will just spin in the kernel (or rather, "short-sleep" the CPU core, rather than spin) and not task switch.

While there certainly could be gains had from doing some accumulation/batching of timers, there can also be unforeseen effects, even to the network level in terms of microbursts. (Unlikely, but possible.) Without data showing us any benefit of that, let's just stick to the simple.

@mjstapp
Contributor

mjstapp commented Oct 14, 2025

I guess I don't think the FRR issue is with context-switching, or CPU caches, or sub-microsecond packet-processing in a dataplane somewhere. I think we have two use-cases: control-plane timers, and BFD.

The control-plane protocols (and some of our own internal components/IPCs/etc.) use timers. Those timers are usually at second scale - there's no value at all in nanosecond precision for those timers.

BFD is sort of a special case, because we seem to be under some pressure to support quite tight timers for BFD in user-space, and at some scale in terms of number of peers/sessions. Those timers are at tens-of-milliseconds scale or thereabouts. Even there, we have only seen that those kinds of timers hit a ceiling when the system is busy.

But what we've been doing is not "simple": the library code does extra work to sort the timer list at sub-millisecond precision, and we try very hard to distinguish between multiple timers that are scheduled microseconds apart. As Donald has shown, that only wastes CPU time without benefitting network stability at all. So my preference would be to be "simple": we make it clear that our APIs support millisecond resolution, for example, and we plumb that through the various layers so we aren't doing extra work to support a precision that we can't hope to reliably achieve.


@nishant111 nishant111 force-pushed the nishant/frrEventSchedulingFix branch from 5e68313 to a4a34a3 on October 17, 2025 11:49
@github-actions github-actions bot added the size/S and rebase (PR needs rebase) labels and removed the size/M label on Oct 17, 2025
@nishant111
Contributor Author

As discussed in the community meeting, I have not removed the selectpoll_timeout code. I have just recalculated the correct tsp for ppoll inside "#if defined HAVE_PPOLL"; poll() continues to use the old timeout calculation.

@nishant111 nishant111 force-pushed the nishant/frrEventSchedulingFix branch from a4a34a3 to 820e1f8 on October 23, 2025 07:07
Contributor

@mjstapp mjstapp left a comment


Thanks, that looks clearer to me

Member

@donaldsharp donaldsharp left a comment


LGTM

@nishant111 nishant111 force-pushed the nishant/frrEventSchedulingFix branch from 820e1f8 to 39fb939 on October 28, 2025 05:57
@github-actions github-actions bot added size/M and removed size/S labels Oct 28, 2025
@nishant111 nishant111 requested a review from choppsv1 October 28, 2025 09:04
fd_poll does not honor timer_wait less than 1000 usec, which causes
ppoll to spin until timer_wait ticks down to 0, causing high CPU.
Also, set a 1 msec floor for poll() so that poll() does not spin
either for 0 < tv_usec < 1000.
Also move the timeout-related if-else ladder into the #else (poll)
compilation path to avoid the clang SA dead-code warning.

Signed-off-by: Nishant <[email protected]>
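For reference, a hedged sketch of the shape the commit message describes: a full-precision timespec for ppoll(), with the old millisecond ladder plus a 1 msec floor kept inside the #else branch for poll(). The function signature, HAVE_PPOLL usage, and surrounding details are assumptions for illustration, not the actual diff.

```c
#define _GNU_SOURCE /* for ppoll() on glibc */
#include <poll.h>
#include <signal.h>
#include <stddef.h>
#include <sys/time.h>
#include <time.h>

static int fd_poll_sketch(struct pollfd *pfds, nfds_t nfds,
			  const struct timeval *timer_wait,
			  const sigset_t *sigmask)
{
#if defined(HAVE_PPOLL)
	struct timespec ts, *tsp = NULL;

	if (timer_wait != NULL) {
		/* Build the timespec straight from timer_wait so ppoll()
		 * keeps full sub-millisecond precision. */
		ts.tv_sec = timer_wait->tv_sec;
		ts.tv_nsec = timer_wait->tv_usec * 1000;
		tsp = &ts;
	}
	return ppoll(pfds, nfds, tsp, sigmask);
#else
	/* poll() only understands milliseconds, so the old if-else ladder
	 * lives in this branch only (keeping it out of the ppoll build
	 * avoids the dead-code warning), with a 1 msec floor so that
	 * 0 < tv_usec < 1000 cannot turn into a spinning zero timeout. */
	int timeout = -1;

	if (timer_wait != NULL) {
		timeout = (timer_wait->tv_sec * 1000)
			  + (timer_wait->tv_usec / 1000);
		if (timeout == 0 && timer_wait->tv_usec > 0)
			timeout = 1;
	}
	(void)sigmask; /* plain poll() cannot apply a signal mask */
	return poll(pfds, nfds, timeout);
#endif
}
```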
@nishant111 nishant111 force-pushed the nishant/frrEventSchedulingFix branch from 39fb939 to 85e459f on October 28, 2025 09:46
Contributor

@choppsv1 choppsv1 left a comment


LGTM

@choppsv1 choppsv1 merged commit cff2068 into FRRouting:master Oct 28, 2025
10 checks passed