Track on CPU events too #74

ants · 2024-06-12T14:04:01Z

To not count dead backends as still running on the CPU we need to detect that the backend is dead. Starting from PG17 proc->pid is reset in ProcKill, before this we can check if the process latch is disowned. Not nice to be poking around in latch internals like this, but all alternatives seem to involve scanning bestatus array and correlating pids.

Verified that the latch disown mechanism works on at least PostgreSQL 12-16.

Also makes sense to exclude ourselves as we will always be on CPU while looking at wait events.

Resolves #10

To not count dead backends as still running on the CPU we need to detect that the backend is dead. Starting from PG17 proc->pid is reset in ProcKill, before this we can check if the process latch is disowned. Not nice to be poking around in latch internals like this, but all alternatives seem to involve scanning bestatus array and correlating pids. Also makes sense to exclude ourselves as we will always be on CPU while looking at wait events.

ants · 2024-06-12T14:06:11Z

Should I add a GUC to turn this functionality on and off?

shinderuk · 2024-06-13T12:51:30Z

Thanks for working on this! I did some tests with pgbench and the results look good. Yes, I think we need a GUC to turn this on. Also the pg_wait_sampling_current view needs to be patched similarly.

Defaults to false meaning previous behavior is retained. Update pg_wait_sampling_current view to respect this flag.

ants · 2024-06-14T15:15:54Z

Added a sample_cpu GUC and updated the pg_wait_sampling_current view. GUC defaults to false for backwards compatibility. What is the preference on this? I think most people would want to see the events, so maybe it should default to true?

shinderuk

Thank you. Looks good, except for a couple minor things.

Why not turn sample_cpu = on by default? Mainly for backward compatibility. I'm afraid that blank event_type in the profile would puzzle unprepared users or break UI. Also, turning it on could overflow the history buffer with new rows requiring adjustment of history_size. Anyway, I'm not sure which default is more useful for a regular user. Does it make sense?

README.md

collector.c

pg_wait_sampling.c

README.md

egor-rogov · 2024-06-17T17:25:36Z

Thank you guys!

shinderuk self-requested a review June 13, 2024 07:32

shinderuk assigned ants Jun 14, 2024

Add a GUC for controlling whether on CPU events are counted

b156883

Defaults to false meaning previous behavior is retained. Update pg_wait_sampling_current view to respect this flag.

shinderuk requested changes Jun 14, 2024

View reviewed changes

README.md Outdated Show resolved Hide resolved

collector.c Outdated Show resolved Hide resolved

pg_wait_sampling.c Outdated Show resolved Hide resolved

README.md Outdated Show resolved Hide resolved

ants added 5 commits June 17, 2024 14:54

Fix typo in README

0ce12dc

Add comment justifying peeking into procLatch

f22906e

Make pg_wait_sampling_get_current() mirror probe_waits()

ceaafb8

Factor decision to sample into a separate function

567d0c9

Turn on sampling of on CPU events by default

fbe3e7f

shinderuk merged commit 12c0f7d into postgrespro:master Jun 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Track on CPU events too #74

Track on CPU events too #74

Uh oh!

ants commented Jun 12, 2024

Uh oh!

ants commented Jun 12, 2024

Uh oh!

shinderuk commented Jun 13, 2024

Uh oh!

ants commented Jun 14, 2024

Uh oh!

shinderuk left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

egor-rogov commented Jun 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Track on CPU events too #74

Track on CPU events too #74

Uh oh!

Conversation

ants commented Jun 12, 2024

Uh oh!

ants commented Jun 12, 2024

Uh oh!

shinderuk commented Jun 13, 2024

Uh oh!

ants commented Jun 14, 2024

Uh oh!

shinderuk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

egor-rogov commented Jun 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants