Replicate generic hardware events on all CPU PMUs #2123

pcc · 2026-02-06T22:30:41Z

On systems with more than one PMU for the CPUs (e.g. Apple M series SOCs), generic hardware events are only created for an arbitrary PMU. Usually this is the big cluster's PMU, which can cause inaccuracies when the process is scheduled onto a little core. To fix this, teach PerfCounters to register generic hardware events on all CPU PMUs.

CPU PMUs are identified using the same method as perf.

pcc · 2026-02-06T22:31:24Z

cc @captain5050

src/perf_counters.cc

LebedevRI · 2026-02-07T01:04:40Z

Does this need any kind of guarding or is this always the right thing to do?

pcc · 2026-02-07T01:38:41Z

Does this need any kind of guarding or is this always the right thing to do?

This is always correct as far as I'm aware. I've tested that it does the right thing on an M2 Ultra Mac Studio, a Pixel 10 and a regular x86 PC (where it just finds the single PMU which returns the same results as not specifying a PMU).

LebedevRI

Didn't test, but seems fine.

dmah42 · 2026-02-09T11:03:49Z

can we add google test unit tests for this please? i don't know how tricky it is to test it but i'm a little concerned having no tests at all.

pcc · 2026-02-09T23:05:55Z

can we add google test unit tests for this please? i don't know how tricky it is to test it but i'm a little concerned having no tests at all.

We can test it by verifying that counters are non-zero when the process is pinned to each CPU in the system. We can adapt the test PerfCountersTest.Read1Counter for this purpose. Actually, on my Mac Studio that test fails at head (after patching BCR to fix the build failures with --define pfm=1: bazelbuild/bazel-central-registry#7502 bazelbuild/bazel-central-registry#7503) and it also fails after this change (because it didn't expect more than one counter with the same name). I'll take care of that as well.

pcc · 2026-02-10T04:09:40Z

can we add google test unit tests for this please? i don't know how tricky it is to test it but i'm a little concerned having no tests at all.

We can test it by verifying that counters are non-zero when the process is pinned to each CPU in the system. We can adapt the test PerfCountersTest.Read1Counter for this purpose. Actually, on my Mac Studio that test fails at head (after patching BCR to fix the build failures with --define pfm=1: bazelbuild/bazel-central-registry#7502 bazelbuild/bazel-central-registry#7503) and it also fails after this change (because it didn't expect more than one counter with the same name). I'll take care of that as well.

Done

dmah42 · 2026-02-10T09:47:25Z

looks like we need to update this from head and do some clang-format fun.

On systems with more than one PMU for the CPUs (e.g. Apple M series SOCs), generic hardware events are only created for an arbitrary PMU. Usually this is the big cluster's PMU, which can cause inaccuracies when the process is scheduled onto a little core. To fix this, teach PerfCounters to register generic hardware events on all CPU PMUs. CPU PMUs are identified using the same method as perf.

pcc · 2026-02-10T20:35:24Z

looks like we need to update this from head and do some clang-format fun.

Done

dmah42 · 2026-02-11T10:07:23Z

thank you so much :)

LebedevRI reviewed Feb 6, 2026

View reviewed changes

src/perf_counters.cc Outdated Show resolved Hide resolved

captain5050 approved these changes Feb 6, 2026

View reviewed changes

src/perf_counters.cc Outdated Show resolved Hide resolved

pcc force-pushed the cpu-pmu branch from 28cd043 to 2ab3d17 Compare February 7, 2026 00:34

LebedevRI reviewed Feb 7, 2026

View reviewed changes

src/perf_counters.cc Outdated Show resolved Hide resolved

pcc force-pushed the cpu-pmu branch from 2ab3d17 to 644e105 Compare February 7, 2026 01:37

pcc force-pushed the cpu-pmu branch from 644e105 to e4f34f8 Compare February 7, 2026 01:40

LebedevRI previously approved these changes Feb 7, 2026

View reviewed changes

LebedevRI requested a review from dmah42 February 7, 2026 01:44

pcc dismissed LebedevRI’s stale review via 42647f1 February 10, 2026 04:08

pcc force-pushed the cpu-pmu branch from 6059090 to 42647f1 Compare February 10, 2026 04:08

dmah42 previously approved these changes Feb 10, 2026

View reviewed changes

pcc dismissed dmah42’s stale review via 43eaa2e February 10, 2026 20:35

pcc force-pushed the cpu-pmu branch from 42647f1 to 43eaa2e Compare February 10, 2026 20:35

dmah42 approved these changes Feb 11, 2026

View reviewed changes

dmah42 merged commit 84732c8 into google:main Feb 11, 2026
82 of 84 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replicate generic hardware events on all CPU PMUs #2123

Replicate generic hardware events on all CPU PMUs #2123

pcc commented Feb 6, 2026

Uh oh!

pcc commented Feb 6, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LebedevRI commented Feb 7, 2026

Uh oh!

pcc commented Feb 7, 2026

Uh oh!

LebedevRI left a comment

Uh oh!

dmah42 commented Feb 9, 2026

Uh oh!

pcc commented Feb 9, 2026

Uh oh!

pcc commented Feb 10, 2026

Uh oh!

dmah42 commented Feb 10, 2026

Uh oh!

pcc commented Feb 10, 2026

Uh oh!

Uh oh!

dmah42 commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Replicate generic hardware events on all CPU PMUs #2123

Replicate generic hardware events on all CPU PMUs #2123

Conversation

pcc commented Feb 6, 2026

Uh oh!

pcc commented Feb 6, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LebedevRI commented Feb 7, 2026

Uh oh!

pcc commented Feb 7, 2026

Uh oh!

LebedevRI left a comment

Choose a reason for hiding this comment

Uh oh!

dmah42 commented Feb 9, 2026

Uh oh!

pcc commented Feb 9, 2026

Uh oh!

pcc commented Feb 10, 2026

Uh oh!

dmah42 commented Feb 10, 2026

Uh oh!

pcc commented Feb 10, 2026

Uh oh!

Uh oh!

dmah42 commented Feb 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants