Factor: base benchmarking for single/multiple u64, u128, and >u128 #9182

asder8215 · 2025-11-07T22:43:25Z

This PR adds benchmarks for the factor command that measure how long uutils' factor takes to compute the prime factors of u64, u128, and >u128 values. They should also serve as a baseline for checking whether future changes to src/factors.rs improve performance.

src/uu/factor/src/factor.rs

sylvestre · 2025-11-07T23:12:41Z

please add it in the list here: .github/workflows/benchmarks.yml

github-actions · 2025-11-07T23:17:30Z

GNU testsuite comparison:

Skipping an intermittent issue tests/tail/overlay-headers (passes in this run but fails in the 'main' branch)

asder8215 · 2025-11-08T02:04:11Z

Added to the list!

github-actions · 2025-11-08T02:22:54Z

GNU testsuite comparison:

Skip an intermittent issue tests/misc/tee (fails in this run but passes in the 'main' branch)

codspeed-hq · 2025-11-08T03:42:23Z

CodSpeed Performance Report

Merging #9182 will not alter performance

Comparing asder8215:factor_benchmarking (d977e29) with main (1074071)

Summary

✅ 123 untouched
🆕 3 new
⏩ 2 skipped¹

Benchmarks breakdown

| | Benchmark | `BASE` | `HEAD` | Change |
|---|---|---|---|---|
| 🆕 | `factor_multiple_big_uint` | N/A | 16.2 ms | N/A |
| 🆕 | `factor_multiple_u128s[18446744073709551616]` | N/A | 330.2 ms | N/A |
| 🆕 | `factor_multiple_u64s[2]` | N/A | 184.5 ms | N/A |

¹ 2 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, archive them to remove them from the performance reports.

github-actions · 2025-11-08T04:02:38Z

GNU testsuite comparison:

Skipping an intermittent issue tests/misc/tee (passes in this run but fails in the 'main' branch)

asder8215 · 2025-11-08T04:18:51Z

I didn't expect it to take this long to run the benchmark. I think I'll reduce the range of numbers to iterate through for multiple_big_uint.

github-actions · 2025-11-08T04:41:32Z

GNU testsuite comparison:

Skipping an intermittent issue tests/misc/tee (passes in this run but fails in the 'main' branch)
Skipping an intermittent issue tests/tail/overlay-headers (passes in this run but fails in the 'main' branch)

github-actions · 2025-11-08T05:35:00Z

GNU testsuite comparison:

Skipping an intermittent issue tests/misc/tee (passes in this run but fails in the 'main' branch)

github-actions · 2025-11-08T07:40:46Z

GNU testsuite comparison:

Skipping an intermittent issue tests/tail/overlay-headers (passes in this run but fails in the 'main' branch)

sylvestre · 2025-11-08T08:10:02Z

please change the input sizes.
#9182 (comment)
benchmarks should be around 100 to 400 ms

src/uu/factor/benches/factor_bench.rs

sylvestre · 2025-11-08T08:11:12Z

also, 6 benchmarks seem a bit big, can we have 2 or 3 instead? thanks

asder8215 · 2025-11-08T18:10:50Z

I removed the single benchmark tests and kept the multiple u64/u128/BigUint benchmark tests (with a smaller range of numbers to factorize), since those cases make it easier to notice any improvement in the factor command. In total, those 3 benchmark tests take about 350-400 ms when run locally.
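For anyone tuning the input range against the 100 to 400 ms target mentioned above, a rough wall-clock check can be sketched with the standard library alone. This is not the divan benchmark from this PR: `trial_factor` and `time_range` are hypothetical stand-ins (plain trial division, no num_prime), so the absolute numbers will differ from what `factor` itself produces.

```rust
use std::time::{Duration, Instant};

// Hypothetical stand-in for the real factorization: plain trial division.
fn trial_factor(mut n: u64) -> Vec<u64> {
    let mut out = Vec::new();
    let mut p = 2u64;
    // `p <= n / p` is the overflow-safe form of `p * p <= n`.
    while p <= n / p {
        while n % p == 0 {
            out.push(p);
            n /= p;
        }
        p += 1;
    }
    if n > 1 {
        out.push(n);
    }
    out
}

// Factor `count` consecutive numbers starting at `start`; return the total
// number of prime factors found and the elapsed wall-clock time.
fn time_range(start: u64, count: u64) -> (usize, Duration) {
    let t0 = Instant::now();
    let mut total = 0;
    for n in start..start + count {
        // black_box keeps the optimizer from discarding the work.
        total += std::hint::black_box(trial_factor(n)).len();
    }
    (total, t0.elapsed())
}

fn main() {
    // Same shape as the u64 case discussed in this thread: the numbers 2 through 2502.
    let (total, elapsed) = time_range(2, 2501);
    println!("{total} prime factors in {elapsed:?}");
    if !(100..=400).contains(&elapsed.as_millis()) {
        println!("outside the 100-400 ms window; adjust the range");
    }
}
```

A single wall-clock sample like this only gives an order of magnitude; the benchmark harness remains the reference for the reported numbers.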

github-actions · 2025-11-08T18:30:57Z

GNU testsuite comparison:

Skipping an intermittent issue tests/misc/tee (passes in this run but fails in the 'main' branch)

sylvestre · 2025-11-08T18:40:52Z

thanks

sylvestre · 2025-11-08T23:33:55Z

seems that it is quite an unstable bench
#9174 (comment)
#9198 (comment)
etc
could you please have a look? thanks

asder8215 · 2025-11-09T01:27:03Z

I took a closer look at the num_prime crate source code, and there is some randomization in factorize128() and factors(). (The same goes for factorize64(), but the docs describe the primality check for u64s as faster and deterministic.) This might be why the factor benchmarks are a bit unstable, especially with the small range of numbers I'm using for u128/>u128 integers.

In fact, now that I look at it, factors() calls factorize128() if the number is within the u128 range, and that function in turn calls factorize64() if the number is within the u64 range. Before the change in #9171, I think the overhead of calling factorize128() and then factorize64() from factors() made a small difference for the u64 case (which adds up as you increase the range of values piped to the factor command via seq).
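The idea of picking the narrowest integer width up front, instead of entering through the widest function and stepping down, can be sketched as follows. This is an illustration only, not the code from #9171 or from num_prime: the `Tier` enum and `tier_for` function are hypothetical names, and the sketch only classifies the input without factoring it.

```rust
// Hypothetical tiers; in the real code each would map to a different
// factorization entry point.
#[derive(Debug, PartialEq)]
enum Tier {
    U64,
    U128,
    Big,
}

// Classify a decimal string by the smallest integer width that can hold it.
fn tier_for(decimal: &str) -> Option<Tier> {
    let digits = decimal.trim();
    if digits.is_empty() || !digits.bytes().all(|b| b.is_ascii_digit()) {
        return None; // not a non-negative decimal integer
    }
    if digits.parse::<u64>().is_ok() {
        Some(Tier::U64)
    } else if digits.parse::<u128>().is_ok() {
        Some(Tier::U128)
    } else {
        Some(Tier::Big) // would need an arbitrary-precision type
    }
}

fn main() {
    for s in [
        "2502",
        "18446744073709551616",                    // 2^64
        "340282366920938463463374607431768211456", // 2^128
    ] {
        println!("{s} -> {:?}", tier_for(s));
    }
}
```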

Do you want me to comment out the u128 and BigUint benchmark cases? I'm not sure if I can show a stable result with this small range of values (and possibly not within the range of millisecond benchmarking).

sylvestre · 2025-11-09T07:49:58Z

> Do you want me to comment out the u128 and BigUint benchmark cases? I'm not sure if I can show a stable result with this small range of values (and possibly not within the range of millisecond benchmarking).

if possible, yes, we really need something stable here
do you know it is random ?

asder8215 · 2025-11-09T16:57:40Z

The piece of randomization I see is over here:

```rust
let divisor = loop {
    // try various factorization method iteratively, sort by time per iteration
    const NMETHODS: usize = 3;
    match i % NMETHODS {
        0 => {
            // Pollard's rho
            let start = MontgomeryInt::new(random::<u128>(), &target);
            let offset = start.convert(random::<u128>());
            let max_iter = max_iter_ratio << (target.bits() / 6); // unoptimized heuristic
            if let (Some(p), _) = pollard_rho(
                &SmallMint::from(target),
                start.into(),
                offset.into(),
                max_iter,
            ) {
                break p.value();
            }
        }
        1 => {
            // Hart's one-line
            let mul_target = target.checked_mul(480).unwrap_or(target);
            let max_iter = max_iter_ratio << (mul_target.bits() / 6); // unoptimized heuristic
            if let (Some(p), _) = one_line(&target, mul_target, max_iter) {
                break p;
            }
        }
        2 => {
            // Shanks's squfof, try all mutipliers
            let mut d = None;
            for &k in SQUFOF_MULTIPLIERS.iter() {
                if let Some(mul_target) = target.checked_mul(k as u128) {
                    let max_iter = max_iter_ratio * 2 * mul_target.sqrt().sqrt() as usize;
                    if let (Some(p), _) = squfof(&target, mul_target, max_iter) {
                        d = Some(p);
                        break;
                    }
                }
            }
            if let Some(p) = d {
                break p;
            }
        }
        _ => unreachable!(),
    }
    i += 1;

    // increase max iterations after trying all methods
    if i % NMETHODS == 0 {
        max_iter_ratio *= 2;
    }
};
```

Whenever factorize128() and factors() can't find the prime factors of a number using the small primes table (https://docs.rs/num-prime/latest/src/num_prime/tables.rs.html), they iterate through several factorization methods to find a factor. Pollard's rho picks its start and offset at random, so my assumption is that this loop can end earlier or later than usual depending on whether Pollard's rho gets a good start/offset value.
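To make the start/offset effect concrete, here is a small textbook Pollard's rho on u64 using Floyd cycle detection. It is a sketch under stated assumptions, not num_prime's implementation: it skips the Montgomery arithmetic, takes `start` and `offset` as explicit arguments instead of drawing them at random, and the function name and signature are chosen for illustration. The returned iteration count is what varies from one start/offset pair to the next.

```rust
fn gcd(mut a: u64, mut b: u64) -> u64 {
    while b != 0 {
        let t = a % b;
        a = b;
        b = t;
    }
    a
}

// Pollard's rho with f(x) = x^2 + offset (mod n) and Floyd cycle detection.
// Returns (nontrivial factor, iterations used), or None if this start/offset
// pair fails or max_iter is exhausted.
fn pollard_rho(n: u64, start: u64, offset: u64, max_iter: u32) -> Option<(u64, u32)> {
    let f = |x: u64| ((x as u128 * x as u128 + offset as u128) % n as u128) as u64;
    let (mut x, mut y) = (start % n, start % n);
    for i in 1..=max_iter {
        x = f(x); // tortoise: one step
        y = f(f(y)); // hare: two steps
        let d = gcd(x.abs_diff(y), n);
        if d == n {
            return None; // cycle closed without exposing a factor
        }
        if d > 1 {
            return Some((d, i));
        }
    }
    None
}

fn main() {
    // A composite chosen for illustration only.
    let n: u64 = 1_000_003 * 1_000_033;
    for start in [2u64, 3, 5, 7, 11] {
        match pollard_rho(n, start, 1, 1_000_000) {
            Some((d, iters)) => println!("start={start}: factor {d} after {iters} iterations"),
            None => println!("start={start}: no factor found"),
        }
    }
}
```

With a random start and offset, each benchmark run effectively picks a different row of that output, which is one plausible source of the run-to-run variation discussed here.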

This isn't an issue for u64 integers because the small primes table is enough for numbers within the 64-bit range (especially with the sequence 2-2502 I'm using for u64), but u128 or >u128 integers seem to fall through to this loop of factorization methods often.

I'm not sure how to get stable results for multiple u128/>u128 integers.

asder8215 added 4 commits November 7, 2025 02:50

365ba78 factor: use num_prime's u64 and u128 factorization methods to speed up the performance of calculating prime numbers for u64 and u128
d10f756 fix cspell issue (renamed function) and changed parsing of u128 digit to add u64 digits from big_uint
28fafb9 rollback converting from big_uint to u128
d7f5b12 factor: base benchmarking for single/multiple u64, u128, and >u128

asder8215 mentioned this pull request Nov 7, 2025

factor: use num_prime crate's u64 and u128 factorization methods to speed up the performance of calculating prime numbers for u64 and u128 #9171

Merged

sylvestre reviewed Nov 7, 2025

src/uu/factor/src/factor.rs (outdated)

5ada98b reset factor.rs to original branch code and added uu_factor to benchmarks.yml
e408039 changed benchmarking to use a smaller range of u64/u128/biguint integers
51553a4 reduced range for big_uint to 25 numbers only
5fb1a8f reduced factor multiple_big_uint to a range of 3
b492abd lowered divan bench samples and refactored benchmark code to use args from divan macro

asder8215 force-pushed the factor_benchmarking branch from 1f651e5 to b492abd Compare November 8, 2025 07:11

9e788c7 factor/bench: big_uint only benchmarks numbers that num_prime can factorize properly (factorize error take a while to propagate)

sylvestre reviewed Nov 8, 2025

src/uu/factor/benches/factor_bench.rs (outdated)

d977e29 factor/bench: reduced range on u64 and u128 benchmarking and remove single u64/u128/big_uint benchmark tests

sylvestre merged commit c615a0f into uutils:main Nov 8, 2025
122 checks passed

asder8215 added a commit to asder8215/coreutils that referenced this pull request Nov 8, 2025

Merge pull request uutils#9182 from asder8215/factor_benchmarking

092d410

Factor: base benchmarking for single/multiple u64, u128, and >u128

naoNao89 pushed a commit to naoNao89/coreutils that referenced this pull request Nov 8, 2025

Merge pull request uutils#9182 from asder8215/factor_benchmarking

7207dfa

Factor: base benchmarking for single/multiple u64, u128, and >u128

naoNao89 pushed a commit to naoNao89/coreutils that referenced this pull request Nov 9, 2025

Merge pull request uutils#9182 from asder8215/factor_benchmarking

3bf0e2b

Factor: base benchmarking for single/multiple u64, u128, and >u128

BrewTestBot mentioned this pull request Nov 10, 2025

uutils-coreutils 0.4.0 Homebrew/homebrew-core#253755

Merged

moonfruit mentioned this pull request Nov 10, 2025

uutils-selected 0.4.0 moonfruit/homebrew-tap#359

Closed
