implement adaptive-p sampler #17927
base: master
Conversation
Never mind, sorry, I think we want to do a little more testing. I'm going to mark this as draft again temporarily.
pnb left a comment:
This looks very interesting! I wish the original compared to XTC, since the goals seem highly similar.
As an aside, I am curious if there is some way to make it work without selecting a token (i.e., only steps 1-3). I see why token selection is necessary, given the need to save the original probability to the history for the adaptive adjustment part. But, for example, maybe it would suffice instead to save the original probability of the highest-probability token after transforming, regardless of which one is eventually selected by a downstream sampler.
src/llama-sampling.cpp (outdated):

```cpp
// fixed power law transform parameters (from original implementation)
const float distribution_width = 0.2f;
const float peak_logit_value = 3.0f;
```
Should these parameters be configurable like in the original implementation? There is probably a tradeoff with feature creep, having too many options for users to control, but some of these seem potentially important (especially distribution_width). Also, I noticed peak_logit_value is outside the range suggested in the original implementation; is that intentional?
The original author and I are discussing the parameters over the next few days. I agree that the current implementation is probably not ideal, which is why I marked it back as draft.
I will post a comment in the main thread with an update once we've got it more figured out. Thank you!
Very interesting sampler, thank you for the implementation! I like the effect so far: it stays on topic even on long results. One question: if this sampler must be the last in the chain, why include it alongside other samplers? For now it looks like a user can make a mistake by putting it elsewhere, which is probably not what we want. Maybe it's worth adding it into the chain at the end […]
The idea is that you're supposed to configure your truncation samplers (like min-p) in such a way that removes garbage tokens from the candidates pool, before it even hits Power Law. It's the same for temperature - if you're using a high temperature you should cut out the nonsense before you apply it. (@z80maniac)
This is good feedback, thank you. I will consider how to change it so that the power law sampler is guaranteed to always be at the end of the chain, if it's active. (@MaggotHATE)
I took another look through the code and I think the choice of what is a tunable parameter vs. what is a fixed default is great. The knobs to tune make sense, and I tried playing with the other parameters (that are now constants) without seeing much obvious effect in the text. Overall I would say the effect of this sampler is a little subtle compared to XTC, but it is noticeable with a low target like 0.05, where lots of excessively popular adverbs disappear from the results.
This is addressed now. Gentle poke to @ggerganov: are there any more changes needed here? What are your thoughts?
Let's come back to this after we merge #17004 (ETA: end of the year) as it will reduce the amount of work for me on this part of the code. |
Really excited about this sampler getting merged. Looks very promising for creative writing. |
- `ctx->weighted_sum` is now initialized and reset to `target / (1.0f - clamped_decay)`
- `ctx->total_weight` is now initialized and reset to `1.0f / (1.0f - clamped_decay)`

This fixes a "cold start" problem with the moving average.
Barring any changes requested by reviewers/maintainers, I believe this implementation to be correct and finalized at this point. Just waiting on #17004. |
no functional changes
This PR implements a new sampler called adaptive-p that selects tokens near a configurable target probability over time.
How it works
The adaptive-p sampler transforms the token probability distribution to favor tokens that fall near a user-configurable probability target. Internally, the sampler maintains an exponential moving average of the original probabilities of selected tokens. It uses this, along with the user's set target, to compute an adapted target at each sampling step, steering the running average toward the configured target over time. If recent selections have been higher-probability than target, the sampler compensates by temporarily favoring lower-probability tokens, and vice versa.
Parameters
This sampler exposes two parameters:
- `target` (`--adaptive-target N`)
- `decay` (`--adaptive-decay N`)

In most cases, you can just play with `--adaptive-target`. The default decay of 0.9 (for a ~10 token history) works well.
Usage notes
adaptive-p selects a token ID rather than just mutating candidates, so it must be last in the sampler chain. It shares this behaviour with some existing samplers like mirostat, dist, and greedy (mirostat being the closest relative).
Only mild truncation before this sampler is recommended. We suggest applying min-p before adaptive-p as the only other active sampler in the chain (optionally with top-k as well).
Example usage:
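A hypothetical invocation following the usage notes above (mild min-p truncation, then adaptive-p). The binary name, model path, and parameter values are placeholders; only `--adaptive-target` and `--adaptive-decay` come from this PR.

```shell
# Placeholder command line; adjust the binary, model, and values to taste.
./llama-cli -m model.gguf \
    --min-p 0.05 \
    --adaptive-target 0.3 \
    --adaptive-decay 0.9
```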
Other notes
This sampler was previously called "power law" in earlier versions of this PR, named for the power law transform we were applying to logits. We are no longer applying that transform. We also experimented with a Gaussian transform, but ultimately settled on the current formula.
Acknowledgements