gh-144569: Avoid creating temporary objects in BINARY_SLICE for list, tuple, and unicode #144590

Merged
markshannon merged 19 commits into python:main from cocolato:optimize/BINARY_SLICE
Mar 2, 2026

Conversation

@cocolato
Contributor

@cocolato cocolato commented Feb 8, 2026

Optimize BINARY_SLICE for list, tuple, and unicode objects with int/None indices.

This is the first step of the BINARY_SLICE optimization. We will implement more JIT optimizations after this PR.
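To illustrate the temporary objects in question (a plain-Python sketch, not the PR's C code): at the Python level, `data[a:b]` behaves like a `__getitem__` call with a freshly built `slice` object, and it is that kind of short-lived intermediate that a fast path can avoid creating.

```python
# Illustration only (plain Python, not CPython's C internals): slicing is
# semantically a __getitem__ call with a temporary slice object, which is
# the kind of short-lived intermediate the fast path avoids allocating.
data = [1, 2, 3, 4, 5]

via_syntax = data[1:3]
via_temporary = data.__getitem__(slice(1, 3))  # explicit temporary slice

assert via_syntax == via_temporary == [2, 3]
```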


Member

@Fidget-Spinner Fidget-Spinner left a comment
Pretty close, thanks!

@Fidget-Spinner
Copy link
Member

You forgot to add the new recorder to the BINARY_SLICE macro op.

Member

@Fidget-Spinner Fidget-Spinner left a comment

LGTM; just one comment, and please add a NEWS entry.

@Fidget-Spinner
Member

Actually, I've changed my mind. Could you please open a PR converting BINARY_SLICE to the POP_TOP form? Then, after we merge that PR, we can update this one.

Member

@Fidget-Spinner Fidget-Spinner left a comment

Let's leave the DECREF_INPUTS alone for now and try to remove it in a future PR.

The code has repeated instances that should be factored out into a micro-op.

@Fidget-Spinner
Member

This is pretty close. I'm going to push some changes in, then wait a day to see whether Mark has any objections.

@cocolato
Contributor Author

cocolato commented Feb 8, 2026

Thanks for your guidance!

1. We cannot exit after unpacking the indices, as the stack then contains tagged ints, which may lead to a crash. We must insert a type guard before unpacking the indices.
2. It is possible for an index not to fit in a tagged int, in which case we must deopt.
3. Recorded values do not always match the actual types we will see at runtime, so we must guard on those as well.

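A rough Python model of the three checks above (the names `TAGGED_INT_MAX`, `guarded_slice`, and the deopt strings are invented for illustration; the real guards live in the C uops):

```python
# Hypothetical model (plain Python, not the actual uop code) of the guard
# sequence: type guard first, then tagged-int range checks, then the slice.
TAGGED_INT_MAX = 2**62 - 1  # assumed payload width of a tagged int

def guarded_slice(container, start, stop):
    # 1. Guard the container type before unpacking the indices.
    if type(container) not in (list, tuple, str):
        return "deopt: unexpected container type"
    # 2. Each index must fit in a tagged int, otherwise deopt.
    for idx in (start, stop):
        if idx is not None and abs(idx) > TAGGED_INT_MAX:
            return "deopt: index does not fit in a tagged int"
    # 3. With both guards passed, the fast path is safe.
    return container[start:stop]

print(guarded_slice([1, 2, 3, 4, 5], 1, 3))  # [2, 3]
print(guarded_slice({1: "a"}, 0, 1))         # deopt: unexpected container type
```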
@Fidget-Spinner
Member

@cocolato please check out the latest commit and the extended commit message for the bugs fixed. Thanks!

@Fidget-Spinner
Member

add_int.py

def testfunc(n):
    data = [1, 2, 3, 4, 5]
    a, b = 1, 3
    for _ in range(n):
        x = data[a:b]
    return x

testfunc(50000000)

Results using hyperfine:

Summary
  PYTHON_JIT=1 ./python ../add_int.py ran
    1.12 ± 0.02 times faster than PYTHON_JIT=0 ./python ../add_int.py

So we made it >10% faster. Nice!

@cocolato
Contributor Author

cocolato commented Feb 9, 2026

@cocolato please check out the latest commit and the extended commit message for the bugs fixed. Thanks!

LGTM! Thanks again for the fix, I learned a lot from it.

Results using hyperfine:

This is my first time hearing about this tool, and it looks great. In the past, when I needed to run benchmarks locally, timeit was somewhat inaccurate and pyperformance was a bit too heavy.
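For comparison, a minimal timeit version of the same micro-benchmark (a sketch, not from the PR; unlike hyperfine it times only the statement, not whole-process startup, and absolute numbers will differ per machine):

```python
# Minimal timeit equivalent of the benchmark above; times just the slice
# statement rather than a full interpreter run as hyperfine does.
import timeit

setup = "data = [1, 2, 3, 4, 5]; a, b = 1, 3"
elapsed = timeit.timeit("x = data[a:b]", setup=setup, number=1_000_000)
print(f"1,000,000 slices took {elapsed:.3f}s")
```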


Member

@markshannon markshannon left a comment

Thanks for doing this.
This looks sound, but there are some inefficiencies.

Beware the size of uops. They will be converted to machine code, so keeping code size down is important for performance.

@bedevere-app

bedevere-app bot commented Feb 10, 2026

Thanks for making the requested changes!

@markshannon: please review the changes made to this pull request.

@bedevere-app bedevere-app bot requested a review from markshannon February 10, 2026 16:04
@markshannon
Member

markshannon commented Feb 18, 2026

Sorry for the delay in getting back to this.

I don't think we want _UNPACK_INDICES. It seems to make the code both slower and more complex.

I think we should split this into two PRs: the first with just the interpreter changes (bytecodes.c), leaving the JIT optimizations for another PR.

To make it worth splitting the operation into uops we need to be able to optimize the uops effectively. That would mean handling the index adjustments in the optimizer as much as possible and leaving only minimal checking to the runtime.
To do that we will have to add a few new uops for loading tagged ints and bounds checking.

For example, take x[:-1], where x is known to be a list. We want it to be converted to something like:

// `x` is already on the stack and known to be a list
_LOAD_TAGGED_INT 0 // No bounds checking needed for 0.
_LOAD_TAGGED_INT -1
_LIST_LEN 3 // Operand is the stack depth of the list
_INDEX_ADJUST  // Convert the -1 to an in-bounds value.
_LIST_SLICE // No need to check indices here.

Which looks bulky, but each of the uops is only a few machine instructions (_LIST_SLICE will be a function call).

(In the above, _LIST_LEN pushes the length of the list as a tagged int, and _INDEX_ADJUST replaces index, len with adjusted_index following the slice index semantics.)

So, let's keep it simple for the first PR, and just add special cases for lists, tuples and strings in BINARY_SLICE.
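The index adjustment described above can be sketched in plain Python (an illustrative model of the step=1 case, comparable to what `PySlice_AdjustIndices` does in C; the function name here is invented):

```python
# Illustrative Python model of the slice index adjustment: map a
# possibly-negative index to an in-bounds value per Python's slice
# semantics (step assumed to be 1).
def adjust_index(index, length):
    if index < 0:
        index += length
        if index < 0:
            index = 0       # clamp, e.g. x[-10:] on a short list
    elif index > length:
        index = length      # clamp past-the-end indices
    return index

data = [1, 2, 3, 4, 5]
start = adjust_index(0, len(data))   # 0 needs no adjustment
stop = adjust_index(-1, len(data))   # -1 becomes 4

assert data[start:stop] == data[:-1] == [1, 2, 3, 4]
```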

@cocolato
Contributor Author

@markshannon Thanks for the review! I have reimplemented the fast path for BINARY_SLICE in bytecodes.c. After this PR is merged, I will attempt further JIT optimizations.

@cocolato
Contributor Author

Windows tests failed; I'm trying to find the cause.

@markshannon
Member

Windows tests failed, I'm trying to find the cause.

It might be worth merging in the latest main to see if that fixes it.

@cocolato cocolato requested a review from markshannon February 20, 2026 16:29
@cocolato
Contributor Author

On my test machine, this optimization gives about a 16-17% speedup:

test.py:

data = [1, 2, 3, 4, 5]
a, b = 1, 3

def main():
    for _ in range(100000):
        x = data[a:b]
    return x

main()

bench result:

Summary
  ./python_opt ./test.py ran
    1.17 ± 0.08 times faster than ./python ./test.py

Member

@markshannon markshannon left a comment

Thanks!
I have a couple of suggestions on how to structure the code, but other than that, LGTM.
The performance numbers are really good.

@cocolato
Contributor Author

cocolato commented Mar 2, 2026

Thanks for the review again! Updated.

@cocolato cocolato requested a review from markshannon March 2, 2026 14:23
Member

@markshannon markshannon left a comment

Looks good now.

@markshannon markshannon merged commit 107863e into python:main Mar 2, 2026
75 of 76 checks passed