
gh-100239: Specialize binary operations using BINARY_OP_EXTEND#128956

Open
eendebakpt wants to merge 9 commits intopython:mainfrom
eendebakpt:binary_op_list_list

Conversation

@eendebakpt
Contributor

@eendebakpt eendebakpt commented Jan 17, 2025

  • We add list and tuple concatenation to BINARY_OP_EXTEND.
  • We propagate type information through BINARY_OP_EXTEND in tier 2, which allows the JIT to perform better optimizations.
  • The JIT can now eliminate the _GUARD_BINARY_OP_EXTEND guard when the type information is known.
  • The fraction of code specialized for BINARY_OP increases from 70% to 90%.

Benchmarks appear to be performance neutral (within the ±1% range).

Benchmark script
"""Benchmark for BINARY_OP_EXTEND type propagation.

Tests whether the tier 2 optimizer can eliminate guards when types are
known from previous BINARY_OP_EXTEND results.

Usage:
    ./python bench_binary_op_extend.py
    ./python bench_binary_op_extend.py --save result.json
    ./python bench_binary_op_extend.py --compare a.json b.json
"""

import sys
import pyperf

INNER = 2000


def bench_list_concat_subscr(n):
    """list + list followed by subscript — tests list type propagation."""
    a = [1, 2, 3]
    b = [4, 5, 6]
    total = 0
    for _ in range(n):
        c = a + b
        total += c[0] + c[3]
    return total


def bench_tuple_concat_unpack(n):
    """tuple + tuple followed by unpack — tests tuple type propagation."""
    t1 = (1, 2)
    t2 = (3, 4)
    total = 0
    for _ in range(n):
        a, b, c, d = t1 + t2
        total += a + d
    return total


def bench_str_repeat(n):
    """str * int in a loop — tests str type propagation."""
    s = "ab"
    total = 0
    for i in range(n):
        r = s * (i % 5)
        total += len(r)
    return total


def bench_bytes_concat(n):
    """bytes + bytes in a loop — tests bytes type propagation."""
    a = b"hello"
    b_ = b" world"
    total = 0
    for _ in range(n):
        c = a + b_
        total += len(c)
    return total


def bench_bytes_repeat(n):
    """bytes * int in a loop — tests bytes type propagation."""
    b = b"ab"
    total = 0
    for i in range(n):
        r = b * (i % 3)
        total += len(r)
    return total


def bench_tuple_repeat(n):
    """tuple * int in a loop — tests tuple type propagation."""
    t = (1, 2, 3)
    total = 0
    for i in range(n):
        r = t * (i % 3)
        total += len(r)
    return total


def bench_dict_merge(n):
    """dict | dict in a loop — tests dict type propagation."""
    d1 = {"a": 1, "b": 2}
    d2 = {"c": 3, "d": 4}
    total = 0
    for _ in range(n):
        d = d1 | d2
        total += len(d)
    return total


def bench_chained_list_ops(n):
    """Multiple list ops chained — tests guard elimination across ops."""
    a = [1, 2]
    b = [3, 4]
    total = 0
    for _ in range(n):
        c = a + b
        d = c + a
        total += d[0] + d[4]
    return total


def bench_mixed_float_int(n):
    """float + int and int + float — existing EXTEND specializations."""
    x = 1.5
    total = 0.0
    for i in range(n):
        a = x + i
        total += a
    return total


def float_mix_mul(n):
    """float + int then float * float — tests unique flag for inplace mul."""
    x = 1.5
    total = 0.0
    for i in range(n):
        a = (x + i) * 2.0  # result of x+i should be unique -> inplace multiply
        total += a
    return total

BENCHMARKS = [
    ("list_concat_subscr", bench_list_concat_subscr),
    ("tuple_concat_unpack", bench_tuple_concat_unpack),
    ("str_repeat", bench_str_repeat),
    ("bytes_concat", bench_bytes_concat),
    ("bytes_repeat", bench_bytes_repeat),
    ("tuple_repeat", bench_tuple_repeat),
    ("dict_merge", bench_dict_merge),
    ("chained_list_ops", bench_chained_list_ops),
    ("mixed_float_int", bench_mixed_float_int),
    ("float_mix_mul", float_mix_mul),
]

def main():
    args = sys.argv[1:]

    if "--compare" in args:
        idx = args.index("--compare")
        file_a = args[idx + 1]
        file_b = args[idx + 2]
        import subprocess
        subprocess.run([sys.executable, "-m", "pyperf", "compare_to",
                       file_a, file_b, "--table"])
        return

    save_file = None
    if "--save" in args:
        idx = args.index("--save")
        save_file = args[idx + 1]
        # Strip the custom option so pyperf's own argument parser does
        # not reject it as unknown (args is sys.argv[1:], so positions
        # in sys.argv are shifted by one).
        del sys.argv[idx + 1:idx + 3]

    runner = pyperf.Runner()
    for name, func in BENCHMARKS:
        # Warm up in the parent process; pyperf also performs its own
        # warmup runs in its worker processes.
        func(INNER)
        runner.bench_func(name, func, INNER)

    if save_file and runner.args.output:
        # Copying only works when pyperf's -o/--output option was also
        # given; otherwise there is no result file to copy.
        import shutil
        shutil.copy(runner.args.output, save_file)


if __name__ == "__main__":
    main()

@eendebakpt eendebakpt force-pushed the binary_op_list_list branch from 03b3922 to 51d1b11 Compare April 5, 2026 20:00
# Conflicts:
#	Lib/test/test_capi/test_opt.py
#	Python/specialize.c
@eendebakpt eendebakpt marked this pull request as draft April 6, 2026 19:56
@eendebakpt eendebakpt changed the title gh-100239: Specialize concatenation of lists and tuples gh-100239: Specialize binary operations using BINARY_OP_EXTEND Apr 6, 2026
@markshannon
Member

This looks good overall, but I've not done a detailed review.

I have a couple of general concerns about the BINARY_OP_EXTEND optimization itself, not about this PR specifically, but something to keep in mind:

  • How do we ensure the robustness of the VM and its optimizations when we expose this to third-party code, as we intend to do at some point in the future?
  • As binaryop_extend_descrs gets larger, specialization of binary ops will get slower. Can we sort the array, or use a mapping to reduce the overhead? It shouldn't be a problem yet, but could be in the future if there were 100s of entries.
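The second concern can be illustrated with a hypothetical Python model of the descriptor lookup (names and operator codes here are illustrative, not CPython's; the real table, binaryop_extend_descrs, is a C array in Python/specialize.c):

```python
# A flat list of descriptors requires a linear scan at specialization
# time; a mapping keyed by (operator, left type, right type) keeps the
# lookup O(1) even with hundreds of entries.
NB_ADD, NB_MULTIPLY = 0, 5  # illustrative operator codes

DESCRS = {
    (NB_ADD, list, list): "list_concat",
    (NB_ADD, tuple, tuple): "tuple_concat",
    (NB_MULTIPLY, str, int): "str_repeat",
}

def lookup_descr(oparg, lhs, rhs):
    """Return the specialized handler name, or None to stay generic."""
    return DESCRS.get((oparg, type(lhs), type(rhs)))
```

A sorted array with binary search would give the same asymptotic win without a hash table, at the cost of keeping the entries ordered.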

@eendebakpt eendebakpt marked this pull request as ready for review April 7, 2026 21:26