[SOT][Faster Guard] Implement more`guard_tree_expr_node` and `TensorDtypeVariable.make_faster_guard` #72463

zrr1999 · 2025-04-24T13:46:11Z

PR Category

Execute Infrastructure

PR Types

Performance

Description

Implement more guard_tree_expr_node
Implement TensorDtypeVariable.make_faster_guard
开启 SOT_ENABLE_STRICT_GUARD_CHECK 下的 Guard Tree 验证

TODO

去掉GuardBase的概念，所有的Guard都改成基于 GuardNodeBase 的类，这样就不需要考虑check输入几个参数的问题了，每个类有自己的 lookup。GuardNodeBase子类分成 UnaryGuardNodeBase（功能类似目前的GuardNode），BinaryGuardNodeBase（可以基于 #72327 实现），原本的 ExprGuardNode 可基于 UnaryGuardNodeBase 实现实现复用。

实现 DummyGuardNode

FunctionGlobal 和 FunctionClosure 放在 tracker.py 统一管理吧

paddle-bot · 2025-04-24T13:46:17Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Copilot

Pull Request Overview

This PR implements additional functionality for guard expression nodes and integrates a faster, specialized guard branch for tensor dtypes. Key changes include:

Adding implementations of guard_tree_expr_node in virtual frame, function closure tracker, and inline executor.
Implementing TensorDtypeVariable.make_faster_guard with a new branch for GetAttrTracker-based tensor variables.
Updating guard_tree_expr_node return types across various tracker classes and modifying the executor cache and C++ guard handling (now throwing on an empty guard chain).

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
python/paddle/jit/sot/opcode_translator/executor/virtual_frame.py	Added guard_tree_expr_node method for closure cells using attribute and item expression nodes.
python/paddle/jit/sot/opcode_translator/executor/variables/basic.py	Implemented TensorDtypeVariable.make_faster_guard with a new branch for GetAttrTracker-based cases.
python/paddle/jit/sot/opcode_translator/executor/tracker.py & opcode_inline_executor.py	Updated return type of guard_tree_expr_node to ExprNodeBase and implemented similar changes in inline executor.
python/paddle/jit/sot/opcode_translator/executor/function_graph.py	Modified the strategy for handling empty guard chains via a dummy guard chain.
paddle/fluid/pybind/sot/guards.cc	Changed empty guard chain handling to throw a runtime error.
python/paddle/jit/sot/opcode_translator/executor/executor_cache.py	Adjusted internal caching tuple structure and aligned guard index assertions.

Comments suppressed due to low confidence (3)

python/paddle/jit/sot/opcode_translator/executor/function_graph.py:331

Since the C++ side now throws a runtime error when the guard chain is empty, please verify that generating a dummy guard chain here consistently prevents empty guard chains in all execution scenarios.

guard_chain: GuardChain = [
                paddle.framework.core.GuardNode(
                    paddle.framework.core.DummyGuard(),
                    [paddle.framework.core.ConstantExprNode(True)],
                )
            ]

paddle/fluid/pybind/sot/guards.cc:405

The updated behavior now throws an exception when the guard chain is empty. Please verify that all cases leading to an empty guard chain are either updated to supply a valid dummy guard or handled appropriately to prevent unexpected runtime failures.

throw std::runtime_error("Empty guard chain");

python/paddle/jit/sot/opcode_translator/executor/variables/basic.py:342

Confirm that the PIR API check correctly reflects the runtime environment. If PIR is not enabled, ensure that falling back to object_equal_faster_guard still provides adequate guard coverage.

assert paddle.framework.use_pir_api(), "Only support PIR"

python/paddle/jit/sot/opcode_translator/executor/opcode_inline_executor.py

SigureMo · 2025-04-25T10:16:30Z

paddle/fluid/pybind/sot/guards.cc

-    // TODO(zrr1999): empty guard nodes means that some
-    // tracker.make_faster_guard is not implemented.
-    return;
+    throw std::runtime_error("Empty guard chain");


这里能直接 PADDLE_THROW 么？

SigureMo · 2025-04-25T10:17:22Z

python/paddle/jit/sot/opcode_translator/executor/executor_cache.py

-            self.cache[code] = [
-                (new_custom_code, guard_fn)
-            ], paddle.framework.core.GuardTree([guard_chain])
+            self.cache[code] = (


ruff 干的？

SigureMo · 2025-04-25T10:24:47Z

python/paddle/jit/sot/opcode_translator/executor/executor_cache.py

-                    # ), f"cache_index({cache_index}) is not equal to index({index})"
+                    assert (
+                        cache_index is None or index == cache_index
+                    ), f"cache_index({cache_index}) is not equal to index({index})"


现在 CI 上都能测到了么？没有一点问题么？Tensor 那边还没完善呢

可以确认一下，如果测试体系有问题的话，后续推进是有风险的

…typeVariable.make_faster_guard` (PaddlePaddle#72463)

zrr1999 added 3 commits April 24, 2025 13:31

impl FunctionClosureTracker.guard_tree_expr_node

139d00e

fix type hint

7f6563a

add check

58e1910

paddle-bot bot added the contributor External developers label Apr 24, 2025

zrr1999 marked this pull request as draft April 24, 2025 13:46

zrr1999 marked this pull request as ready for review April 24, 2025 13:52

zrr1999 added 3 commits April 25, 2025 04:35

throw error when guard_chain is empty

f04d50a

fix len(guard_chain) == 0

7c9ba5f

impl TensorDtypeVariable.make_faster_guard

fc2efd4

zrr1999 changed the title ~~[SOT][Faster Guard] Implement more guard_tree_expr_node~~ [SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard Apr 25, 2025

zrr1999 requested review from SigureMo and Copilot and removed request for SigureMo April 25, 2025 05:27

Copilot AI reviewed Apr 25, 2025

View reviewed changes

python/paddle/jit/sot/opcode_translator/executor/opcode_inline_executor.py Outdated Show resolved Hide resolved

fix

e973681

zrr1999 requested a review from SigureMo April 25, 2025 09:04

SigureMo reviewed Apr 25, 2025

View reviewed changes

SigureMo previously approved these changes Apr 25, 2025

View reviewed changes

zrr1999 changed the title ~~[SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard~~ [SOT][Faster Guard][3.13] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard Apr 25, 2025

zrr1999 changed the title ~~[SOT][Faster Guard][3.13] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard~~ [SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard Apr 25, 2025

zrr1999 changed the title ~~[SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard~~ [SOT][Faster Guard][3.13] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard Apr 25, 2025

rm unused code

a63a932

zrr1999 dismissed SigureMo’s stale review via a63a932 April 25, 2025 14:40

SigureMo approved these changes Apr 25, 2025

View reviewed changes

SigureMo changed the title ~~[SOT][Faster Guard][3.13] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard~~ [SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard Apr 25, 2025

SigureMo merged commit 99a2dc9 into PaddlePaddle:develop Apr 25, 2025
40 of 42 checks passed

zrr1999 deleted the guard_tree_expr_node branch April 26, 2025 02:08

YqGe585 pushed a commit to YqGe585/Paddle that referenced this pull request May 7, 2025

[SOT][Faster Guard] Implement moreguard_tree_expr_node and `TensorD…

b1c2e4b

…typeVariable.make_faster_guard` (PaddlePaddle#72463)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SOT][Faster Guard] Implement more`guard_tree_expr_node` and `TensorDtypeVariable.make_faster_guard` #72463

[SOT][Faster Guard] Implement more`guard_tree_expr_node` and `TensorDtypeVariable.make_faster_guard` #72463

Uh oh!

zrr1999 commented Apr 24, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Apr 24, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

SigureMo Apr 25, 2025

Uh oh!

SigureMo Apr 25, 2025

Uh oh!

SigureMo Apr 25, 2025

Uh oh!

Uh oh!

Uh oh!

[SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard #72463

[SOT][Faster Guard] Implement moreguard_tree_expr_node and TensorDtypeVariable.make_faster_guard #72463

Uh oh!

Conversation

zrr1999 commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

TODO

Uh oh!

paddle-bot bot commented Apr 24, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

SigureMo Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

SigureMo Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

SigureMo Apr 25, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

[SOT][Faster Guard] Implement more`guard_tree_expr_node` and `TensorDtypeVariable.make_faster_guard` #72463

[SOT][Faster Guard] Implement more`guard_tree_expr_node` and `TensorDtypeVariable.make_faster_guard` #72463

zrr1999 commented Apr 24, 2025 •

edited

Loading