[CINN]Fix compilation cache re-use same fn_ptr wrongly #64718

Aurelius84 · 2024-05-29T11:53:39Z

PR Category

CINN

PR Types

Bug fixes

Description

Pcard-67164

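Background

On the scientific-computing LDC model, with the cache disabled, there are 8 kernels of the following form: fn_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_add_elementwise_add_yield_store_elementwise_mul_elementwise_add_elementwise_add_elementwise_add_elementwise_add_elementwise_add_

With the cache enabled, these kernels are collapsed into a single reused one: fn_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_mul_elementwise_add_elementwise_add_yield_store_elementwise_mul_elementwise_add_elementwise_add_elementwise_add_elementwise_add_elementwise_add__2

They correspond to the two subgraph cases below. When computing %16, the upstream ops it depends on actually differ: in one case they are L6+L7+L8, in the other L5+L7+L8.

At the op level, these upstream ops have identical ValueInfo and OpInfo (that is, identical hashes), but the dependency order among the intermediate ops inside the group differs. For the generated function, this can change the order in which the input argument tensors are parsed and mapped, so the CodeGen function bodies in the two red boxes in the figure below use different vars. If the cache is hit, the result is a wrong computation.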

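Solution

In the FusionOpInfo data structure in fusion_info.cc, inner_deps_ must account not only for the hash of each upstream op it depends on, but also for that upstream op's index.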
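To make the collision concrete, here is a minimal sketch. It is not the actual FusionOpInfo code: `ToyOp`, `GroupKey`, and `MakeGroup` are hypothetical names invented for illustration, and a plain string key stands in for the real hash. It only shows why recording the upstream op's identity alone cannot tell the two subgraphs apart, while recording identity plus index can.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for one op inside a fused group (not the real CINN type).
struct ToyOp {
  std::string name;               // plays the role of the op's OpInfo/ValueInfo hash
  std::vector<std::size_t> deps;  // positions of the upstream ops inside the group
};

// Builds a cache key for a group. With with_dep_index == false each dependency
// contributes only the upstream op's name; with true its position is added too.
std::string GroupKey(const std::vector<ToyOp>& ops, bool with_dep_index) {
  std::string key;
  for (const ToyOp& op : ops) {
    key += op.name + "(";
    for (std::size_t dep : op.deps) {
      assert(dep < ops.size());  // a dependency must point inside the group
      key += ops[dep].name;
      if (with_dep_index) key += "@" + std::to_string(dep);
      key += ",";
    }
    key += ");";
  }
  return key;
}

// Two toy groups that differ only in which of several identical-looking
// upstream ops the final op reads first.
std::vector<ToyOp> MakeGroup(std::size_t first_dep) {
  std::vector<ToyOp> ops = {ToyOp{"mul", {}}, ToyOp{"mul", {}},
                            ToyOp{"mul", {}}, ToyOp{"mul", {}}};
  ops.push_back(ToyOp{"add", {first_dep, 2, 3}});
  return ops;
}
```

In this sketch, `GroupKey(MakeGroup(0), false)` and `GroupKey(MakeGroup(1), false)` are equal, so a cache keyed this way would hand back the same compiled function for both groups. With `with_dep_index` set to true the two keys differ, and each group gets its own entry.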

paddle-bot · 2024-05-29T11:53:44Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

Aurelius84 · 2024-05-29T12:05:49Z

paddle/phi/kernels/cpu/accuracy_check_kernel.cc

@@ -20,7 +20,7 @@
 #include "paddle/phi/core/kernel_registry.h"
 #include "paddle/phi/core/tensor_utils.h"

-static constexpr float kAtolValue = 1e-5;
+static constexpr float kAtolValue = 1e-8;


The atol and rtol thresholds here are aligned with the default values of numpy.allclose.
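For reference, numpy.allclose defaults to rtol=1e-05 and atol=1e-08 and tests each element with |a - b| <= atol + rtol * |b|. The following is a minimal sketch of that check, not the code in accuracy_check_kernel.cc: `IsClose`, `AllClose`, `kRtol`, and `kAtol` are illustrative names, `double` is used instead of the kernel's `float`, and returning false on a size mismatch is an assumption made here (NumPy would broadcast or raise instead).

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Default tolerances of numpy.allclose.
constexpr double kRtol = 1e-5;
constexpr double kAtol = 1e-8;

// Element-wise test used by numpy.allclose: |a - b| <= atol + rtol * |b|.
// NaN inputs compare as not close, matching equal_nan=False.
bool IsClose(double a, double b, double rtol = kRtol, double atol = kAtol) {
  return std::fabs(a - b) <= atol + rtol * std::fabs(b);
}

// True only if every pair of elements passes IsClose.
// Assumption of this sketch: mismatched sizes simply yield false.
bool AllClose(const std::vector<double>& a, const std::vector<double>& b,
              double rtol = kRtol, double atol = kAtol) {
  if (a.size() != b.size()) return false;
  for (std::size_t i = 0; i < a.size(); ++i) {
    if (!IsClose(a[i], b[i], rtol, atol)) return false;
  }
  return true;
}
```

The absolute tolerance matters most near zero: in this sketch `IsClose(0.0, 1e-6)` fails with atol = 1e-8, but `IsClose(0.0, 1e-6, kRtol, 1e-5)` would pass with the previous value of 1e-5.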

cxxly

LGTM

[CINN]Fix compilation cache re-use same fn_ptr wrongly

2abb74e

fix typo

019eb53

Aurelius84 commented May 29, 2024

Aurelius84 requested review from phlrain and cxxly May 30, 2024 05:59

phlrain approved these changes May 30, 2024

cxxly approved these changes May 30, 2024

Aurelius84 merged commit a93f26e into PaddlePaddle:develop May 30, 2024
32 checks passed
