[SYCL] Disable vectorization and loop transformation passes #2458

bader · 2020-09-10T14:42:59Z

No description provided.

Loop unrolling in "SYCL optimization mode" uses default heuristic, which is tuned for CPU and might not be profitable for other devices.
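To make the intent concrete, here is a minimal sketch of what "disable loop passes in SYCL optimization mode" amounts to. It is a toy model, not the code in PassManagerBuilder.cpp: `PipelineOptions`, its `SYCLOptimizationMode` field and `buildFunctionPipeline` are hypothetical names introduced for illustration, and the pass names are only labels in a list.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for a pipeline configuration; this is NOT LLVM's API.
struct PipelineOptions {
  // Assumed flag name, used here only for illustration.
  bool SYCLOptimizationMode = false;
};

// Returns the ordered list of pass names a toy builder would schedule.
std::vector<std::string> buildFunctionPipeline(const PipelineOptions &Opts) {
  std::vector<std::string> Passes = {"instcombine", "simplifycfg"};

  // Loop unrolling and vectorization rely on heuristics tuned for the
  // host CPU, so the toy builder skips them for device code.
  if (!Opts.SYCLOptimizationMode) {
    Passes.push_back("loop-unroll");
    Passes.push_back("loop-vectorize");
    Passes.push_back("slp-vectorizer");
  }

  // Cleanup that runs regardless of the mode.
  Passes.push_back("instcombine");
  return Passes;
}
```

With the flag left at its default, the toy builder schedules the loop and vectorization passes; with the flag set, the returned list contains only the scalar cleanup passes.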

This change seems to hide issues with broadcast tests on CPU.

bader · 2020-09-10T14:43:20Z

/summary:run

bader · 2020-09-25T14:54:25Z

/summary:run

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

bader · 2020-09-26T05:56:39Z

/summary:run

bader · 2020-09-28T15:52:17Z

/summary:run

bader · 2020-09-29T08:57:48Z

/summary:run

bader · 2020-09-29T18:03:32Z

/summary:run

bader · 2020-10-05T17:06:05Z

This PR exposes a regression in test_stream from Khronos SYCL-CTS on GPU.
The issue is addressed in https://github.com/intel/compute-runtime/releases/tag/20.39.17972, so we need to update the driver first.

bader · 2021-04-26T14:10:52Z

@DenisBakhvalov, now that we have removed the special flag for ESIMD mode, this patch disables loop optimization transformations for ESIMD mode as well, and there are a few failures in ESIMD-specific tests. Could you suggest a way to fix them?

bader · 2021-04-26T14:11:02Z

/summary:run

bader · 2021-04-28T18:19:34Z

/summary:run

bader · 2021-04-29T10:02:23Z

@DenisBakhvalov, now that we have removed the special flag for ESIMD mode, this patch disables loop optimization transformations for ESIMD mode as well, and there are a few failures in ESIMD-specific tests. Could you suggest a way to fix them?

Recent fixes plus the new GPU driver for Windows resolved all ESIMD-specific issues except for 3 failing llvm-test-suite tests on Windows (no failures on Linux).

Failed Tests (3):
SYCL :: ESIMD/private_memory/pm_access_1.cpp
SYCL :: ESIMD/private_memory/pm_access_2.cpp
SYCL :: ESIMD/private_memory/pm_access_3.cpp

[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi1EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi1EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi1EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi1EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi2EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi2EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi2EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi2EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi2EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi2EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi2EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi2EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi3EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi3EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi3EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi3EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTSN2cl4sycl6detail19__pf_kernel_wrapperI8KernelIDILi3EEEE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi3EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi3EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi3EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi3EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] warning: GenXPromoteArray: _ZTS8KernelIDILi3EE allocation size is too big: using TPM
[2021-04-28T20:30:26.851Z] LLVM ERROR: Cannot find pointer replacement

@NikitaRudenkoIntel, do you know if this is a known issue in the GPU compiler? I can't find where this diagnostic could be reported in DPC++.

NikitaRudenkoIntel · 2021-04-29T10:23:23Z

Hi, yes there are some issues in TPM. Actually, there is even a ticket with these exact tests and this exact failure. It is fixed. Can you check if your driver is up to date?

bader · 2021-04-29T10:33:26Z

Can you check if your driver is up to date?

We are using 27.20.100.9466 from https://downloadmirror.intel.com/30381/a08/igfx_win10_100.9466.zip.
Is there a more recent publicly available version?

bader · 2021-04-29T12:01:50Z

/summary:run

bader · 2021-04-29T18:41:11Z

Hi, yes there are some issues in TPM. Actually, there is even a ticket with these exact tests and this exact failure. It is fixed. Can you check if your driver is up to date?

I managed to get past this issue, probably thanks to the recent optimization pipeline adjustments.
We can get back to the driver version question if we encounter it again.

bader · 2021-05-12T12:03:47Z

I looked at the regressions, and all of them are issues in external dependencies: OpenCL CPU, Level Zero GPU and llvm-test-suite tests.
I'll address the llvm-test-suite test issue as soon as this patch is merged. Low-level runtime issues should be addressed by updating the corresponding runtimes.

@mdtoguchi, @AGindinson, @intel/llvm-reviewers-runtime, please review this change.

bader · 2021-05-12T12:06:28Z

Sorry, I didn't notice that there are no runtime changes anymore, so I only need a review from Mike or Artem.

mdtoguchi

Driver OK

bader added 3 commits September 10, 2020 12:49

[SYCL] Disable loop unrolling and vectorization

587514e

Loop unrolling in "SYCL optimization mode" uses default heuristic, which is tuned for CPU and might not be profitable for other devices.

Disable loop pass pipeline in SYCL optimization mode.

ab30c86

This change seems to hide issues with broadcast tests on CPU.

Disable more vectorization passes

dec884d

Update LIT tests status.

41f88ab

bader mentioned this pull request Sep 10, 2020

SYCL device compiler optimizations impact #2264

Closed

bader added the performance label Sep 25, 2020

bader marked this pull request as ready for review September 25, 2020 14:53

bader requested a review from a team as a code owner September 25, 2020 14:53

bader requested a review from sergey-semenov September 25, 2020 14:53

bader commented Sep 25, 2020

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp (outdated, resolved)

Revert LTO pipeline changes.

700bac4

bader mentioned this pull request Sep 25, 2020

[SYCL] Disable loop passes in SYCL optimization mode #2414

Closed

Merge remote-tracking branch 'intel/sycl' into loop-opts

8d74e41

bader added 3 commits September 29, 2020 11:28

Revert "Revert LTO pipeline changes."

f4dbb09

This reverts commit 700bac4.

Merge remote-tracking branch 'intel/sycl' into loop-opts

bf95c36

Merge remote-tracking branch 'intel/sycl' into loop-opts

0f2dac6

Revert "Revert "Revert LTO pipeline changes.""

5f54a0a

This reverts commit f4dbb09.

bader added 3 commits September 30, 2020 13:33

Disable broken tests

8a931aa

[NOT-FOR-MERGE]Revert "[BuildBot] Uplift GPU RT version to 20.37.17906 (

46b41d9

intel#2504)" This reverts commit 0c8d46e. Just to check if new regressions are caused by the driver update.

Revert "[NOT-FOR-MERGE]Revert "[BuildBot] Uplift GPU RT version to 20…

2179514

….37.17906 (intel#2504)"" This reverts commit 46b41d9.

MrSidims previously approved these changes Oct 5, 2020

bader added 2 commits October 6, 2020 19:18

Revert "Disable broken tests"

79662d3

This reverts commit 8a931aa.

Merge remote-tracking branch 'intel/sycl' into loop-opts

62fa77f

bader added 4 commits April 20, 2021 16:16

Merge remote-tracking branch 'intel/sycl' into loop-opts

1b11cd3

Remove fsycl-esimd option usage.

98b9cba

Recovered removed passes due to bad merge conflict resolution.

6b57983

Apply clang-format to the previous patch.

ec18cc2

Merge remote-tracking branch 'intel/sycl' into loop-opts

7c8fe4f

bader added 2 commits April 29, 2021 09:22

Disable diagnostics for disabled loop optimizations.

968271a

Disable reassociation pass to check performance impact.

4c37555

bader added a commit to bader/llvm-test-suite that referenced this pull request Apr 29, 2021

Enable boolean type test on GPU.

1388d46

intel/llvm#2458 fixes issue 2264.

bader mentioned this pull request Apr 29, 2021

Enable boolean type test on GPU. intel/llvm-test-suite#257

Merged

bader added 2 commits May 12, 2021 10:12

Merge remote-tracking branch 'intel/sycl' into loop-opts

8fdc04b

Revert "Disable reassociation pass to check performance impact."

b9a2c3d

This reverts commit 4c37555.

bader requested a review from MrSidims May 12, 2021 12:03

mdtoguchi approved these changes May 12, 2021

AGindinson approved these changes May 12, 2021

bader merged commit ff6929e into intel:sycl May 12, 2021

bader deleted the loop-opts branch May 12, 2021 18:40

againull pushed a commit to intel/llvm-test-suite that referenced this pull request May 14, 2021

Enable boolean type test on GPU. (#257)

4c8a8d4

intel/llvm#2458 fixes issue 2264.

aelovikov-intel pushed a commit to aelovikov-intel/llvm that referenced this pull request Mar 27, 2023

Enable boolean type test on GPU. (intel/llvm-test-suite#257)

8195e95

intel#2458 fixes issue 2264.

Chenyang-L pushed a commit that referenced this pull request Feb 18, 2025

Merge pull request #2458 from pbalcer/filter-suites

a2f7ed8

[benchmarks] add ability to filter benchmarks by suite
