TF: rework XLA generate tests #16866

gante · 2022-04-20T21:24:23Z

What does this PR do?

In light of recent findings (#16838), this PR reworks the existing XLA generate tests. Key changes:

Adds a @unittest.skipIf to the XLA generate tests, so they are skipped when no GPU is present;
Reworks the XLA sample tests. Because of the minor numerical differences that arise when we use XLA (and that we can't control), the sampling step will draw different samples even when we use the same seed. The only things we can properly test are whether a) we can seed the sampling and b) the results are sensible;
Adds at least one XLA test where the batch size is > 1 and the inputs have different lengths, so we can confirm that masking works (GPT-2 is not working 💔, added a TODO);
Removes redundant tests (some tests outside the integration tests were covering the same thing).
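For illustration, here is a minimal sketch of the GPU-conditional skip and of a batch with inputs of different lengths. It is not the code from this PR: `gpu_available`, `left_pad`, the class and test names, and the toy `masked_sum` function are all hypothetical, and left padding is an assumption (typical for decoder-only models). A real test would call the model's generate method instead of the toy function.

```python
import unittest


def gpu_available() -> bool:
    """Hypothetical helper: True if TensorFlow is installed and sees a GPU."""
    try:
        import tensorflow as tf
    except ImportError:
        return False
    return len(tf.config.list_physical_devices("GPU")) > 0


def left_pad(batch, pad_id=0):
    """Left-pad token-id lists to a common length and build the attention mask."""
    width = max(len(ids) for ids in batch)
    input_ids = [[pad_id] * (width - len(ids)) + ids for ids in batch]
    attention_mask = [[0] * (width - len(ids)) + [1] * len(ids) for ids in batch]
    return input_ids, attention_mask


class XLAGenerateSketch(unittest.TestCase):
    # Skipped on machines without a GPU, mirroring the first item above.
    @unittest.skipIf(not gpu_available(), "XLA generate tests need a GPU")
    def test_masked_sum_under_xla(self):
        import tensorflow as tf

        # Batch size 2, inputs of different lengths (third item above).
        input_ids, attention_mask = left_pad([[5, 6, 7], [8]], pad_id=99)

        @tf.function(jit_compile=True)  # compile the toy function with XLA
        def masked_sum(ids, mask):
            # Padding positions must not contribute to the result.
            return tf.reduce_sum(ids * mask, axis=-1)

        sums = masked_sum(tf.constant(input_ids), tf.constant(attention_mask))
        self.assertEqual(sums.numpy().tolist(), [18, 8])
```

If the mask were ignored, the second row would pick up the pad ids and the assertion would fail, which is the kind of regression a padded-batch test is meant to catch.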

Rocketknight1 · 2022-04-21T11:41:09Z

tests/t5/test_modeling_tf_t5.py

+        expected_output_string = [
+            "Heute ist ein schöner Tag.",
+            "Ich habe vier Katzen, drei Hunde, zwei Vögel und ein Pferd.",
+        ]


Are we guaranteed that the logits are stable enough that we'll always sample this exact output? A flaky test can be really annoying!

gante · 2022-04-21

We haven't had problems with similar tests, so I'm assuming we won't have problems :D

Rocketknight1 · 2022-04-21

...fair point!
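To make the stability question concrete, here is a toy sketch (not code from this PR) of how a tiny difference in logits can change a sampled token even when the random draw is identical. The two-token distribution, the `1e-6` perturbation standing in for an XLA vs. eager discrepancy, and the draw `u = 0.5` are all contrived assumptions chosen to sit exactly on a decision boundary.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, u):
    """Inverse-CDF sampling: first index whose cumulative probability exceeds u."""
    probs = softmax(logits)
    cumulative = 0.0
    for index, p in enumerate(probs):
        cumulative += p
        if u < cumulative:
            return index
    return len(probs) - 1


eager_logits = [0.0, 0.0]   # two equally likely tokens, boundary at 0.5
xla_logits = [1e-6, 0.0]    # same logits with a tiny numerical difference
u = 0.5                     # the same "seeded" uniform draw in both cases

print(sample_token(eager_logits, u), sample_token(xla_logits, u))
```

The two calls return different token indices. In autoregressive generation one such flip changes every later step, so exact-match assertions are more fragile for sampling than for greedy decoding.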

Rocketknight1

Overall, this looks great! The fact that we can't really test sample outputs is annoying, but I see why we can't really get around that problem.
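A rough sketch of what a sampling test can still assert under that constraint, following the PR description's a) seedable and b) sensible. `toy_sample_generate` is a hypothetical stand-in for a seeded sampling call, not the transformers API, and `VOCAB` and `check_sampling` are likewise made up for illustration.

```python
import random

VOCAB = ["the", "cat", "dog", "sat", "ran", "."]


def toy_sample_generate(seed, length=8):
    """Hypothetical stand-in for a seeded sampling generate call."""
    rng = random.Random(seed)
    return [rng.choice(VOCAB) for _ in range(length)]


def check_sampling(generate_fn):
    first = generate_fn(seed=42)
    second = generate_fn(seed=42)
    # a) seeding works: the same seed on the same code path reproduces the output
    assert first == second
    # b) the result is sensible: expected length, only known tokens
    assert len(first) == 8
    assert all(token in VOCAB for token in first)
    return first


check_sampling(toy_sample_generate)
```

Note that the check compares a code path only with itself. It deliberately does not compare XLA output against non-XLA output token by token.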

gante · 2022-04-22T11:04:39Z

@patrickvonplaten I've reintroduced the fast tests and will merge as soon as CI is green

rework XLA generate tests

0b765aa

gante requested review from Rocketknight1 and patrickvonplaten April 20, 2022 21:24

gante added 2 commits April 21, 2022 09:44

make fixup

87ec87b

appropriate gpu detection fn

72745bd

Rocketknight1 reviewed Apr 21, 2022

Rocketknight1 approved these changes Apr 21, 2022

add TODOs to remove the skips after the original problem gets sorted

2784f80

patrickvonplaten reviewed Apr 22, 2022

tests/gpt2/test_modeling_tf_gpt2.py (resolved)

patrickvonplaten reviewed Apr 22, 2022

tests/t5/test_modeling_tf_t5.py (resolved)

patrickvonplaten approved these changes Apr 22, 2022

patrickvonplaten reviewed Apr 22, 2022

tests/gpt2/test_modeling_tf_gpt2.py (resolved)

reintroduce fast tests

79c7132

gante merged commit 6d90d76 into huggingface:main Apr 22, 2022

gante deleted the gpu_xla_tests branch April 22, 2022 11:38

elusenji pushed a commit to elusenji/transformers that referenced this pull request Jun 12, 2022

TF: rework XLA generate tests (huggingface#16866)

ab68d0b
