TF: XLA bad words logits processor and list of processors #16974
Conversation
The documentation is not available anymore as the PR was closed or merged.
@Rocketknight1 @patrickvonplaten I'm stuck on the ngram logits processor, so I'd like to request your suggestions regarding what to try out next :D The bad words logits processor is ready and XLA-compatible. Context:
Things I've tried (without any symptom change):
I'd be very much in favor of just not converting the ngram logits processor. I think it's now more important to think about how to advertise and document XLA TF generate well, and not lose too much time on this.
Also, not that many models use this processor (I only know of BART and T5, for some summarization tasks).
Agree that it's not necessary to convert this one, but examining it, I suspect that there are some sneaky changes in output size depending on inputs, and XLA is struggling to deal with it. It seems very tough to convert to XLA, but if we decide we need it later, let me know and I'll do my best to dig into it.
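(As a hypothetical minimal repro of this failure mode, not code from the PR: XLA needs static shapes at compile time, so an op whose output size depends on the values of its inputs cannot be compiled.)

```python
import tensorflow as tf

# Hypothetical repro: tf.boolean_mask produces an output whose length
# depends on the data, which XLA cannot lower to a static-shape program.
@tf.function(jit_compile=True)
def dynamic_filter(x):
    return tf.boolean_mask(x, x > 0)  # output size is data-dependent

# Typically fails to compile, with an error about dynamic/data-dependent shapes.
dynamic_filter(tf.constant([1.0, -2.0, 3.0]))
```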
Great 👍 I'm going to revert that one, add a TODO pointing at this PR, add a few final tests for the list of logits processors with XLA, and will ping you back.
@Rocketknight1 @patrickvonplaten ready for review
@@ -401,6 +421,11 @@ def _get_generated_ngrams(hypo_idx):

    def __call__(self, input_ids: tf.Tensor, scores: tf.Tensor, cur_len: int) -> tf.Tensor:
        # TODO (joao): enable XLA on this logits processor. See discussion and attempts in
        # https://github.com/huggingface/transformers/pull/16974
        if not tf.executing_eagerly():
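(The diff cuts off before the guard's body; below is a minimal sketch of such an eager-only guard. The class name, exception type, and message are assumptions for illustration, not the PR's actual code.)

```python
import tensorflow as tf

class TFNoRepeatNGramLogitsProcessor:  # name assumed for illustration
    def __call__(self, input_ids: tf.Tensor, scores: tf.Tensor, cur_len: int) -> tf.Tensor:
        # Fail loudly under graph/XLA tracing instead of silently producing
        # wrong results inside a compiled graph.
        if not tf.executing_eagerly():
            raise NotImplementedError(
                "This logits processor is only usable in eager execution for now."
            )
        # ... eager-only ngram-banning logic would follow here ...
        return scores
```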
100% with leaving as is for now!
Nice! Great job on getting no_bad_word_tokens to work!
            return banned_tokens

        # Compares the current row against all bad word sequences, obtaining a mask with the matches.
        match_mask = tf.map_fn(_tokens_match, tf.range(self.bad_word_seqs_ids.shape[0]), fn_output_signature=tf.bool)
If performance is slow, allowing more parallel_iterations here might improve things, since this is a lightweight comparison run over a potentially large number of bad_words.
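(A sketch of that suggestion; tf.map_fn accepts a parallel_iterations argument, which defaults to 10 in graph mode and 1 in eager mode. The value 32 below is illustrative, not from the PR.)

```python
# Same call as in the diff, allowing more parallel iterations: each
# per-sequence comparison is cheap, so dispatching more of them in
# parallel across bad-word rows may speed things up.
match_mask = tf.map_fn(
    _tokens_match,
    tf.range(self.bad_word_seqs_ids.shape[0]),
    fn_output_signature=tf.bool,
    parallel_iterations=32,  # illustrative; tune to the number of bad-word sequences
)
```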
Overall this looks good, especially since the tests are present and passing! The need to filter an arbitrary number of words, each of which can span multiple tokens, is very challenging to implement in XLA, so the fact that it's working at all seems almost miraculous, lol.
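(To make the challenge concrete, here is a hypothetical illustration of the usual XLA workaround, not the PR's actual code: pad all bad-word sequences into one fixed-shape tensor so every comparison stays shape-stable.)

```python
import tensorflow as tf

# Hypothetical illustration: bad-word sequences of different token lengths
# are padded into a single dense tensor with a static shape.
bad_words = [[42], [7, 19], [7, 19, 3]]  # made-up token ids
max_len = max(len(seq) for seq in bad_words)
padded = tf.constant([seq + [-1] * (max_len - len(seq)) for seq in bad_words])
# padded has static shape [num_bad_words, max_len]; -1 marks padding and can
# never equal a real (non-negative) token id, so comparison masks keep a
# fixed shape under XLA no matter how many tokens each banned word spans.
print(padded.shape)  # (3, 3)
```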
What does this PR do?
This PR converts the bad_words logits processor to be XLA-compatible. As per the discussion below, I was unable to convert the ngrams one -- added an exception and a TODO.

Also makes a change to the list of processors -- XLA raised issues when the processors had different arguments, so I had to add cur_len to all processors. After the change, the list wrapper is also compatible with XLA.
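(A hedged sketch of the resulting calling convention; the wrapper name follows transformers' TF naming, but the implementation details here are assumptions, not the PR's exact code.)

```python
import tensorflow as tf

class TFLogitsProcessorList(list):
    # Sketch: once every processor shares the (input_ids, scores, cur_len)
    # signature, the list wrapper can apply them in a uniform loop, and the
    # whole chain can be traced/compiled by XLA as a single function.
    def __call__(self, input_ids: tf.Tensor, scores: tf.Tensor, cur_len: int) -> tf.Tensor:
        for processor in self:
            scores = processor(input_ids, scores, cur_len)
        return scores
```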