Efficient decoder text generation wrapper #273

jimypbr · 2023-02-27T23:58:30Z

What does this PR do?

Current text generation returns all the logits back from IPU to host. However we only need the logits from the token to be generated. This wraps the model to return only what is necessary to improve the IO performance of the text generation.

Fixes # (issue)

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

HuggingFaceDocBuilderDev · 2023-02-28T00:02:10Z

The documentation is not available anymore as the PR was closed or merged.

…ent dynamic slice implementation

…able in pipelines

…ause of other complications caused by this.

…he default ipu_config for text-gen pipelines.

optimum/graphcore/pipelines/__init__.py

katalinic-gc

lgtm

* Set the default matmul_proportion in IPUConfig to 0.2 so default config will work with decoder wrapper

Efficient encoder-decoder text generation wrapper

7b9fb5d

jimypbr added 7 commits March 6, 2023 17:10

Fixed enabled wrapper for all generation methods

ebe7018

Changed Seq2SeqWrapper to just DecoderWrapper

a168588

Manually set the available memory proportion to use the memory effici…

e83791a

…ent dynamic slice implementation

Put encoder on IPU

dafb6fe

Add optional execution encoder on cpu option. Clean up encoder execut…

5d6119a

…able in pipelines

Revert the encoder on IPU. I will deal with this in a separate PR bec…

d290059

…ause of other complications caused by this.

Add IPUConfig as input type to pipeline ipu_config argument. Update t…

e19ef88

…he default ipu_config for text-gen pipelines.

jimypbr marked this pull request as ready for review March 10, 2023 12:44

style fix

50dffca

jimypbr changed the title ~~Efficient encoder-decoder text generation wrapper~~ Efficient decoder text generation wrapper Mar 10, 2023

katalinic-gc reviewed Mar 10, 2023

View reviewed changes

optimum/graphcore/pipelines/__init__.py Outdated Show resolved Hide resolved

katalinic-gc approved these changes Mar 10, 2023

View reviewed changes

Remove reference to poptorch_encoder

a903b21

jimypbr merged commit 66929fd into main Mar 10, 2023

jimypbr deleted the faster-textgen branch March 10, 2023 15:25

ncouro-gc pushed a commit to graphcore/optimum-graphcore-fork that referenced this pull request Mar 17, 2023

Efficient decoder text generation wrapper (huggingface#273)

ac9d86e

* Set the default matmul_proportion in IPUConfig to 0.2 so default config will work with decoder wrapper

ncouro-gc pushed a commit to graphcore/optimum-graphcore-fork that referenced this pull request Mar 17, 2023

Efficient decoder text generation wrapper (huggingface#273)

609c870

* Set the default matmul_proportion in IPUConfig to 0.2 so default config will work with decoder wrapper

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Efficient decoder text generation wrapper #273

Efficient decoder text generation wrapper #273

jimypbr commented Feb 27, 2023

HuggingFaceDocBuilderDev commented Feb 28, 2023 •

edited

Loading

katalinic-gc left a comment

Efficient decoder text generation wrapper #273

Efficient decoder text generation wrapper #273

Conversation

jimypbr commented Feb 27, 2023

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented Feb 28, 2023 • edited Loading

katalinic-gc left a comment

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Feb 28, 2023 •

edited

Loading