Fixup no_trainer examples scripts and add more tests #16765
Merged
Fixup no_trainer examples and bolster their tests

What does this add?
This changes the logging behavior inside the no_trainer scripts, slightly changes how the initial configuration is stored, and adds tests for the tracking API.

Who is it for?
Users of transformers who want to try out Accelerate quickly.

Why is this needed?
I was made aware that the no_trainer scripts were laggy in how logs were sent to Weights and Biases, because the step was being passed in as a separate parameter, which delayed when it got uploaded. To follow the original Accelerate scripts, the step is now passed as a "step" entry in the overall dictionary logged via accelerate.log().
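A minimal sketch of the kind of change described here, assuming illustrative names such as accelerator, total_loss, completed_steps, and epoch (the exact metrics and names vary per script):

```python
from accelerate import Accelerator

# Illustrative setup; the real scripts build this from parsed arguments.
accelerator = Accelerator(log_with="wandb")
accelerator.init_trackers("example_no_trainer")

total_loss, num_batches, epoch, completed_steps = 1.23, 100, 0, 10

# Before: the step was only passed as a keyword argument, which delayed
# when Weights and Biases actually uploaded the values.
# accelerator.log({"train_loss": total_loss / num_batches, "epoch": epoch}, step=completed_steps)

# After: "step" is included in the logged dictionary itself, matching the
# original Accelerate example scripts.
accelerator.log({"train_loss": total_loss / num_batches, "epoch": epoch, "step": completed_steps})
```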
TensorBoard also does not like when Enums are logged, so there is a manual adjustment right before saving the hyperparameters to get the enum value from the LR scheduler type.
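For example, a sketch of that adjustment, with illustrative names and a trimmed-down argument parser (project_dir may be called logging_dir in older Accelerate versions):

```python
import argparse

from accelerate import Accelerator
from transformers import SchedulerType

# Illustrative argument parsing; the real scripts define many more options.
parser = argparse.ArgumentParser()
parser.add_argument("--lr_scheduler_type", type=SchedulerType, default="linear")
args = parser.parse_args([])

accelerator = Accelerator(log_with="tensorboard", project_dir="runs")

experiment_config = vars(args)
# TensorBoard's hyperparameter logging cannot handle Enum members, so store
# the raw value of the LR scheduler type right before saving the config.
experiment_config["lr_scheduler_type"] = experiment_config["lr_scheduler_type"].value
accelerator.init_trackers("example_no_trainer", experiment_config)
```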
Finally, as TensorBoard is a test requirement, I added tests for tracking inside the no_trainer tests, since TensorBoard is also how we test that behavior in the CI in Accelerate proper.
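A rough sketch of what such a test can look like; the script path, arguments, and expected tracking folder name are assumptions for illustration, not the exact test that was added:

```python
import os
import subprocess
import sys
import tempfile
import unittest


class NoTrainerTrackingTest(unittest.TestCase):
    def test_run_glue_no_trainer_with_tracking(self):
        # Assumes run_glue_no_trainer.py is available locally and supports
        # --with_tracking, writing TensorBoard event logs under --output_dir.
        with tempfile.TemporaryDirectory() as tmp_dir:
            cmd = [
                sys.executable,
                "run_glue_no_trainer.py",
                "--model_name_or_path", "distilbert-base-uncased",
                "--task_name", "mrpc",
                "--max_train_steps", "10",
                "--output_dir", tmp_dir,
                "--with_tracking",
            ]
            subprocess.run(cmd, check=True)
            # With tracking enabled, a TensorBoard log folder should appear
            # inside the output directory.
            self.assertTrue(os.path.isdir(os.path.join(tmp_dir, "glue_no_trainer")))


if __name__ == "__main__":
    unittest.main()
```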