Upgrade applications/text_classifications/multi_class to use Trainer API #3679
Conversation
- `device`: the device to use; defaults to `gpu`.
- `per_device_train_batch_size`: the number of training samples per card per step. Adjust this up or down according to the available GPU memory.
- `per_device_eval_batch_size`: the number of evaluation samples per card per step. Adjust this up or down according to the available GPU memory.
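When tuning the per-device batch sizes above, note that the effective global batch size also depends on the number of cards (and any gradient accumulation). A minimal sketch of that relationship; the numbers are illustrative and not taken from this PR:

```python
# Effective global batch size when training on multiple cards.
# Illustrative values only; tune per_device_train_batch_size to fit GPU memory.

def effective_batch_size(per_device_batch_size: int,
                         num_devices: int,
                         gradient_accumulation_steps: int = 1) -> int:
    """Number of samples contributing to one optimizer step."""
    return per_device_batch_size * num_devices * gradient_accumulation_steps

# e.g. 32 samples per card on 4 GPUs with no accumulation:
print(effective_batch_size(32, 4))  # 128
```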
I can't find a description of the `save_strategy` parameter. Please check the full set of configuration options used for anything that is missing.
Because Trainer is now supported, a large number of new parameters became available, so this section only documents the main parameters users are likely to adjust. For the rest, the docs link to the TrainingArguments parameter reference for users to consult.
--metric_for_best_model accuracy \
--load_best_model_at_end \
--evaluation_strategy epoch \
--save_strategy epoch
Add a new configurable parameter, `--save_total_limit 1`, to limit the number of saved checkpoints.
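`save_total_limit` caps how many checkpoints stay on disk by deleting the oldest ones as new ones are written. A rough sketch of that rotation logic (not PaddleNLP's actual implementation; the `checkpoint-<step>` directory naming is an assumption):

```python
import os
import re
import shutil

def rotate_checkpoints(output_dir: str, save_total_limit: int) -> None:
    """Delete the oldest checkpoint-* directories, keeping at most `save_total_limit`."""
    pattern = re.compile(r"checkpoint-(\d+)$")
    checkpoints = []
    for name in os.listdir(output_dir):
        match = pattern.match(name)
        if match and os.path.isdir(os.path.join(output_dir, name)):
            checkpoints.append((int(match.group(1)), name))
    # Sort by global step so the lowest-numbered (oldest) checkpoints come first.
    checkpoints.sort()
    for _, name in checkpoints[: max(0, len(checkpoints) - save_total_limit)]:
        shutil.rmtree(os.path.join(output_dir, name))
```

With `save_total_limit 1`, only the most recent checkpoint survives each save, which keeps disk usage bounded during long epoch-level saving.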
Addressed comments from @lugimzzz
LGTM
PR types
Function optimization
PR changes
Others
Description
Upgrade applications/text_classifications/multi_class to use Trainer API