Skip to content

nan error in eval_T2M_HumamML3D #190

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
CMY-CTO opened this issue Feb 2, 2024 · 2 comments
Closed

nan error in eval_T2M_HumamML3D #190

CMY-CTO opened this issue Feb 2, 2024 · 2 comments

Comments

@CMY-CTO
Copy link

CMY-CTO commented Feb 2, 2024

Hi!
When I run the evaluation part of HumanML3D with the prompt python -m eval.eval_humanml --model_path ./save/humanml_trans_enc_512/model000475000.pt --device 0, there's a NAN Error.
Whether it's Replication 0 or Replication 1, they all show the same problem.
Screen Shot 2024-02-02 at 19 44 02
Screen Shot 2024-02-02 at 19 44 44 1

And it should be a number, not NaN, as the screenshot attached shows, which is from the official evaluation file eval_humanml_trans_enc_512_000475000_gscale2.5_wo_mm.log in ./save/humanml_trans_enc_512
image

It's really puzzling.
I am looking forward to your help, and thank you in advance!

@CMY-CTO
Copy link
Author

CMY-CTO commented Feb 2, 2024

And with the screenshot of the detailed error message.
image
I really want to know what caused Nan error to occur, not only in the official checkpoint but also in the own ckpt after train_MDM
image

@CMY-CTO CMY-CTO changed the title nan error in eval_HumamML3D nan error in eval_T2M_HumamML3D Feb 2, 2024
@GuyTevet
Copy link
Owner

GuyTevet commented Feb 4, 2024

Can you check #110 and see if it helps?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants