Fix PyTorch stateful RNN/LSTM gradient computation error resolves #20875 #20916
Conversation
Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). View this failed invocation of the CLA check for more information. For the most up-to-date status, view the checks section at the bottom of the pull request.
Codecov Report: All modified and coverable lines are covered by tests ✅

```
@@            Coverage Diff             @@
##           master   #20916      +/-   ##
==========================================
+ Coverage   82.22%   82.44%   +0.21%
==========================================
  Files         561      561
  Lines       52955    53219     +264
  Branches     8205     8245      +40
==========================================
+ Hits        43544    43876     +332
+ Misses       7373     7336      -37
+ Partials     2038     2007      -31
```
Force-pushed from 4bfd4a8 to e0c4415
Force-pushed from e0c4415 to 48e20f6
Thanks for the PR!
…as-team#20875 (keras-team#20916)
* Fix PyTorch stateful RNN gradient computation error
* Updates post feedback
The error occurred because the PyTorch autograd engine detected that tensors required for gradient computation were modified in-place, invalidating the computational graph.
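For context, here is a minimal standalone sketch of that failure class in plain PyTorch (illustrative only, not the Keras code): the carried state is saved by autograd for the backward pass of the recurrent matmul, and the subsequent in-place stateful update invalidates it.

```python
import torch

kernel = torch.randn(4, 4, requires_grad=True)
state = torch.zeros(2, 4)                # persistent "stateful" hidden state

output = torch.tanh(state @ kernel)      # `state` is saved to compute the gradient w.r.t. `kernel`
state.copy_(output.detach())             # stateful update mutates the saved tensor in place

try:
    output.sum().backward()
except RuntimeError as err:
    # "... one of the variables needed for gradient computation has been
    # modified by an inplace operation"
    print(err)
```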
Fix: added explicit state cloning in RNN.step() for the PyTorch backend when stateful=True, so each step operates on new tensor objects with separate memory instead of mutating the tensors autograd has saved.
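And a rough sketch of the cloning idea under the same simplified setup (the `step` function and dense recurrence here are illustrative, not the actual Keras RNN.step() implementation): cloning the carried state means autograd saves a tensor whose memory the later in-place state update never touches.

```python
import torch

def step(inputs, state, kernel, recurrent_kernel):
    # Clone the carried state so autograd saves a fresh tensor whose
    # storage is never mutated by the subsequent stateful update.
    state = state.clone()
    output = torch.tanh(inputs @ kernel + state @ recurrent_kernel)
    return output, output

kernel = torch.randn(3, 4, requires_grad=True)
recurrent_kernel = torch.randn(4, 4, requires_grad=True)
persistent_state = torch.zeros(2, 4)
inputs = torch.randn(2, 3)

output, new_state = step(inputs, persistent_state, kernel, recurrent_kernel)
persistent_state.copy_(new_state.detach())  # in-place update touches only the original buffer
output.sum().backward()                     # succeeds: autograd saved the clone, not the buffer
```

The extra clone costs one copy of the state per call, but it keeps the tensors saved for backward() valid even though the persistent state buffer is still updated in place.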