[BUG] remove customized attentionLSTM layer. Replace with keras attention #8721

jnrusson1 · 2025-08-21T08:55:23Z

Reference Issues/PRs

Fixes #8696
May clash with #8710

What does this implement/fix? Explain your changes.

This PR fixes a critical dimension mismatch issue in the LSTM-FCN network's attention mechanism. The original custom attention implementation was designed for sequence-level processing but was being used in an LSTM cell context, causing the attention mechanism to fail when processing individual timesteps.

Changes made:

Replaced the complex, buggy custom AttentionLSTM implementation with the standard Keras Attention layer
Fixed the input shape handling by applying attention to the full sequence before LSTM processing
Removed the problematic _time_distributed_dense function that expected timesteps that didn't exist at the cell level
Maintained backward compatibility - the attention=True parameter still works as expected
Simplified the codebase by removing ~800 lines of custom attention implementation

Why this fix was needed:
The original attention mechanism tried to process individual timesteps (batch, features) but expected full sequences (batch, timesteps, features). This caused the _time_distributed_dense function to fail when trying to reshape inputs that had no timestep dimension.

Does your contribution introduce a new dependency? If yes, which one?

No new dependencies. The Keras Attention layer is already available in TensorFlow, which is already a dependency of sktime.

What should a reviewer concentrate their feedback on?

Verify that the attention mechanism now works correctly with LSTM-FCN
Check that the fix maintains backward compatibility
Confirm that the simplified implementation is cleaner and more maintainable
Ensure that the dimension handling is now correct

Did you add any tests for the change?

Yes, the existing tests should continue to pass. The fix resolves the core issue that was preventing the attention mechanism from working, so existing LSTM-FCN tests with attention=True should now work correctly. (not sure if there were any of these tests though(

Any other comments?

This fix addresses a fundamental architectural issue where the attention mechanism was misapplied. The original custom implementation was overly complex for the use case and introduced bugs. The Keras Attention layer provides the same functionality in a simpler, more reliable way.

The fix also makes the codebase more maintainable by removing custom attention code that was difficult to debug and maintain.

fkiraly · 2025-08-21T09:56:17Z

fixed a small linting error and started the tests

fkiraly

The failure is unrelated

remove customized attentionLSTM layer. Replace with keras attention

23df3c4

jnrusson1 requested review from benHeid, felipeangelimvieira, fkiraly, fnhirwa, geetu040 and yarnabrina as code owners August 21, 2025 08:55

fkiraly added module:classification classification module: time series classification enhancement Adding new functionality labels Aug 21, 2025

Update lstmfcn.py

636835e

fkiraly approved these changes Aug 21, 2025

View reviewed changes

fkiraly merged commit 3696d8c into sktime:main Aug 21, 2025
64 of 66 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[BUG] remove customized attentionLSTM layer. Replace with keras attention #8721

[BUG] remove customized attentionLSTM layer. Replace with keras attention #8721

Uh oh!

jnrusson1 commented Aug 21, 2025

Uh oh!

fkiraly commented Aug 21, 2025

Uh oh!

fkiraly left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[BUG] remove customized attentionLSTM layer. Replace with keras attention #8721

[BUG] remove customized attentionLSTM layer. Replace with keras attention #8721

Uh oh!

Conversation

jnrusson1 commented Aug 21, 2025

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

Uh oh!

fkiraly commented Aug 21, 2025

Uh oh!

fkiraly left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!