[ENHANCEMENT] Multi-Scale Encoder for Robust Zero-Shot Forecasting #250
Killer3048
started this conversation in
Ideas
Replies: 1 comment 1 reply
-
Thanks for the suggestions @Killer3048. I’m converting this into a discussion since we prefer to use issues for bugs or feature requests around the published models. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
[ENHANCEMENT] Multi-Scale encoder for robust Zero-Shot forecasting
Summary
This proposal outlines a Multi-Scale Encoder that processes time-series data at multiple resolutions (e.g., daily vs. weekly) in parallel. Each branch learns scale-specific patterns before a fusion step aggregates their outputs. Such a design significantly boosts zero-shot performance by enabling the model to generalize more effectively to new time-series domains with potentially unknown seasonalities.
Proposal
Parallel Branches for Different Timescales
Short-Term Branch
Medium-Term Branch
Optional Additional Branches
Branch-Specific Transformer Encoders
Patch
class) but configured to its patch size.InstanceNorm
), so that each scale handles outliers or mean shifts independently.Fusion Layer
Simplified Decoder
Rationale
Broader Pattern Recognition
Reduced Overfitting
Improved Generalization
Interpretability
Additional Notes
Implementation Feasibility
Patch
andInstanceNorm
modules with different configurations. Memory usage may grow with each branch, so weight-sharing or using fewer layers can mitigate this.Potential Attention Optimizations
O(n^2)
cost in each branch.By incorporating these adjustments, the model gains the ability to learn and fuse scale-specific patterns effectively, leading to more robust zero-shot forecasting performance.
Beta Was this translation helpful? Give feedback.
All reactions