
How to use it in transformer? #1

Open
kpmokpmo opened this issue Sep 15, 2021 · 3 comments

Comments

@kpmokpmo

Hi, thanks for your work.

Just several quick questions here:

  1. When embedding the S/T norm blocks into the transformer baseline, should I discard or keep the original layer/group norm?
  2. Your paper and 'Data Normalization for Bilinear Structures in High-Frequency Financial Time-series' seem quite similar. Just curious whether there is any main difference I didn't notice.

Thank you very much!

@JLDeng
Owner

JLDeng commented Sep 15, 2021

Hi, thanks for your interest.

  1. In my experience, you can keep the original layer, but it may depend on your task.
  2. Thanks for the pointer. I have just checked this paper, and I think the basic idea is similar. One of the major differences is that the normalized features should be combined with the original features and then fed into the following operations (a rough sketch is below); otherwise, the forecasting results would not be good.
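
For concreteness, here is a rough sketch of that combination step (a hypothetical example, not the exact code in this repo; snorm/tnorm and the 1x1 conv are placeholders for the spatial/temporal normalization modules):

import torch
import torch.nn as nn

class STNormMix(nn.Module):
    # Hypothetical sketch: snorm / tnorm stand in for the S-norm / T-norm modules.
    def __init__(self, channels, snorm, tnorm):
        super().__init__()
        self.snorm = snorm
        self.tnorm = tnorm
        # A 1x1 conv mixes the concatenated (original + S-normed + T-normed)
        # channels back to the original width for the following operations.
        self.mix = nn.Conv2d(3 * channels, channels, kernel_size=1)

    def forward(self, x):
        # x: (batch, channels, nodes, time) features (assumed layout for this sketch)
        x_s = self.snorm(x)  # spatially normalized view
        x_t = self.tnorm(x)  # temporally normalized view
        # Keep the original features alongside the normalized views;
        # feeding only the normalized features hurts the forecasts.
        out = torch.cat([x, x_s, x_t], dim=1)
        return self.mix(out)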

JLDeng closed this as completed Sep 15, 2021
JLDeng reopened this Sep 15, 2021
@JLDeng
Owner

JLDeng commented Sep 15, 2021

In addition, I noticed that they only applied normalization to the input data. Our work demonstrates that this operation can be generalized to the latent space.
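
Roughly, the difference looks like this (again a hypothetical sketch; STNormMix is the placeholder module above, and make_layer could build any conv or attention layer). Instead of normalizing the raw input once, the normalization block is interleaved with the layers so that it also acts on hidden features:

import torch.nn as nn

class LatentSTNet(nn.Module):
    # Hypothetical sketch: S/T normalization interleaved with the layers
    # (latent space), rather than applied once to the raw input.
    def __init__(self, channels, num_layers, make_st_block, make_layer):
        super().__init__()
        self.st_blocks = nn.ModuleList([make_st_block(channels) for _ in range(num_layers)])
        self.layers = nn.ModuleList([make_layer(channels) for _ in range(num_layers)])

    def forward(self, x):
        for st, layer in zip(self.st_blocks, self.layers):
            x = layer(st(x))  # normalize the current hidden features, then transform them
        return x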

@kpmokpmo
Author

Thank you for the quick reply! I still want to double-check the design.
If the attention block has the following structure:

# S+T norm & concat+conv (proposed insertion point)
x = x + self.drop_path(self.attn(self.norm1(x)))  # attention sub-block with pre-norm (norm1)
x = x + self.drop_path(self.mlp(self.norm2(x)))   # MLP sub-block with pre-norm (norm2)

I think self.norm1, at least, plays a role that duplicates the S/T norm layer. Please correct me if I shouldn't insert the ST norm here at all. Many thanks.
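
To make my question concrete, the wiring I have in mind is roughly this (st_norm_mix is a hypothetical name for the S+T norm & concat+conv block; my question is whether norm1 inside the block then becomes redundant):

import torch.nn as nn

class BlockWithSTNorm(nn.Module):
    # Hypothetical wiring: st_norm_mix is the S+T norm & concat+conv block,
    # inner is the existing attention block (its norm1/norm2 left untouched).
    def __init__(self, st_norm_mix, inner):
        super().__init__()
        self.st_norm_mix = st_norm_mix
        self.inner = inner

    def forward(self, x):
        x = self.st_norm_mix(x)  # S+T norm & concat+conv before the attention block
        return self.inner(x)     # inner still applies norm1 before attn and norm2 before mlp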
