Skip to content

Decoder-only Transformer models missing X-Blocks/AdaLN? #5

Description

@JiahuiKChen

Hi, thanks for this very clean repo!

I had a question while trying to reproduce the decoder-only transformer Table 1 results.

In the current code, it looks like the provided decoder-only Transformer configs for BC, BESO, and RF all use plain TransformerEncoder blocks rather than AdaLN/X-Blocks:

Could you clarify whether these existing decoder-only Transformer models are the ones that produced Table 1's Decoder-Only Transformer results?

If not, would it be possible to share the decoder-only X-Block/AdaLN implementations and configs corresponding to the Table 1 X-BC, X-BESO, and X-RF results?

Thanks a lot.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Fields

    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions