Translations:Attention Is All You Need/21/en

    Revision as of 04:22, 28 April 2026 by FuzzyBot (talk | contribs) (Importing a new version from external source)

    On the WMT 2014 English-to-German translation task, the big Transformer model achieved a BLEU score of 28.4, surpassing the previous best results, including ensembles, by more than 2 BLEU points. On the WMT 2014 English-to-French task, it achieved a BLEU score of 41.0, establishing a new single-model state of the art at a fraction of the training cost of prior models.