Translations:Attention Is All You Need/21/en

    From Marovi AI
    Revision as of 00:32, 27 April 2026 by FuzzyBot (talk | contribs) (Importing a new version from external source)
    (diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

    On the WMT 2014 English-to-German translation task, the big Transformer model achieved a BLEU score of 28.4, surpassing the previous best results including ensembles by over 2 BLEU points. On the WMT 2014 English-to-French task, it achieved 41.0 BLEU, establishing a new single-model state-of-the-art at a fraction of the training cost of prior models.