Translations:Attention Mechanisms/7/en: Difference between revisions
Latest revision as of 23:33, 27 April 2026
where $W_s$, $W_h$, and $v$ are learned parameters. The attention weights are obtained by applying softmax:
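This scoring scheme can be sketched in code. The snippet below assumes the additive (Bahdanau-style) form implied by the parameters named here, i.e. a score $e_t = v^\top \tanh(W_s s + W_h h_t)$ for each encoder state $h_t$, followed by a softmax over the scores; the function names, shapes, and random example values are illustrative assumptions, not part of the original text.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    z = x - np.max(x, axis=-1, keepdims=True)
    e = np.exp(z)
    return e / np.sum(e, axis=-1, keepdims=True)

def additive_attention_weights(s, H, W_s, W_h, v):
    """Additive (Bahdanau-style) attention weights.

    s:   decoder state, shape (d_s,)
    H:   encoder states, shape (T, d_h)
    W_s: (d_a, d_s), W_h: (d_a, d_h), v: (d_a,)  -- learned parameters
    Returns attention weights of shape (T,) that sum to 1.
    """
    # Score each encoder state: e_t = v^T tanh(W_s s + W_h h_t)
    scores = np.tanh(s @ W_s.T + H @ W_h.T) @ v   # shape (T,)
    return softmax(scores)

# Toy example with random (untrained) parameters.
rng = np.random.default_rng(0)
d_s, d_h, d_a, T = 4, 4, 3, 5
alpha = additive_attention_weights(
    rng.standard_normal(d_s),
    rng.standard_normal((T, d_h)),
    rng.standard_normal((d_a, d_s)),
    rng.standard_normal((d_a, d_h)),
    rng.standard_normal(d_a),
)
```

In practice the resulting weights `alpha` are used to form a context vector as a weighted sum of the encoder states, `alpha @ H`.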