All translations

Enter a message name below to show all available translations.

Found 3 translations.

Name	Current message text
^h English (en)	A common heuristic is to use a {{Term\|learning rate}} 10–100x smaller for pretrained layers than for the new classification head, preventing catastrophic forgetting of learned representations.
^h Spanish (es)	Una heurística común es usar una {{Term\|learning rate\|tasa de aprendizaje}} de 10 a 100 veces menor para las capas preentrenadas que para la nueva cabeza de clasificación, evitando el olvido catastrófico de las representaciones aprendidas.
^h Chinese (zh)	一种常见的经验法则是为预训练层使用比新分类头小 10–100 倍的{{Term\|learning rate\|学习率}}，从而防止已学习表示的灾难性遗忘。