Pages that link to "Module:Glossary"
The following pages link to Module:Glossary:
Displayed 50 items.
- Wide & Deep Learning for Recommender Systems/en (transclusion) (← links)
- Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer/paper/en (transclusion) (← links)
- Decoupled Weight Decay Regularization/paper/en (transclusion) (← links)
- MTGR: Industrial-Scale Generative Recommendation Framework in Meituan/paper/en (transclusion) (← links)
- Searching for Activation Functions/paper/en (transclusion) (← links)
- Searching for Activation Functions/en (transclusion) (← links)
- Incorporating Nesterov Momentum into Adam/en (transclusion) (← links)
- Decoupled Weight Decay Regularization/en (transclusion) (← links)
- Dropout: A Simple Way to Prevent Neural Networks from Overfitting/en (transclusion) (← links)
- Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts/paper/en (transclusion) (← links)
- Deep & Cross Network for Ad Click Predictions/en (transclusion) (← links)
- Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer/en (transclusion) (← links)
- Logistic regression (transclusion) (← links)
- Logistic regression/en (transclusion) (← links)
- Transformer (transclusion) (← links)
- Factorization Machines (transclusion) (← links)
- Transformer/en (transclusion) (← links)
- Deep learning (transclusion) (← links)
- Logistic regression/es (transclusion) (← links)
- Logistic regression/zh (transclusion) (← links)
- Factorization Machines/en (transclusion) (← links)
- Deep learning/en (transclusion) (← links)
- Transformer/zh (transclusion) (← links)
- Transformer/es (transclusion) (← links)
- Factorization Machines/es (transclusion) (← links)
- Deep learning/es (transclusion) (← links)
- Deep learning/zh (transclusion) (← links)
- Factorization Machines/zh (transclusion) (← links)
- DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems/paper/zh (transclusion) (← links)
- DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems/paper/es (transclusion) (← links)
- Adam: A Method for Stochastic Optimization/paper (transclusion) (← links)
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding/paper (transclusion) (← links)
- Attention Is All You Need/paper (transclusion) (← links)
- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift/paper (transclusion) (← links)
- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift/paper/es (transclusion) (← links)
- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift/paper/en (transclusion) (← links)
- Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift/paper/zh (transclusion) (← links)
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding/paper/zh (transclusion) (← links)
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding/paper/es (transclusion) (← links)
- Language Models are Few-Shot Learners/paper (transclusion) (← links)
- Generative Adversarial Networks/paper (transclusion) (← links)
- Attention Is All You Need/paper/zh (transclusion) (← links)
- Attention Is All You Need/paper/es (transclusion) (← links)
- Efficient Estimation of Word Representations in Vector Space/paper (transclusion) (← links)
- Deep Residual Learning for Image Recognition/paper (transclusion) (← links)
- Language Models are Few-Shot Learners/paper/zh (transclusion) (← links)
- Language Models are Few-Shot Learners/paper/es (transclusion) (← links)
- Adam: A Method for Stochastic Optimization/paper/es (transclusion) (← links)
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding/paper/en (transclusion) (← links)
- Adam: A Method for Stochastic Optimization/paper/zh (transclusion) (← links)