Eosc/pretraining/nvidia

From Nordic Language Processing Laboratory

Revision as of 10:11, 7 November 2020 by Oe

Contents

1 Background
2 Software Installation
3 Data Preparation
4 Training Example

Background

This page provides a recipe for large-scale pre-training of a BERT neural language model using the high-efficiency NVIDIA BERT implementation, which is based on TensorFlow (in contrast to the NVIDIA Megatron code).

Software Installation

Data Preparation

Training Example
