Revision as of 19:45, 16 September 2020
Working Notes for Norwegian BERT-Like Models
Available Text Corpora
Preprocessing and Tokenization
The [https://github.com/google/sentencepiece SentencePiece] library finds '''157''' unique characters in the Norwegian Wikipedia dump.
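As a sanity check before training a tokenizer, the unique-character count can be reproduced directly. A minimal sketch in Python, assuming the dump has already been extracted to plain text (the in-memory sample below is illustrative, not the real corpus):

```python
# Sketch: count distinct characters in a text corpus, as a cross-check
# against the character count reported by SentencePiece.
from collections import Counter

def unique_chars(lines):
    """Return a Counter of characters across an iterable of strings."""
    counts = Counter()
    for line in lines:
        counts.update(line.rstrip("\n"))
    return counts

# Toy usage on in-memory Norwegian text instead of the real Wikipedia dump:
sample = ["Norsk tekst med æ, ø og å.", "BERT-modeller for bokmål."]
vocab = unique_chars(sample)
print(len(vocab))  # number of distinct characters in the sample
```

On the real dump one would stream the file line by line instead of holding it in memory; rare characters (very low counts) are candidates for normalization before tokenizer training.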
Evaluation
Are there Norwegian test sets available for typical NLP tasks that we can use to evaluate our NorBERT?