Difference between revisions of "Vectors/metadata"

From Nordic Language Processing Laboratory
Jump to: navigation, search
(Algorithms)
(Algorithms)
Line 4: Line 4:
  
 
== Algorithms ==
 
== Algorithms ==
* "command": exact command which was run to train the model, for example, <tt>word2vec -min-count 10 -size 100 -window 10 -negative 5 -iter 2 -threads 16 -cbow 0 -binary 0</tt>
+
* "'''command'''": exact command which was run to train the model, for example, <tt>"word2vec -min-count 10 -size 100 -window 10 -negative 5 -iter 2 -threads 16 -cbow 0 -binary 0"</tt>
* "id": NLPL identifier of the algorithm (an integer)
+
* "'''id'''": NLPL identifier of the algorithm (an integer)
* "name": human-readable name of the algorithm, for example, <tt>Gensim Continuous Skipgram</tt>
+
* "'''name'''": human-readable name of the algorithm, for example, <tt>"Gensim Continuous Skipgram"</tt>
* "tool": tool used to train models with this algorithm, for example, <tt>Gensim</tt>
+
* "'''tool'''": tool used to train models with this algorithm, for example, <tt>"Gensim"</tt>
* "url": webpage of the tool used, for example, <tt>https://github.com/RaRe-Technologies/gensim</tt>
+
* "'''url'''": webpage of the tool used, for example, <tt>"https://github.com/RaRe-Technologies/gensim"</tt>
* "version": version of the tool used, for example, <tt>3.6</tt>
+
* "'''version'''": version of the tool used, for example, <tt>"3.6"</tt>
  
 
== Corpora ==
 
== Corpora ==

Revision as of 00:02, 23 December 2019

This page describes the fields in the NLPL vector repository catalogue. The catalogue itself is a JSON file, for example, 20.json for the version 2.0 of the Repository.

All field except "id" are optional.

Algorithms

  • "command": exact command which was run to train the model, for example, "word2vec -min-count 10 -size 100 -window 10 -negative 5 -iter 2 -threads 16 -cbow 0 -binary 0"
  • "id": NLPL identifier of the algorithm (an integer)
  • "name": human-readable name of the algorithm, for example, "Gensim Continuous Skipgram"
  • "tool": tool used to train models with this algorithm, for example, "Gensim"
  • "url": webpage of the tool used, for example, "https://github.com/RaRe-Technologies/gensim"
  • "version": version of the tool used, for example, "3.6"

Corpora

Models