Difference between revisions of "Vectors/metadata"
Revision as of 00:02, 23 December 2019
This page describes the fields in the NLPL vector repository (http://vectors.nlpl.eu/repository/) catalogue. The catalogue itself is a JSON file, for example, 20.json for version 2.0 of the repository.
All fields except "id" are optional.
Algorithms
- "command": exact command which was run to train the model, for example, word2vec -min-count 10 -size 100 -window 10 -negative 5 -iter 2 -threads 16 -cbow 0 -binary 0
- "id": NLPL identifier of the algorithm (an integer)
- "name": human-readable name of the algorithm, for example, Gensim Continuous Skipgram
- "tool": tool used to train models with this algorithm, for example, Gensim
- "url": webpage of the tool used, for example, https://github.com/RaRe-Technologies/gensim
- "version": version of the tool used, for example, 3.6
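As an illustration of the field list above, here is a minimal Python sketch of what a single algorithm entry might look like and how it could be checked. The helper name validate_algorithm and the "id" value 0 are hypothetical; the other values are the examples given above. How algorithm entries are nested inside the catalogue file is not described on this page, so the sketch deliberately works on one entry in isolation.

```python
import json

# Field names taken from the Algorithms list above.
REQUIRED_FIELD = "id"
OPTIONAL_FIELDS = {"command", "name", "tool", "url", "version"}


def validate_algorithm(entry):
    """Return a list of problems found in one algorithm entry (empty if none).

    Hypothetical helper, not part of any NLPL tooling.
    """
    problems = []
    value = entry.get(REQUIRED_FIELD)
    # bool is a subclass of int in Python, so it is rejected explicitly.
    if not isinstance(value, int) or isinstance(value, bool):
        problems.append('"id" is required and must be an integer')
    # Report keys that are not described on this page.
    for key in sorted(set(entry) - OPTIONAL_FIELDS - {REQUIRED_FIELD}):
        problems.append(f'unknown field "{key}"')
    return problems


# Example entry; the "id" value is made up for illustration.
# "command" is left out to show that optional fields may be absent.
example = json.loads("""
{
  "id": 0,
  "name": "Gensim Continuous Skipgram",
  "tool": "Gensim",
  "url": "https://github.com/RaRe-Technologies/gensim",
  "version": "3.6"
}
""")

print(validate_algorithm(example))
```

An entry that lacks "id", or that carries a key not listed above, yields a non-empty list of problems. Storing "version" as a string rather than a number is an assumption of this sketch.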