Difference between revisions of "Corpora/OPUS"

From Nordic Language Processing Laboratory

Revision as of 13:14, 28 February 2020

== http://opus.nlpl.eu ==

OPUS is a collection of open parallel corpora in many languages. It provides bilingually aligned data sets, interfaces, tools and more. The data sets are available in several common formats, both for download and for use within the NLPL infrastructure. The service is hosted at CSC in Finland, and the core of the data is also available from Sigma2 on Abel. Tools for processing the data are accessible from Puhti. More detailed information can be found on the [http://opus.nlpl.eu/trac OPUS Wiki]:

* Information for [http://opus.nlpl.eu/trac/wiki/NLPL NLPL Users]

The online search interface is available at http://opus.nlpl.eu/bin/opuscqp.pl, and the word-alignment-based lexicon is accessible at http://opus.nlpl.eu/lex.php.
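
As a rough illustration of working with a bilingually aligned data set, the following Python sketch reads sentence pairs from two plain-text files in which line N of one file is the translation of line N of the other. This layout is an assumption made for the example; this page does not say which formats are offered, and the function name and file names are hypothetical.

```python
from itertools import zip_longest
import os
import tempfile


def read_sentence_pairs(source_path, target_path, encoding="utf-8"):
    """Yield (source, target) sentence pairs from two line-aligned text files.

    Assumes one sentence per line, with line N of the source file aligned
    to line N of the target file (an assumption for this sketch).
    """
    with open(source_path, encoding=encoding) as src, \
            open(target_path, encoding=encoding) as tgt:
        for line_no, (s, t) in enumerate(zip_longest(src, tgt), start=1):
            # zip_longest pads the shorter file with None, so a length
            # mismatch is detected instead of being silently truncated.
            if s is None or t is None:
                raise ValueError(f"files differ in length at line {line_no}")
            yield s.rstrip("\n"), t.rstrip("\n")


if __name__ == "__main__":
    # Hypothetical demo data written to a temporary directory.
    with tempfile.TemporaryDirectory() as tmp:
        en_path = os.path.join(tmp, "demo.en")
        fi_path = os.path.join(tmp, "demo.fi")
        with open(en_path, "w", encoding="utf-8") as f:
            f.write("Hello\nThank you\n")
        with open(fi_path, "w", encoding="utf-8") as f:
            f.write("Hei\nKiitos\n")
        for en, fi in read_sentence_pairs(en_path, fi_path):
            print(f"{en}\t{fi}")
```

Raising an error on a length mismatch is a deliberate choice in this sketch: with line-aligned files, a missing line shifts every later pair, so failing early is safer than returning misaligned data.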

Contact: Jörg Tiedemann via e-mail - firstname.lastname at helsinki.fi (first name without dots)