CLARIN to IMDI, DATE: 2009-08-13
Corpora of Newspaper Texts
Corpora of Newspaper Texts
Computer corpora in Finnish, Swedish and English languages (newspaper texts), with requests and relevance information used in information retrieval evaluation.
ISO639-3:eng
English
Unknown
Unknown
Unknown
ISO639-3:fin
Finnish
Unknown
Unknown
Unknown
ISO639-3:swe
Swedish
Unknown
Unknown
Unknown
Written Corpus
Department of Information Studies, University of Tampere
Collection: About 142.2, 42.5, and 251 million word tokens respectively, or 1088MB, 281 MB, and 1530 MB respectively.
Eija Airio (
1225
2380