Multilingual Resource Collection of the University of Helsinki Language Corpus Server

Description

The collection is available in Kielipankki, the Language Bank of Finland, on puhti.csc.fi in the directory /appl/data/kielipankk/mrc-uhlcs/. Instructions for obtaining access rights: http://www.kielipankki.fi/access

The University of Helsinki Language Corpus Server (UHLCS), maintained by the University of Helsinki, was founded in late 1980. At present, the UHLCS contains computer corpora of more than 50 languages, including samples of minority languages and extensive corpora representing different text types. In 2000, the corpora of the Uralic, Turkic, Tungusic, Mongolic, Chukotko-Kamchatkan, Iranian and North-East Caucasian languages were edited for public use with the financial support of the Max Planck Institute for Evolutionary Anthropology, Leipzig. In summer 2003, metadata descriptions for the corpora were prepared with the financial support of the ECHO project (European Cultural Heritage Online). The UHLCS also provides tools that can be used to analyze the corpora.

The UHLCS contains the following corpora:

* Avar
* Chukchi
* Chuvash
* English
* Erzya and Moksha Mordvin (literature, journals)
* Erzya and Moksha Mordvin (word lists)
* Estonian 1
* Estonian 2
* Even
* Evenki
* Finland Swedish Text Corpus (FISC)
* Finnish (Bibles)
* Finnish (literature)
* Ingrian
* Kalmyk
* Khanty (North Khanty) (corpora and translations)
* Komi Zyrian (corpora and texts)
* Komi Zyrian (literature)
* Koryak
* Kurdish
* Lak
* Latin
* Lude (Ludian)
* Nanay
* Nenets (Tundra Nenets)
* North Saami (literature)
* North Saami (Sámikultuvradoaibmagotti smiehttamush)
* Ossete
* Swahili
* Tabassaran
* Tajik
* Turkic languages
* Ume Saami
* Uralic languages

The corpora in the UHLCS have many different IPR holders. If you have any questions about the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).

License details: http://urn.fi/urn:nbn:fi:lb-2015041302

The intended use of the resource must be described in a research plan.

Year of publication

2018

Resource type

Authors

CSC - Tieteen tietotekniikan keskus Oy - Curator

University of Helsinki - Curator

Project

Other information

Fields of science

Linguistics

Language

North Saami, Avar, Chuvash, Chukchi, German, English, Estonian, Even, Evenki, Finnish, French, Nanai, Italian, Ingrian, Khanty, Komi-Yazva, Koryak, Kurdish, Latin, Lak, Ludian, Moksha, Erzya, Dutch, Norwegian, Ossetian, Russian, Kildin Saami, Ume Saami, Swedish, Tabasaran, Tatar, Tajik, Udmurt, Uzbek, Kalmyk, Forest Nenets

Availability

Restricted access

License

Other

Keywords

Subject headings

Temporal coverage

Related resources