dc.contributor.author Giellatekno - Saami Language Technology, UiT The Arctic University of Norway
dc.contributor.author The Divvun group at UiT The Arctic University of Norway
dc.date.accessioned 2015-10-27T08:07:31Z
dc.date.available 2015-10-27T08:07:31Z
dc.date.issued 2015-10-10
dc.identifier.uri http://hdl.handle.net/11509/100
dc.description The SIKOR North Saami free corpus is a monolingual text corpus of North Saami containing administrative, legal, religious, non-fiction, fiction, and science texts. It was created by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, together with members of the language community. In particular, the following colleagues contributed to the creation of the resource: Ciprian Gerstenberger, Børre Gaup, Lene Antonsen, Thomas Omma, and Trond Trosterud. Linguistically, the data set (746,329 sentences; 8,936,437 tokens) is annotated with word form, lemma, morphosyntactic analysis, and dependency relations between tokens. The corpus has been automatically processed and linguistically analyzed with the Giellatekno/Divvun tools and may therefore contain incorrect annotations. If you find any errors, the creators would appreciate feedback sent to giellatekno@uit.no and feedback@divvun.no. Please note that the Giellatekno resources are dynamic in nature. To make sure you have the most recent version, please contact Giellatekno (see Contact Info in metadata).
dc.language.iso sme
dc.publisher Giellatekno - Saami Language Technology
dc.rights Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
dc.rights.uri http://creativecommons.org/licenses/by/3.0/
dc.rights.label CC
dc.source.uri http://giellatekno.uit.no/index.eng.html
dc.subject Monolingual Corpus
dc.subject Text Corpus
dc.subject North Saami
dc.subject Dependency Tree Bank
dc.title SIKOR North Saami free corpus
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding Clarino
demo.uri http://gtweb.uit.no/korp
contact.person Trond Trosterud trond.trosterud@uit.no Giellatekno - Saami Language Technology
size.info 746329 sentences
size.info 8936437 tokens
files.size 112986472
files.count 1


Files in this item

This item is distributed under Creative Commons and licensed under Creative Commons - Attribution 3.0 Unported (CC BY 3.0). Attribution required.

Name SIKOR_sme_20151010.zip
Size 107.75 MB
Format application/zip
