dc.contributor.author | Giellatekno - Saami Language Technology, UiT The Arctic University of Norway |
dc.contributor.author | The Divvun group at UiT The Arctic University of Norway |
dc.date.accessioned | 2015-10-27T08:07:43Z |
dc.date.available | 2015-10-27T08:07:43Z |
dc.date.issued | 2015-10-10 |
dc.identifier.uri | http://hdl.handle.net/11509/102 |
dc.description | The SIKOR South Saami free corpus is a monolingual text corpus of South Saami that contains administrative, law, religious, non-fiction, and fiction texts. It is work done by the Giellatekno and Divvun research groups, Department of Linguistics, UiT The Arctic University of Norway, as well as by members of the language community. In particular, the following colleagues have contributed to the creation of the ressource: Ciprian Gerstenberger, Børre Gaup, Risten-Birje Steinfjell, Lene Antonsen, Trond Trosterud, and Maja Kappfjell. Linguistically, the data set (58,407 sentences; 646,273 tokens) features word form, lemma, morphosyntactic analysis, and dependency relations between tokens. The corpus has been automatically processed and linguistically analyzed with the Giellatekno/Divvun tools. Therefore, it may contain wrong annotations. In case you find any errors the creators would appreciate your feedback sent to giellatekno@uit.no and feedback@divvun.no. Please note that the Giellatekno resources are dynamic in nature. To ensure that you have a completely updated version, please contact Giellatekno (see Contact Info in metadata). |
dc.language.iso | sma |
dc.publisher | Giellatekno - Saami Language Technology |
dc.rights | Creative Commons - Attribution 3.0 Unported (CC BY 3.0) |
dc.rights.uri | http://creativecommons.org/licenses/by/3.0/ |
dc.rights.label | CC |
dc.source.uri | http://giellatekno.uit.no/index.eng.html |
dc.subject | Monolingual Corpus |
dc.subject | Text Corpus |
dc.subject | South Saami |
dc.subject | Dependency Tree Bank |
dc.title | SIKOR South Saami free corpus |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarino |
demo.uri | http://gtweb.uit.no/korp |
contact.person | Trond Trosterud trond.trosterud@uit.no Giellatekno - Saami Language Technology |
size.info | 58461 sentences |
size.info | 646384 tokens |
files.size | 7913681 |
files.count | 1 |
Files in this item
This item is
Creative Commons - Attribution 3.0 Unported (CC BY 3.0)
Distributed under Creative Commons
and licensed under:Creative Commons - Attribution 3.0 Unported (CC BY 3.0)