dc.contributor.author Post, Margje
dc.contributor.author Pineda Dijkerman, David
dc.date.accessioned 2024-06-11T10:04:16Z
dc.date.available 2024-06-11T10:04:16Z
dc.date.issued 2024-03-22
dc.identifier.uri http://hdl.handle.net/11509/150
dc.description The Kola Peninsula Spoken Corpus (KoPeSC) is a dataset of sound recordings, with transcriptions in ELAN, of Pomor Russian dialect speech and of Sámi and Russian speech as spoken by the indigenous peoples of the Kola Peninsula. Most recordings are sociolinguistic interviews collected during fieldwork expeditions between 2001 and 2008, with Margje Post and David Pineda (then UiT, now UiB) as main researchers.
KoPeSC 1, the first dataset, consists of all audio files (in MP3 and WAVE format) and their transcriptions (in ELAN), with metadata, accompanying the following publication: Post, Margje & David Pineda (2024). Речь поморов Терского берега Белого моря. Звучащая хрестоматия [“Pomor Speech from the Ter Coast of the White Sea: A Spoken Anthology”]; Slavica Bergensia, Volume 15. DOI: xxx
The dataset in KoPeSC 1 consists of:
– the 30 audio files for the 29 texts in the anthology, in both WAVE (.wav) and MP3 (.mp3) format;
– 30 ELAN transcription files (.eaf) for these audio files, with transcriptions in both simplified phonetic script and standardized Russian;
– metadata: KoPeSC1-SlavBerg15_metadata.xlsx
Vol. 15 of Slavica Bergensia is an open-access anthology of the Pomor Russian dialect as it is spoken on the Ter Coast of the White Sea, with 30 short excerpts from interviews with 21 elderly dialect speakers. This publication, written in Russian, also contains background information on the region and its dialect, in-depth analyses of a selection of linguistic features, and commentaries on each text. In the publication itself the recordings are transcribed in a simplified phonetic transcription. The ELAN transcriptions in this repository also contain transcriptions in Standard Russian, which are better suited for queries and analyses.
ELAN allows searching through multiple annotation files, so one can search for an expression in all sound and transcription files of the anthology at once and listen to each individual token, or download a spreadsheet with all tokens of the expression; cf. https://www.mpi.nl/corpus/html/elan/ch07s02.html
License: CC BY-NC-SA 4.0, https://creativecommons.org/licenses/by-nc-sa/4.0/ [Russian-language version: https://creativecommons.org/licenses/by-nc-sa/4.0/deed.ru]
Although the sound and text data accompanying Slavica Bergensia 15 are freely available for access, printing and download for non-commercial use, the audio recordings are classified as personal data. Please note that every individual user is responsible for treating the participants in the interviews with respect and sincerity. The publication of these data has been registered in RETTE (project nr. F3438, https://rette.app.uib.no), UiB’s system for monitoring and control of the processing of personal data in research and student projects, and follows the Norwegian national research ethical guidelines for projects processing personal data (https://www.forskningsetikk.no/en/guidelines/).
The dialect recordings were collected during various fieldwork expeditions and for different projects between 1961 and 2006, most of them by Margje Post (then University of Tromsø, now University of Bergen) and colleagues from Tromsø, Moscow and Bochum between 2001 and 2006. The dataset also contains five fairy tales, recorded in 1961 and 1964 by Dmitrij Balašov (Petrozavodsk – St. Petersburg), and an excerpt from a folkloristic interview recorded by Marina Vlasova (St. Petersburg) in 1987. Most speakers are from the village of Varzuga, but recordings from Umba, Kuzomen’, Tetrino and Čavanga are represented as well. For details, see KoPeSC1-SlavBerg15_metadata.xlsx. More dialect recordings will be made available in a separate dataset, KoPeSC 2, including the long versions of the interviews from which the excerpts were taken.
The fieldwork expeditions, the cooperation with Prof. Christian Sappok from the University of Bochum, and the transcriptions have been supported by grants from UiT The Arctic University of Norway, DAAD, DFG and the University of Bergen. We are indebted to the Audio Archive of the Institute of Linguistics, Literature and History of the Karelian Research Centre of the Russian Academy of Sciences (KarRC RAS) in Petrozavodsk for the recordings of texts 2-5 (from 1964), and to the folklorist Marina Vlasova (Puškinskij Dom, Saint Petersburg) and her colleagues at the Audio Archive of Puškinskij Dom for texts 1 (1961) and 8 (1987). For questions, or to receive the annotation guidelines or phonetic transcriptions, please contact the corpus manager (Margje Post, UiB).
––––––––––––––––
KoPeSC 1 is the first dataset of the Kola Peninsula Spoken Corpus. The Kola Peninsula Spoken Corpus (KoPeSC) consists of several datasets, which are planned to be archived in CLARINO. These include further recordings of Pomor Russian dialect speech from the Ter Coast from the Tromsø-Bergen archive, which have been transcribed in ELAN, as well as sound files and transcriptions collected during fieldwork in 2007 and 2008 in Lovozero and Krasnoščelje (Central Kola Peninsula) by Margje Post and David Pineda (then UiT, now UiB). The latter recordings consist of interviews in Russian with native speakers of Sámi and Komi-Zyryan and with former Pomor Russian inhabitants of Ponoj, a coastal village in the easternmost part of the Kola Peninsula.
dc.language.iso rus
dc.publisher University of Bergen
dc.rights Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
dc.rights.uri https://creativecommons.org/licenses/by-nc-sa/4.0/
dc.rights.label CC
dc.source.uri https://boap.uib.no/books/sb/catalog/series/slavica-bergensia
dc.subject spontaneous speech
dc.subject dialects
dc.subject spoken corpus
dc.subject Russian
dc.title The Kola Peninsula Spoken Corpus (KoPeSC) 1: Spoken Corpus to “Речь поморов Терского берега Белого моря: Звучащая хрестоматия” [“Pomor Speech on the Ter Coast of the White Sea: A spoken anthology”] (Slavica Bergensia 15)
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType audio
has.files yes
branding Clarino
demo.uri https://boap.uib.no/books/sb/catalog/series/slavica-bergensia
contact.person Margje Post Margje.Post@uib.no University of Bergen
size.info 15600 words
size.info 117 minutes
files.size 522399362
files.count 4


Files in this item (498.2 MB in total)

This item is distributed under Creative Commons and licensed under: Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
Name: KoPeSC1-SlavBerg15_metadata.xlsx
Size: 53.21 KB
Format: Microsoft Excel 2007
Description: metadata in Excel (.xlsx)
Name: KoPeSC1-SlavBerg15_ELAN_eaf.zip
Size: 240.9 KB
Format: application/zip
Description: zip of 30 ELAN transcriptions (.eaf)
File preview:
  • __MACOSX
    • ._TB21_VAR1-15_27.eaf-1 B
    • ._TB26_VAR2-09_1825s.eaf-1 B
    • ._TB04_Var64Balashov3kumy.eaf-1 B
    • ._TB19_Var2001_OE22_002_01.eaf-1 B
    • ._TB17_Var2001_EF26_008_05.eaf-1 B
    • ._TB28_VAR2-09_3162s.eaf-1 B
    • ._TB20b_Var2001_MD35_009_18.eaf-1 B
    • ._TB20a_Var2001_MD35_009_05.eaf-1 B
    • ._TB08_Var1987_КК-03В_01.eaf-1 B
    • ._TB15_Var2001_MD25_004_00.eaf-1 B
    • ._TB23_VAR2-08_720s.eaf-1 B
    • ._TB10_Var2001_AD08_005_02.eaf-1 B
    • ._TB06_Umb2001_AM13_003_21.eaf-1 B
    • ._TB02_Var64Balashov1Potomb.eaf-1 B
    • ._TB01_Kuz61_Kakvtristatri(1)-tweaked.eaf-1 B
    • ._TB18_Var2001_EV24_007_25.eaf-1 B
    • ._TB27_VAR2-09_470s.eaf-1 B
    • ._TB11_Var2001_MD26_016_11.eaf-1 B
    • ._TB01_Kuz61_Kakvtristatri(2).eaf-1 B
    • ._TB12_Var2001_ZI29_003_13.eaf-1 B
    • ._TB09_Var2001_TV17_001_16.eaf-1 B
    • ._TB07_Umb2001_JA12_018_03.eaf-1 B
    • ._TB24_VAR2-08_2382s.eaf-1 B
    • ._TB05_Var64Balashov4stariknagorke.eaf-1 B
    • ._TB25_VAR2-09_805s.eaf-1 B
    • ._TB03_Var64Balashov2zhilstarik.eaf-1 B
    • ._TB22_VAR2-04_753-1243s.eaf-1 B
    • ._TB14_Var2001_EI34_001_05.eaf-1 B
    • ._TB13_Var2001_MZ39_001_07.eaf-1 B
    • ._TB16_Var2004_AE41_001_002_05.eaf-1 B
    • TB14_Var2001_EI34_001_05.eaf-1 B
    • TB06_Umb2001_AM13_003_21.eaf-1 B
    • TB13_Var2001_MZ39_001_07.eaf-1 B
    • TB23_VAR2-08_720s.eaf-1 B
    • TB05_Var64Balashov4stariknagorke.eaf-1 B
    • TB02_Var64Balashov1Potomb.eaf-1 B
    • TB26_VAR2-09_1825s.eaf-1 B
    • TB04_Var64Balashov3kumy.eaf-1 B
    • TB19_Var2001_OE22_002_01.eaf-1 B
    • TB17_Var2001_EF26_008_05.eaf-1 B
    • TB28_VAR2-09_3162s.eaf-1 B
    • TB01_Kuz61_Kakvtristatri(1)-tweaked.eaf-1 B
    • TB15_Var2001_MD25_004_00.eaf-1 B
    • TB03_Var64Balashov2zhilstarik.eaf-1 B
    • TB10_Var2001_AD08_005_02.eaf-1 B
    • TB21_VAR1-15_27.eaf-1 B
    • TB16_Var2004_AE41_001_002_05.eaf-1 B
    • TB25_VAR2-09_805s.eaf-1 B
    • TB22_VAR2-04_753-1243s.eaf-1 B
    • TB18_Var2001_EV24_007_25.eaf-1 B
    • TB11_Var2001_MD26_016_11.eaf-1 B
    • TB12_Var2001_ZI29_003_13.eaf-1 B
    • TB01_Kuz61_Kakvtristatri(2).eaf-1 B
    • TB27_VAR2-09_470s.eaf-1 B
    • TB09_Var2001_TV17_001_16.eaf-1 B
    • TB20b_Var2001_MD35_009_18.eaf-1 B
    • TB20a_Var2001_MD35_009_05.eaf-1 B
    • TB07_Umb2001_JA12_018_03.eaf-1 B
    • TB24_VAR2-08_2382s.eaf-1 B
    • TB08_Var1987_КК-03В_01.eaf-1 B
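The preview above shows that the archive contains a `__MACOSX` folder with `._` companion entries next to the actual transcription files; these are macOS bookkeeping entries, not corpus data. A minimal Python sketch for unpacking one of the zip files while skipping them could look as follows. The function names and the local paths in the usage example are illustrative assumptions and are not part of the dataset.

```python
import zipfile
from pathlib import PurePosixPath


def is_macos_debris(member_name: str) -> bool:
    """Return True for macOS bookkeeping entries (__MACOSX folder, ._ files)."""
    parts = PurePosixPath(member_name).parts
    return "__MACOSX" in parts or any(part.startswith("._") for part in parts)


def extract_clean(archive_path: str, target_dir: str) -> list[str]:
    """Extract every real file from the archive and return the extracted names."""
    extracted = []
    with zipfile.ZipFile(archive_path) as archive:
        for info in archive.infolist():
            # Skip directory entries and macOS resource-fork companions.
            if info.is_dir() or is_macos_debris(info.filename):
                continue
            archive.extract(info, target_dir)
            extracted.append(info.filename)
    return sorted(extracted)


if __name__ == "__main__":
    # Hypothetical local paths; adjust to wherever the download was saved.
    names = extract_clean("KoPeSC1-SlavBerg15_ELAN_eaf.zip", "kopesc1_eaf")
    print(len(names), "files extracted")
```

The same function can be applied unchanged to the .mp3 and .wav archives, since their previews show the same layout.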
Name: KoPeSC1-SlavBerg15_sound_mp3.zip
Size: 91.83 MB
Format: application/zip
Description: zip of 30 sound files (.mp3)
File preview:
  • __MACOSX
    • ._TB23_VAR2-08_720s.mp3-1 B
    • ._TB10_Var2001_AD08_005_02.mp3-1 B
    • ._TB06_Umb2001_AM13_003_21.mp3-1 B
    • ._TB01_Kuz61_Kakvtristatri(1)-tweaked.mp3-1 B
    • ._TB02_Var64Balashov1Potomb.mp3-1 B
    • ._TB18_Var2001_EV24_007_25.mp3-1 B
    • ._TB27_VAR2-09_470s.mp3-1 B
    • ._TB11_Var2001_MD26_016_11.mp3-1 B
    • ._TB01_Kuz61_Kakvtristatri(2).mp3-1 B
    • ._TB12_Var2001_ZI29_003_13.mp3-1 B
    • ._TB09_Var2001_TV17_001_16.mp3-1 B
    • ._TB07_Umb2001_JA12_018_03.mp3-1 B
    • ._TB24_VAR2-08_2382s.mp3-1 B
    • ._TB05_Var64Balashov4stariknagorke.mp3-1 B
    • ._TB25_VAR2-09_805s.mp3-1 B
    • ._TB03_Var64Balashov2zhilstarik.mp3-1 B
    • ._TB22_VAR2-04_753-1243s.mp3-1 B
    • ._TB14_Var2001_EI34_001_05.mp3-1 B
    • ._TB16_Var2004_AE41_001_002_05.mp3-1 B
    • ._TB13_Var2001_MZ39_001_07.mp3-1 B
    • ._TB21_VAR1-15_27.mp3-1 B
    • ._TB26_VAR2-09_1825s.mp3-1 B
    • ._TB04_Var64Balashov3kumy.mp3-1 B
    • ._TB19_Var2001_OE22_002_01.mp3-1 B
    • ._TB17_Var2001_EF26_008_05.mp3-1 B
    • ._TB28_VAR2-09_3162s.mp3-1 B
    • ._TB20b_Var2001_MD35_009_18.mp3-1 B
    • ._TB20a_Var2001_MD35_009_05.mp3-1 B
    • ._TB08_Var1987_КК-03В_01.mp3-1 B
    • ._TB15_Var2001_MD25_004_00.mp3-1 B
    • TB03_Var64Balashov2zhilstarik.mp3-1 B
    • TB01_Kuz61_Kakvtristatri(1)-tweaked.mp3-1 B
    • TB15_Var2001_MD25_004_00.mp3-1 B
    • TB10_Var2001_AD08_005_02.mp3-1 B
    • TB21_VAR1-15_27.mp3-1 B
    • TB16_Var2004_AE41_001_002_05.mp3-1 B
    • TB25_VAR2-09_805s.mp3-1 B
    • TB22_VAR2-04_753-1243s.mp3-1 B
    • TB18_Var2001_EV24_007_25.mp3-1 B
    • TB11_Var2001_MD26_016_11.mp3-1 B
    • TB12_Var2001_ZI29_003_13.mp3-1 B
    • TB01_Kuz61_Kakvtristatri(2).mp3-1 B
    • TB27_VAR2-09_470s.mp3-1 B
    • TB09_Var2001_TV17_001_16.mp3-1 B
    • TB20b_Var2001_MD35_009_18.mp3-1 B
    • TB20a_Var2001_MD35_009_05.mp3-1 B
    • TB07_Umb2001_JA12_018_03.mp3-1 B
    • TB24_VAR2-08_2382s.mp3-1 B
    • TB08_Var1987_КК-03В_01.mp3-1 B
    • TB14_Var2001_EI34_001_05.mp3-1 B
    • TB06_Umb2001_AM13_003_21.mp3-1 B
    • TB13_Var2001_MZ39_001_07.mp3-1 B
    • TB23_VAR2-08_720s.mp3-1 B
    • TB05_Var64Balashov4stariknagorke.mp3-1 B
    • TB02_Var64Balashov1Potomb.mp3-1 B
    • TB26_VAR2-09_1825s.mp3-1 B
    • TB04_Var64Balashov3kumy.mp3-1 B
    • TB19_Var2001_OE22_002_01.mp3-1 B
    • TB17_Var2001_EF26_008_05.mp3-1 B
    • TB28_VAR2-09_3162s.mp3-1 B
Name: KoPeSC1-SlavBerg15_sound_wav.zip
Size: 406.08 MB
Format: application/zip
Description: zip of 30 sound files (.wav)
File preview:
  • __MACOSX
    • ._TB23_VAR2-08_720s.wav-1 B
    • ._TB10_Var2001_AD08_005_02.wav-1 B
    • ._TB06_Umb2001_AM13_003_21.wav-1 B
    • ._TB01_Kuz61_Kakvtristatri(1)-tweaked.wav-1 B
    • ._TB18_Var2001_EV24_007_25.wav-1 B
    • ._TB27_VAR2-09_470s.wav-1 B
    • ._TB01_Kuz61_Kakvtristatri(2).wav-1 B
    • ._TB11_Var2001_MD26_016_11.wav-1 B
    • ._TB09_Var2001_TV17_001_16.wav-1 B
    • ._TB07_Umb2001_JA12_018_03.wav-1 B
    • ._TB24_VAR2-08_2382s.wav-1 B
    • ._TB05_Var64Balashov4stariknagorke.wav-1 B
    • ._TB25_VAR2-09_805s.wav-1 B
    • ._TB03_Var64Balashov2zhilstarik.wav-1 B
    • ._TB22_VAR2-04_753-1243s.wav-1 B
    • ._TB14_Var2001_EI34_001_05.wav-1 B
    • ._TB13_Var2001_MZ39_001_07.wav-1 B
    • ._TB16_Var2004_AE41_001_002_05.wav-1 B
    • ._TB21_VAR1-15_27.wav-1 B
    • ._TB26_VAR2-09_1825s.wav-1 B
    • ._TB04_Var64Balashov3kumy.wav-1 B
    • ._TB19_Var2001_OE22_002_01.wav-1 B
    • ._TB17_Var2001_EF26_008_05.wav-1 B
    • ._TB28_VAR2-09_3162s.wav-1 B
    • ._TB20b_Var2001_MD35_009_18.wav-1 B
    • ._TB20a_Var2001_MD35_009_05.wav-1 B
    • ._TB08_Var1987_КК-03В_01.wav-1 B
    • ._TB15_Var2001_MD25_004_00.wav-1 B
    • TB01_Kuz61_Kakvtristatri(1)-tweaked.wav-1 B
    • TB15_Var2001_MD25_004_00.wav-1 B
    • TB03_Var64Balashov2zhilstarik.wav-1 B
    • TB10_Var2001_AD08_005_02.wav-1 B
    • TB21_VAR1-15_27.wav-1 B
    • TB16_Var2004_AE41_001_002_05.wav-1 B
    • TB25_VAR2-09_805s.wav-1 B
    • TB22_VAR2-04_753-1243s.wav-1 B
    • TB18_Var2001_EV24_007_25.wav-1 B
    • TB11_Var2001_MD26_016_11.wav-1 B
    • TB12_Var2001_ZI29_003_13.wav-1 B
    • TB01_Kuz61_Kakvtristatri(2).wav-1 B
    • TB27_VAR2-09_470s.wav-1 B
    • TB09_Var2001_TV17_001_16.wav-1 B
    • TB20b_Var2001_MD35_009_18.wav-1 B
    • TB20a_Var2001_MD35_009_05.wav-1 B
    • TB07_Umb2001_JA12_018_03.wav-1 B
    • TB24_VAR2-08_2382s.wav-1 B
    • TB08_Var1987_КК-03В_01.wav-1 B
    • TB14_Var2001_EI34_001_05.wav-1 B
    • TB06_Umb2001_AM13_003_21.wav-1 B
    • TB13_Var2001_MZ39_001_07.wav-1 B
    • TB23_VAR2-08_720s.wav-1 B
    • TB05_Var64Balashov4stariknagorke.wav-1 B
    • TB02_Var64Balashov1Potomb.wav-1 B
    • TB26_VAR2-09_1825s.wav-1 B
    • TB04_Var64Balashov3kumy.wav-1 B
    • TB19_Var2001_OE22_002_01.wav-1 B
    • TB17_Var2001_EF26_008_05.wav-1 B
    • TB28_VAR2-09_3162s.wav-1 B
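The description notes that ELAN can search through multiple annotation files at once. For users who prefer scripting, a comparable search can be approximated outside ELAN, because .eaf files are XML. The sketch below is a minimal illustration and not ELAN's own search: it only looks at time-aligned annotations (`ALIGNABLE_ANNOTATION` elements) and ignores reference annotations. The function names, the folder name and the optional `tier_id` filter are assumptions; the actual tier names used in KoPeSC 1 are not stated in this record and would need to be checked in the files or the annotation guidelines.

```python
import re
import xml.etree.ElementTree as ET
from pathlib import Path


def search_eaf(eaf_path, pattern, tier_id=None):
    """Yield (file, tier, start_ms, end_ms, text) for matching annotations."""
    root = ET.parse(eaf_path).getroot()
    # TIME_SLOT elements map slot ids to millisecond offsets;
    # unaligned slots carry no TIME_VALUE attribute.
    slots = {
        slot.get("TIME_SLOT_ID"): slot.get("TIME_VALUE")
        for slot in root.iter("TIME_SLOT")
    }
    regex = re.compile(pattern, re.IGNORECASE)
    for tier in root.iter("TIER"):
        if tier_id is not None and tier.get("TIER_ID") != tier_id:
            continue
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            text = ann.findtext("ANNOTATION_VALUE") or ""
            if not regex.search(text):
                continue
            start = slots.get(ann.get("TIME_SLOT_REF1"))
            end = slots.get(ann.get("TIME_SLOT_REF2"))
            yield (
                Path(eaf_path).name,
                tier.get("TIER_ID"),
                int(start) if start is not None else None,
                int(end) if end is not None else None,
                text,
            )


def search_corpus(folder, pattern, tier_id=None):
    """Run search_eaf over every .eaf file in a folder, in name order."""
    for eaf_path in sorted(Path(folder).glob("*.eaf")):
        yield from search_eaf(eaf_path, pattern, tier_id)


if __name__ == "__main__":
    # Hypothetical folder holding the unpacked .eaf files.
    for hit in search_corpus("kopesc1_eaf", "море"):
        print(*hit, sep="\t")
```

The start and end offsets returned for each hit can be used to locate the corresponding stretch in the matching audio file, which shares its base name with the .eaf file in the listings above.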
