Show simple item record

 
dc.contributor.author Parra Escartín, Carla
dc.date.accessioned 2014-11-26T13:56:09Z
dc.date.accessioned 2014-11-26T16:53:08Z
dc.date.available 2014-11-26T13:56:09Z
dc.date.available 2014-11-26T16:53:08Z
dc.date.issued 2012-04-18
dc.identifier.uri http://hdl.handle.net/11509/79
dc.description TRIS Spanish-German parallel corpus (v0.3) Specialized parallel corpus Spanish-German (ES-ES, DE-AT and DE-DE), texts from the European Commission between 1997-2010. The texts are technical regulations in a variety of domains. This third version is sentence aligned and is in TMX and TEI format. TMX files are sentence aligned while TEI encoded files have the information about sentence alignment in stand-off annotation. Every sentence includes information about the domain, the year and the file it belongs to as well as the sentence number. It contains files written in Austria and translated into European Spanish from three different domains: - B00: Construction (205 files; 70,648 sentences; 1,563,000 words; time frame: 1999-2010) - C00A: Agriculture, Fishing and Foodstuffs (12 files; 4879 sentences; 137,354 words; time frame: 1999-2001) - H00: Domestic and Leisure Equipment (12 files; 1229 sentences; 58328 words; time frame: 2005-2010) Additionally the corpus has also been Part-Of-Speech tagged using the TreeTagger POS tagger and the POS tagged files are also available. Versions 0.1 and 0.2 are kept as individual records because they are (currently) intended to be downloaded individually. Version 0.3 is encoded in TEI P5 and includes files from two new domains not included in versions 0.1 and 0.2: C00A (Agriculture, Fishing and Foodstuffs), which is currently under alignment and H00 (Domestic and Leisure Equipment), which includes all files available in the database up to 2010.
dc.description.sponsorship Common Language Resources and their Applications (CLARA - Project number: 238405) URL: http://clara.uib.no Funding Type: Eu Funds Funders: SP3-People-ITN (Network for Initial Training, Marie Curie Actions, FP7) Project duration: 01.12.2009 - 30.11.2013
dc.language.iso deu
dc.language.iso spa
dc.publisher University of Bergen
dc.rights Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/
dc.rights.label CC
dc.source.uri http://clara.b.uib.no/fellows/carla-parra-escartin/tris/
dc.subject Corpus
dc.title Parallel Corpus of documents from the Technical Regulations Information System for German-Spanish (v0.3)
dc.type corpus
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Parra Escartín, Carla
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Parra Escartín, Carla
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName University of Bergen
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email carla.parra@uib.no
metashare.ResourceInfo#ContentInfo.mediaType text
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium downloadable
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType #1-euFunds
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.projectName #1-Common Language Resources and their Applications (CLARA - Project number: 238405)
metashare.ResourceInfo#TextInfo#SizeInfo.size 1758419
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit words
metashare.ResourceInfo#ValidationInfo.validated True
hidden
hasMetadata true
has.files yes
branding clarino
branding clarino
files.size 56580547
files.count 1


 Files in this item

This item is
Distributed under Creative Commons
and licensed under:
Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Attribution Required Noncommercial Share Alike
Icon
Name
TRIS-V03.zip
Size
53.96 MB
Format
application/zip
Description
TRIS-V03
 Download file

Show simple item record