CARInA - Corpus of Aligned Read speech Including Annotations

Documentation of the data
datacite.description.TechnicalInfo

Audio files, TextGrid files and BPF files

Countries to which the data refer
datacite.geolocation.iso3166

GERMANY

Description of the data
datacite.resourceType

CARInA is a German-language speech corpus containing speech material from the German Spoken Wikipedia Corpus. It is organized by completeness and by speaker. The folder "Complete" contains all speech material that is annotated at the orthographic, morphosyntactic, broad phonetic, narrow phonetic and prosodic levels. The folder "WorkInProgress" contains all material with at least one incomplete annotation level.

Type of the data
datacite.resourceTypeGeneral

Sound

Total size of the dataset
datacite.size

54035507106
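
The total above is given in bytes, while the individual files under "Files" are listed in rounded MB and GB. As a rough consistency check, the following sketch converts the byte count and compares it with the sum of the listed file sizes. It assumes that the page uses binary units (1 GB = 1024^3 bytes), which the record itself does not state; the variable and function names are illustrative only.

```python
# Sketch: compare the datacite.size value with the per-file sizes on the page.
# Assumption (not stated in the record): "GB" and "MB" are binary units,
# i.e. 1 GB = 1024**3 bytes and 1 MB = 1024**2 bytes.

TOTAL_BYTES = 54035507106  # value of datacite.size

# Rounded sizes exactly as listed under "Files".
LISTED_SIZES = {
    "CARInA_Complete.tar.gz": (7.81, "GB"),
    "CARInA_WorkInProgress.tar.gz": (42.44, "GB"),
    "zipfile_contents.txt": (23.35, "MB"),
    "Documents.tar.gz": (4.63, "MB"),
    "MATLAB-code.tar.gz": (50.18, "MB"),
}

UNIT_BYTES = {"MB": 1024**2, "GB": 1024**3}


def to_gib(n_bytes: float) -> float:
    """Convert a byte count to binary gigabytes (GiB)."""
    return n_bytes / 1024**3


def listed_total_gib(sizes: dict) -> float:
    """Sum the listed (value, unit) sizes and return the total in GiB."""
    total_bytes = sum(value * UNIT_BYTES[unit] for value, unit in sizes.values())
    return to_gib(total_bytes)


total_gib = to_gib(TOTAL_BYTES)
listed_gib = listed_total_gib(LISTED_SIZES)

print(f"datacite.size:       {total_gib:.2f} GiB")
print(f"sum of listed files: {listed_gib:.2f} GiB")
```

Under that assumption both figures come out at roughly 50.3 GiB. The small remaining difference is consistent with the per-file sizes being rounded to two decimals.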

Author
dc.contributor.author

Kath, Hannes

Upload date
dc.date.accessioned

2021-09-29T14:31:30Z

Publication date
dc.date.available

2021-09-29T14:31:30Z

Publication date
dc.date.available

2026-06-04T14:27:02Z

Date of data creation
dc.date.created

2021

Publication date
dc.date.issued

2021-09-29

Abstract of the dataset
dc.description.abstract

The dataset comprises the CARInA data, MATLAB code and documents. The speech corpus is annotated automatically at the orthographic, morphosyntactic, broad phonetic, narrow phonetic and prosodic levels.

Public reference to this page
dc.identifier.uri

https://opara.zih.tu-dresden.de/handle/123456789/2505

Public reference to this page
dc.identifier.uri

https://doi.org/10.25532/OPARA-144

Language
dc.language

eng

Publisher
dc.publisher

Technische Universität Dresden

Licence
dc.rights

Attribution-ShareAlike 4.0 International

URI of the licence text
dc.rights.uri

http://creativecommons.org/licenses/by-sa/4.0/

Specification of the discipline(s)
dc.subject.classification

4::44::408

Specification of the discipline(s)
dc.subject.classification

1::11::104::104-04

Title of the dataset
dc.title

CARInA - Corpus of Aligned Read speech Including Annotations

Files

Original bundle

Name: CARInA_Complete.tar.gz
Size: 7.81 GB
Format: Unknown data format

Name: CARInA_WorkInProgress.tar.gz
Size: 42.44 GB
Format: Unknown data format

Name: zipfile_contents.txt
Size: 23.35 MB
Format: Plain Text

Name: Documents.tar.gz
Size: 4.63 MB
Format: Unknown data format

Name: MATLAB-code.tar.gz
Size: 50.18 MB
Format: Unknown data format