CARInA - Corpus of Aligned Read speech Including Annotations
Documentation of the data | Audio files, TextGrid-files and BPF files | |
Countries to which the data refer | GERMANY | |
Description of the data | CARInA is a German-language speech corpus containing speech material of the German Spoken Wikipedia Corpus. It is organized by completeness and speakers. The folder "Complete" contains all speech material which is annotated at orthographic, morphosyntactic, broad phonetic, narrow phonetic as well as prosodic speech level. The folder "WorkInProgress" contains all material with at least on one incomplete annotation level. | |
Type of the data | Sound | |
Total size of the dataset | 54035507106 | |
Author | Kath, Hannes | |
Upload date | 2021-09-29T14:31:30Z | |
Publication date | 2021-09-29T14:31:30Z | |
Publication date | 2026-06-04T14:27:02Z | |
Data of data creation | 2021 | |
Publication date | 2021-09-29 | |
Abstract of the dataset | Data CARInA, MATLAB-Code, Documents. The speech corpus is annotated (by automatic systems) at orthographic, morphosyntactic, broad phonetic, narrow phonetic and prosodic level. | |
Public reference to this page | https://opara.zih.tu-dresden.de/handle/123456789/2505 | |
Public reference to this page | https://doi.org/10.25532/OPARA-144 | |
dc.language | eng | |
Publisher | Technische Universität Dresden | |
Licence | Attribution-ShareAlike 4.0 International | |
URI of the licence text | http://creativecommons.org/licenses/by-sa/4.0/ | |
Specification of the discipline(s) | 4::44::408 | |
Specification of the discipline(s) | 1::11::104::104-04 | |
Title of the dataset | CARInA - Corpus of Aligned Read speech Including Annotations |
Files
Original bundle
- Name:
- CARInA_WorkInProgress.tar.gz
- Size:
- 42.44 GB
- Format:
- Unknown data format
- Description:
Collections

