Corpus of Aligned Read speech Including Annotations (CARInA)
CARInA is a German-language speech corpus containing speech material of the German Spoken Wikipedia Corpus. It is organized by completeness and speakers. The folder "Complete" contains all speech material which is annotated at orthographic, morphosyntactic, broad phonetic, narrow phonetic as well as prosodic speech level. The folder "WorkInProgress" contains all material with at least on one incomplete annotation level.
This collection is open access and publicly accessible.
(Technische Universität Dresden, 2021)Data CARInA, MATLAB-Code, Documents. The speech corpus is annotated (by automatic systems) at orthographic, morphosyntactic, broad phonetic, narrow phonetic and prosodic level.