<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns="http://purl.org/rss/1.0/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel rdf:about="https://opara.zih.tu-dresden.de/xmlui/handle/123456789/5914">
<title>Perceptual evaluation of broadband physics-based speech sound synthesis with a 1D versus a 3D acoustic model</title>
<link>https://opara.zih.tu-dresden.de/xmlui/handle/123456789/5914</link>
<description>Articulatory synthesis is a useful tool to explore the relationship between &#13;
			the speech production and perception processes. &#13;
			However, including the high frequencies (above about 5 kHz) requires  &#13;
			a three-dimensional (3D) acoustical model for realistic simulations. &#13;
			In this frequency range, &#13;
			one-dimensional (1D) acoustic models fail to predict additional&#13;
			resonances and anti-resonances related to the 3D properties of &#13;
			the acoustic field. &#13;
			While articulatory synthesis based on 3D &#13;
			acoustic models is nowadays achievable for isolated phonemes, the impact of such&#13;
			models on the perception by human listeners remains largely unknown.&#13;
			In this study, it was first examined whether the high-frequency part of stimuli &#13;
			generated with 1D and 3D acoustic models &#13;
			can be differentiated in a pair comparison task. &#13;
			The results show that such differences can be discriminated, which is in line &#13;
			with recent findings showing that purely spectral cues can contribute &#13;
			to the perception of speech at high frequencies.&#13;
			A second perceptual experiment that consisted in rating the naturalness of the &#13;
			stimuli on a four-level Likert scale did not show any significant effect &#13;
			of the acoustic model.&#13;
			However, it highlighted differences of naturalness between the synthesized &#13;
			phonemes.</description>
<dc:date>2026-04-07T07:20:06Z</dc:date>
</channel>
</rdf:RDF>
