Social Perceptions of Synthetic Speakers
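Basic information
- Grant number: 423651352
- Principal investigator:
- Amount: --
- Host institution:
- Host institution country: Germany
- Project type: Research Grants
- Fiscal year: 2019
- Funding country: Germany
- Duration: 2018-12-31 to 2021-12-31
- Project status: Completed
- Source:
- Keywords: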
Project abstract
Speech signals automatically induce social perceptions of the speaker in listeners. Through acoustic analysis and signal manipulation, a large body of knowledge has accumulated on the acoustic correlates of social perception in natural speech, such as spectral and prosodic parameters, and on the underlying perceptual dimensions. However, although modern speech synthesis paradigms deliver very high quality, it is not yet understood whether results from natural speech also hold for synthesized speech. Hence, the major research question is: “Which acoustic features of synthesized speech affect subjective perceptions of social speaker characteristics?”
In order to answer this question, this project studies social perception of the two basic social attributions, competence and benevolence, for text-to-speech (TTS) synthesizers in two potential application domains, using stimuli from the topics of healthcare and customer service. Results are compared to those obtained from natural speech in earlier projects. It is tested whether competence and benevolence also emerge as basic social attributions, or whether other dimensions are more relevant. Regarding the speech signal, similarities and differences in acoustic parameters and their systematics are identified. A mid-term result is an acoustic prediction model of the identified social dimensions for synthesized speech.
On a methodological level, utterances are created with state-of-the-art TTS systems and systematically modified on the signal level to produce stimuli for empirical testing with human listeners. Crowd-sourcing techniques are applied for the required listening and rating tests. The final goal is to examine how acoustic features and patterns can be incorporated directly into modern TTS methodologies (hidden Markov models, deep neural networks) instead of being applied through post-processing signal manipulation. This leads to the secondary research question: “Which alterations of the synthesis procedure lead to positive perceptions of speakers?” For this aim, current approaches from speaker conversion are applied.
Apart from the fundamental knowledge gained from this research, results will be relevant for TTS system developers, in order to efficiently improve voices for particular service domains.
Project outcomes
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants of Professor Dr.-Ing. Sebastian Möller
Quantification of perceived location privacy, and its relationship to privacy behaviour
- Grant number: 409241470
- Fiscal year: 2019
- Funding amount: --
- Project type: Research Grants
Simulation of Conversation Behavior in Case of Impaired Telephone Transmission
- Grant number: 320253669
- Fiscal year: 2016
- Funding amount: --
- Project type: Research Grants
Quality Attributes and Overall Quality of Transmitted Speech
- Grant number: 289919134
- Fiscal year: 2016
- Funding amount: --
- Project type: Research Grants (Transfer Project)
Subjective measurement and instrumental estimation of mobile online gaming quality based on perceptual dimensions
- Grant number: 279244726
- Fiscal year: 2015
- Funding amount: --
- Project type: Research Grants
Subjective measurement and instrumental estimation of conversational speech quality based on perceptual dimensions
- Grant number: 251103195
- Fiscal year: 2014
- Funding amount: --
- Project type: Research Grants
Modellierung von Benutzerverhalten zur Usability-Evaluierung von Sprachdialogdiensten mit Hilfe von techniksoziologisch ermittelten Regeln
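Modeling user behavior for the usability evaluation of spoken dialogue services using rules derived from the sociology of technology
- Grant number: 152700694
- Fiscal year: 2009
- Funding amount: --
- Project type: Research Grants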
Qualitätsmessung multimodaler Mensch-Maschine-Interaktion
Quality measurement of multimodal human-machine interaction
- Grant number: 55252204
- Fiscal year: 2008
- Funding amount: --
- Project type: Research Grants
Qualität multimodaler Mensch-Maschine-Interaktion
Quality of multimodal human-machine interaction
- Grant number: 5454604
- Fiscal year: 2005
- Funding amount: --
- Project type: Heisenberg Fellowships
Sprachsignal-Qualitätsmessung auf der Grundlage auditiv und messtechnisch definierter Qualitätsattribute
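Speech signal quality measurement based on auditorily and instrumentally defined quality attributes
- Grant number: 5427824
- Fiscal year: 2004
- Funding amount: --
- Project type: Research Grants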
Knowledge-enhanced information extraction across languages for pharmacovigilance
- Grant number: 442445488
- Fiscal year:
- Funding amount: --
- Project type: Research Grants
Similar NSFC (National Natural Science Foundation of China) grants
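液晶聚合物的可控合成与智能感知性能的研究
Controlled synthesis of liquid crystal polymers and their smart sensing properties
- Grant number: 22305175
- Approval year: 2023
- Funding amount: CNY 300,000
- Project type: Young Scientists Fund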
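营养感知分子Ubxd8调控缢蛏HUFA合成的作用及机制研究
Role and mechanism of the nutrient-sensing molecule Ubxd8 in regulating HUFA synthesis in the razor clam Sinonovacula constricta
- Grant number: 32102763
- Approval year: 2021
- Funding amount: CNY 240,000
- Project type: Young Scientists Fund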
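营养感知分子Ubxd8调控缢蛏HUFA合成的作用及机制研究
Role and mechanism of the nutrient-sensing molecule Ubxd8 in regulating HUFA synthesis in the razor clam Sinonovacula constricta
- Grant number:
- Approval year: 2021
- Funding amount: CNY 300,000
- Project type: Young Scientists Fund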
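胞浆甘氨酰-tRNA合成酶cytoGARS感知甘氨酸的分子机制及其对肝细胞癌的影响
Molecular mechanism of glycine sensing by the cytoplasmic glycyl-tRNA synthetase cytoGARS and its effect on hepatocellular carcinoma
- Grant number: 32070756
- Approval year: 2020
- Funding amount: CNY 590,000
- Project type: General Program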
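基于场景精细感知对抗的虚假图像合成研究
Fake image synthesis based on fine-grained scene-aware adversarial learning
- Grant number: 61902400
- Approval year: 2019
- Funding amount: CNY 280,000
- Project type: Young Scientists Fund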
Similar overseas grants
心を動かす質感認知と瞳孔反応の因果およびその操作技術
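Causal relationship between emotionally moving material perception and pupillary responses, and techniques for manipulating them
- Grant number: 22K17987
- Fiscal year: 2022
- Funding amount: --
- Project type: Grant-in-Aid for Early-Career Scientists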
教養知とその形成―その比較分析と教養教育の類型化の実践的検証
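Liberal arts knowledge and its formation: comparative analysis and practical verification of a typology of liberal arts education
- Grant number: 20K02979
- Fiscal year: 2020
- Funding amount: --
- Project type: Grant-in-Aid for Scientific Research (C)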
Cognitive neuroscience investigation of the unconscious processes of aesthetic consciousness: an integrated understanding of processing and influencing processes
- Grant number: 19H04432
- Fiscal year: 2019
- Funding amount: --
- Project type: Grant-in-Aid for Scientific Research (B)
A study on the usage of the Japanese verb "suru" to express the meaning of attribute/state and its corresponding expressions in English
- Grant number: 19K20789
- Fiscal year: 2018
- Funding amount: --
- Project type: Grant-in-Aid for Research Activity Start-up
感情共有の機能に関する実験的検討:関係構築と集合的態度形成における役割
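An experimental examination of the functions of emotion sharing: its role in relationship building and collective attitude formation
- Grant number: 18K03017
- Fiscal year: 2018
- Funding amount: --
- Project type: Grant-in-Aid for Scientific Research (C)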