Next-Generation Expressive Personalized Voices for Speech-Generating Devices
用于语音生成设备的下一代富有表现力的个性化声音
基本信息
- 批准号:10547241
- 负责人:
- 金额:$ 27.58万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-08-15 至 2024-08-14
- 项目状态:已结题
- 来源:
- 关键词:ALS patientsAdoptionAdultAgeAlgorithmsAmyotrophic Lateral SclerosisAugmentative and Alternative CommunicationCharacteristicsChildChild HealthClientDepressed moodDiseaseDysarthriaEmotionsEncapsulatedEvaluationFemaleGenerationsGoalsGovernmentHumanHybridsIndividualKnowledgeLaboratory ResearchLearningLinguisticsMachine LearningMethodsModelingNetwork-basedNeurodegenerative DisordersOnset of illnessOutcomeOutputPersonsPhaseProcessProductionReadingRecordsRehabilitation therapyRiskRunningServicesSpeechStructureSurveysSystemTechnologyTextTrainingVoiceVoice Qualitybasecommercial applicationcommunication devicedeep neural networkdesignexperienceexperimental studyimprovedknowledge basemachine learning algorithmmalemimeticsnext generationnovelsoundsuccessvirtual vocal tract
项目摘要
Project Summary/Abstract
The creation of personalized synthetic voices has wide application in medical/rehabilitation settings for pa-
tients who rely on a speech-generating device (SGD) for communication. One common application is voice
banking, wherein a person who risks losing their voice, such as somebody with a neurodegenerative disease
like Amyotrophic Lateral Sclerosis (ALS), records their own speech before the onset of disease-related dysar-
thria for later use in an SGD that mimics their natural speech characteristics. While the technology underlying
the creation of such personalized synthetic voices is growing in maturity and adoption by SGD users, it still suf-
fers from two primary limitations: a lack of expressiveness and a burdensome amount of recording needed to
create highly natural-sounding voices. The proposed project aims to remedy this situation by marrying the ma-
chine-learning technology behind ModelTalker, a pioneering voice-banking text-to-speech service developed at
Nemours Children’s Health, with the knowledge-based technology underlying Synfony, a rule-based text-to-
speech system developed by Synfonica LLC, which is capable of generating a variety of speech styles and ex-
pressive modes. The expert knowledge built into Synfonica will be used to design an optimal set of sentences
for voice bankers to record, and its algorithms for the generation of natural-sounding prosody in different
modes and styles will be integrated into ModelTalker’s machine-learning algorithms, creating a hybrid system
that embraces the best qualities of both approaches. The new text-to-speech (TTS) system resulting from this
project will (a) require a minimal amount of recorded speech from the voice banker, (b) accurately capture
their vocal identity, and (c) be structured such that new expressive modes and speech styles can be added easily
without additional recording. The feasibility of the project will be demonstrated by recording the voices of an
adult male, an adult female, and a child, and generating TTS voices that can speak in three expressive modes
(neutral, happy, and sad). Perceptual experiments will be run to evaluate their intelligibility, naturalness, suc-
cess in capturing the vocal identity of the speaker, and the appropriateness of their expressive modes. In gen-
eral, the project will be a major step forward in enabling the users of personalized synthetic voices to express
their emotions and intentions.
项目摘要/摘要
个性化合成声音的创建在医疗/康复环境中广泛应用
依靠语音生成设备(SGD)进行通信的t。一个常见的应用是语音
银行业务,其中一个冒着失去声音的人,例如患有神经退行性疾病的人
像肌萎缩性侧索硬化症(ALS)一样,在与疾病相关的dysar-发作之前记录了自己的语音
供以后使用的SGD使用,该SGD模仿其自然语音特征。而底层技术
这种个性化的合成声音的创建在成熟和被SGD用户的采用中增长,它仍然可以
从两个主要局限
创建高度自然的声音。拟议的项目旨在通过嫁给Ma-来记住这种情况
ModelTalker背后的Chine学习技术,ModelTalker,这是一款开发的语音言论语音到语音服务
Nemours儿童健康,具有基于知识的技术的基础技术,这是一种基于规则的文本到基础的文本
Synfonica LLC开发的语音系统,该系统能够产生各种语音风格和extress
压力模式。 Synfonica内置的专家知识将用于设计最佳句子集
供语音银行家记录及其在不同
模式和样式将集成到ModelTalker的机器学习算法中,创建一个混合系统
这具有两种方法的最佳品质。由此产生的新文本到语音(TTS)系统
项目将(a)要求语音银行家的记录量最小,(b)准确捕获
它们的声音身份以及(c)的结构使得可以轻松添加新的表达方式和语音样式
没有其他记录。该项目的可行性将通过记录一个
成年男性,一个成年女性和一个孩子,并产生可以以三种表达模式说话的TTS声音
(中立,快乐和悲伤)。将进行感知实验,以评估其智力,自然性,成功
在捕捉说话者的声音身份以及其表达方式的适当性时。在一般
错误,该项目将是使个性化合成声音的用户表达的主要一步
他们的情绪和意图。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
H TIMOTHY Bunnell其他文献
H TIMOTHY Bunnell的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('H TIMOTHY Bunnell', 18)}}的其他基金
Personalized speech output for communication devices
通信设备的个性化语音输出
- 批准号:
7219783 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
Personalizing Speech Output for Communication Devices
个性化通信设备的语音输出
- 批准号:
6749031 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
Personalizing Speech Output for Communication Devices
个性化通信设备的语音输出
- 批准号:
6646704 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
Personalized speech output for communication devices
通信设备的个性化语音输出
- 批准号:
7337320 - 财政年份:2003
- 资助金额:
$ 27.58万 - 项目类别:
相似国自然基金
采用新型视觉-电刺激配对范式长期、特异性改变成年期动物视觉系统功能可塑性
- 批准号:32371047
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
破解老年人数字鸿沟:老年人采用数字技术的决策过程、客观障碍和应对策略
- 批准号:72303205
- 批准年份:2023
- 资助金额:30.00 万元
- 项目类别:青年科学基金项目
通过抑制流体运动和采用双能谱方法来改进烧蚀速率测量的研究
- 批准号:12305261
- 批准年份:2023
- 资助金额:30.00 万元
- 项目类别:青年科学基金项目
采用多种稀疏自注意力机制的Transformer隧道衬砌裂缝检测方法研究
- 批准号:62301339
- 批准年份:2023
- 资助金额:30.00 万元
- 项目类别:青年科学基金项目
政策激励、信息传递与农户屋顶光伏技术采用提升机制研究
- 批准号:72304103
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Implementing SafeCare Kenya to Reduce Noncommunicable Disease Burden: Building Community Health Workers' Capacity to Support Parents with Young Children
实施 SafeCare Kenya 以减少非传染性疾病负担:建设社区卫生工作者支持有幼儿的父母的能力
- 批准号:
10672785 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
Impact of Medicaid Prescription Cap Policies on Treatment Outcomes for Opioid Use Disorder: A National Mixed Methods Study
医疗补助处方上限政策对阿片类药物使用障碍治疗结果的影响:一项国家混合方法研究
- 批准号:
10637024 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
Optimization of electromechanical monitoring of engineered heart tissues
工程心脏组织机电监测的优化
- 批准号:
10673513 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
Relationships Between Pain-Related Psychological Factors, Gait Quality, and Attention in Chronic Low Back Pain
慢性腰痛中疼痛相关心理因素、步态质量和注意力之间的关系
- 批准号:
10679189 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别:
The RaDIANT Health Systems Intervention for Equity in Kidney Transplantation
Radiant 卫生系统干预肾移植的公平性
- 批准号:
10681998 - 财政年份:2023
- 资助金额:
$ 27.58万 - 项目类别: