Research on singing rendering systems design based on an active auditory perception model

Basic information

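  • Grant number:
    14380165
  • Principal investigator:
  • Amount:
    $ 91,500
  • Host institution:
  • Host institution country:
    Japan
  • Project category:
    Grant-in-Aid for Scientific Research (B)
  • Fiscal year:
    2002
  • Funding country:
    Japan
  • Project period:
    2002 to 2004
  • Project status:
    Completed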

Project abstract

The goal of this project was to investigate why vocal music is attractive even without lyrics. This general goal was broken down into several sub-goals, which consisted of developing new research tools and winning a prize as the best artificial singing system at an international contest. These goals were fulfilled, although the success raised more questions than it answered. Firstly, a chorus piece built from artificially manipulated synthesized voices (an excerpt of the composition "Small Sky" by Toru Takemitsu) won first prize among four synthetic singing systems at RENCON'04, a satellite event of the international conference on computer-based entertainment systems (NIME'04) held in Shizuoka in 2004. The piece was made using a STRAIGHT-based singing synthesis program. Secondly, the singing synthesis system is based on the auditory morphing algorithm invented for this research project. The morphing algorithm has had a substantial impact on speech perception and music perception research, and it is currently used in many research institutes worldwide. Thirdly, a new algorithm called "senza vibrato" was developed to make it possible to morph vibrato, which is an essential ingredient of the singing voice and, at the same time, an obstacle that made morphing singing voices very difficult. Fourthly, important experience was gained by performing actual investigations based on the "systematic downgrading strategy" that was proposed to characterize the current research project. These accomplishments were reported at various international and domestic conferences and in scientific journals. These publications and the new research tools based on STRAIGHT started a research trend characterized by ecological views of auditory and speech perception. In conclusion, the project was a great success.
However, it is important to note that even with all the accomplishments of this project, a large gap remains between synthetic singers and human singers, and there is ample room for investigation to bridge it. A future research project may need to focus on methods for generalizing from a relatively small number of instances because, based on experience in this project, it is generally impractical to provide enough singing voice instances for the "systematic downgrading strategy" to function to its full extent.

Project outcomes

Journal articles (99)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Underlying principles of a high-quality speech manipulation system STRAIGHT and its application to speech segregation
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis
A new acoustic measurement and compensation method based on logarithmic transformation of the time axis and multi-location acquisition
Speech segregation using an event-synchronous auditory image and STRAIGHT
"senza vibrato" : a key component for morphing singing

Other grants held by KAWAHARA Hideki

Functional speech design framework based on auditory information representation
  • Grant number:
    26540087
  • Fiscal year:
    2014
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Challenging Exploratory Research
Investigations on design methodology of functional voices
  • Grant number:
    24650085
  • Fiscal year:
    2012
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Challenging Exploratory Research
Advanced speech analysis, modification and synthesis framework based on auditory information representations
  • Grant number:
    24300073
  • Fiscal year:
    2012
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Scientific Research (B)
Development of research tools based on visualization of artistic expression in vocal performance of Noh
  • Grant number:
    22650042
  • Fiscal year:
    2010
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Challenging Exploratory Research
Integrated research on augmentation and assisting technologies for auditory and speech functions
  • Grant number:
    19200017
  • Fiscal year:
    2007
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Scientific Research (A)
Korean philosophy and China and Europe-in the maritime cross-cultural exchange in East Asia
  • Grant number:
    17083005
  • Fiscal year:
    2005
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Scientific Research on Priority Areas
Investigations on active perception in speech production and hearing systems
  • Grant number:
    11650425
  • Fiscal year:
    1999
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Scientific Research (C)

Similar grants from the National Natural Science Foundation of China

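Research on Lao speech synthesis methods based on linguistic knowledge enhancement and fine-grained prosody modeling
  • Grant number:
    62376111
  • Year approved:
    2023
  • Funding amount:
    CNY 510,000
  • Project category:
    General Program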
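Research on key technologies for elderly-friendly speech synthesis
  • Grant number:
    62306260
  • Year approved:
    2023
  • Funding amount:
    CNY 300,000
  • Project category:
    Young Scientists Fund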
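Controllable personalized speech synthesis based on representations of the voice production mechanism
  • Grant number:
    62302046
  • Year approved:
    2023
  • Funding amount:
    CNY 300,000
  • Project category:
    Young Scientists Fund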
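Research on key problems in deep-learning-based speech recognition and synthesis for ethnic minority languages
  • Grant number:
    62266048
  • Year approved:
    2022
  • Funding amount:
    CNY 330,000
  • Project category:
    Fund for Less Developed Regions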
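Research on deep emotion modeling for speech synthesis in conversational scenarios
  • Grant number:
  • Year approved:
    2022
  • Funding amount:
    CNY 300,000
  • Project category:
    Young Scientists Fund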

Similar overseas grants

Development of Speech Synthesis System for Controlling Speaker Identity through Text Prompts and Visual Interfaces
  • Grant number:
    23K20017
  • Fiscal year:
    2023
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Research Activity Start-up
Leveraging Natural Language Processing for Reverberant Speech Enhancement in Cochlear Implants
  • Grant number:
    10755798
  • Fiscal year:
    2023
  • Funding amount:
    $ 91,500
  • Project category:
Emotional Text-to-Speech Synthesis with Verbal Speech and Nonverbal Vocalizations
  • Grant number:
    23KJ0828
  • Fiscal year:
    2023
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for JSPS Fellows
Everyday conversation speech synthesis
  • Grant number:
    22K12107
  • Fiscal year:
    2022
  • Funding amount:
    $ 91,500
  • Project category:
    Grant-in-Aid for Scientific Research (C)
Generating Personalized Synthetic Speech for Progressive Dysarthria Using Severity-Appropriate Adaptation Strategies for Neural Text-to-Speech and Voice Conversion
  • Grant number:
    10525903
  • Fiscal year:
    2022
  • Funding amount:
    $ 91,500
  • Project category: