Research on singing rendering systems design based on an active auditory perception model

基于主动听觉感知模型的歌唱渲染系统设计研究

基本信息

  • 批准号:
    14380165
  • 负责人:
  • 金额:
    $ 9.15万
  • 依托单位:
  • 依托单位国家:
    日本
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
  • 财政年份:
    2002
  • 资助国家:
    日本
  • 起止时间:
    2002 至 2004
  • 项目状态:
    已结题

项目摘要

The goal of this project is to investigate the source of reason why vocal music is attractive even without lyrics. This general goal was broken down to several sub-goal which consists of new research tool development and winning prize as the best artificial singing system at international contests. These goals were fulfilled even though the success introduced more questions than the answered questions. Firstly, the piece of chorus with artificially manipulated synthesized voices (that is an excerpt of a composition made by Toru Takemitsu titled "small sky") won the first prize among four synthetic singing systems at RENCON'04, the satellite event of the international conference on computer based entertainment systems (NIME'04) held in Shizuoka in 2004. The piece was made using a STRAIGHT based singing synthesis program. Secondly, the singing synthesis system is based on the auditory morphing algorithm invented for this research project. The morphing algorithm made a substantial impact … More on speech perception and music perception research and the algorithm is currently used in many research institutes worldwide. Thirdly, a new algorithm called "senza vibrato" was developed to made it possible to morph vibrato that is an essential ingredient of singing voice, and at the same time, is an obstacle that made morphing of singing voice very difficult. Fourthly, important experiences were obtained by performing actual investigations based on the "systematic downgrading strategy" that was proposed to characterize the current research project. Those accomplishments were reported at various international/domestic conferences and scientific journals. Those publications and the new research tools based on STRAIGHT made a research trend that is characterized by ecological views on auditory and speech perception. In conclusion, the project was a great success. However, it is important to note that even with all the accomplishments in this project, there still remains a huge gap between synthetic singers and human singers. There is a huge room for investigations to bridge this gap. The prospective research project may need to put attentions on methods for generalization from relatively small number of instances, because, based on experiences in this research project, it is generally impractical to provide sufficient number of singing voice instances to function the "systematic downgrading strategy" in its full extent. Less
该项目的目的是调查即使没有歌词的人声音乐也很有吸引力的原因。这个一般目标被分解为几个子目标,其中包括新的研究工具开发和获奖奖,作为国际比赛中最佳人工唱歌系统。即使成功提出的问题比回答的问题更多的问题也实现了这些目标。 Firstly, the piece of chorus with artificially manipulated synthesized Voices (that is an excerpt of a composition made by Toru Takemitsu titled "small sky") won the first prize among four synthetic singing systems at RENCON'04, the satellite event of the international conference on computer based entertainment systems (NIME'04) held in Shimaoka in 2004. The piece was made using a STRAIGHT based singing synthesis program.其次,唱歌合成系统基于该研究项目发明的听觉变形算法。这种变形算法产生了重大影响……更多地对语音感知和音乐感知研究和算法进行了更多影响,目前在全球许多研究机构中使用了算法。第三,开发了一种称为“ Senza Vibrato”的新算法是为了使颤音成为演唱声音的重要组成部分,与此同时,这是一个障碍,使演唱声音的变形非常困难。第四,通过基于“系统降级策略”进行实际调查来获得重要的经验,该研究提出了当前研究项目的特征。这些成就是在各种国际/国内会议和科学期刊上报告的。这些出版物和基于直系的新研究工具使研究趋势以对听觉和语音感知的生态观点为特征。总之,该项目取得了巨大的成功。但是,重要的是要注意,即使有了这个项目中的所有成就,合成歌手和人类歌手之间仍然存在巨大差距。有一个巨大的投资空间来弥合这一差距。前瞻性研究项目可能需要注意相对少数实例的概括方法,因为根据该研究项目的经验,提供足够数量的歌声实例以全面运行“系统性降级策略”通常是不切实际的。较少的

项目成果

期刊论文数量(99)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis
用于高质量语音分析/合成的声门闭合即时同步正弦模型
A new acoustic measurement and compensation method based on logarithmic transformation of the time axis and multi-location acquisition
基于时间轴对数变换和多点采集的声学测量与补偿新方法
Loudspeaker equalization based on multi-location observation with reliable time-frequency region selection and its evaluation using sound propagation measurement
基于可靠时频区域选择的多位置观测扬声器均衡及其声传播测量评估
  • DOI:
  • 发表时间:
    2004
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Masanori Morise;Masanori Morise
  • 通讯作者:
    Masanori Morise
Hideki Kawahara: "Systematic Downgrading for Investigating "Naturaless" in Synthesized singing using STRAIGHT : A High Quality VOCODER"143th MEETING OF THE ACOUSTICAL SOCIETY OF AMERICA. Vol.111, No.5, Pt.2. 2334-2334 (2002)
Hideki Kawahara:“使用 STRAIGHT 调查合成歌唱中的“不自然”的系统降级:高品质声码器”美国声学学会第 143 届会议。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
Speech segregation using an event-synchronous auditory image and STRAIGHT
使用事件同步听觉图像和 STRAIGHT 进行语音分离
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

KAWAHARA Hideki其他文献

KAWAHARA Hideki的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('KAWAHARA Hideki', 18)}}的其他基金

Functional speech design framework based on auditory information representation
基于听觉信息表示的功能语音设计框架
  • 批准号:
    26540087
  • 财政年份:
    2014
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Investigations on design methodology of functional voices
功能声音设计方法研究
  • 批准号:
    24650085
  • 财政年份:
    2012
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Advanced speech analysis, modification and synthesis framework based on auditory information representations
基于听觉信息表示的高级语音分析、修改和合成框架
  • 批准号:
    24300073
  • 财政年份:
    2012
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Development of research tools based on visualization of artistic expression in vocal performance of Noh
基于能乐声乐表演艺术表现可视化的研究工具开发
  • 批准号:
    22650042
  • 财政年份:
    2010
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Integrated research on augmentation and assisting technologies forauditory and speech functions
听觉与言语功能增强与辅助技术综合研究
  • 批准号:
    19200017
  • 财政年份:
    2007
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Scientific Research (A)
Korean philosophy and China and Europe-in the maritime cross-cultural exchange in East Asia
韩国哲学与中欧——东亚海上跨文化交流
  • 批准号:
    17083005
  • 财政年份:
    2005
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Scientific Research on Priority Areas
Investigations on active perception in speech production and hearing systems
言语产生和听觉系统主动感知的研究
  • 批准号:
    11650425
  • 财政年份:
    1999
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)

相似海外基金

Development of Speech Synthesis System for Controlling Speaker Identity through Text Prompts and Visual Interfaces
通过文本提示和可视化界面控制说话人身份的语音合成系统的开发
  • 批准号:
    23K20017
  • 财政年份:
    2023
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
Emotional Text-to-Speech Synthesis with Verbal Speech and Nonverbal Vocalizations
具有言语语音和非言语发声的情感文本到语音合成
  • 批准号:
    23KJ0828
  • 财政年份:
    2023
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
Everyday conversation speech synthesis
日常对话语音合成
  • 批准号:
    22K12107
  • 财政年份:
    2022
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Sustainably Developable Speech Synthesis Based on Continual Learning
基于持续学习的可持续发展语音合成
  • 批准号:
    21K21305
  • 财政年份:
    2021
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
Collaborative Research: RI: Medium: Flexible Deep Speech Synthesis through Gestural Modeling
合作研究:RI:Medium:通过手势建模进行灵活的深度语音合成
  • 批准号:
    2106928
  • 财政年份:
    2021
  • 资助金额:
    $ 9.15万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了