Mathematical modeling of the temporal acpects of speech based on human perceptual and cognitive mechanisms

基于人类感知和认知机制的语音时间方面的数学建模

基本信息

项目摘要

The purpose of this project was to find clues by which humans retrieve the temporal structure of speech, to understand their usage, and to establish a quantitative method to evaluate the temporal adequateness or naturalness of a given speech sound that can replicate the performance of human judgment. For this purpose, three fundamental investigative tasks were implemented a study at the psychophysical level, a study at the linguistic level, and the construction of an evaluation model. One distinguishing feature of this project is that it emphasized the psychophysical aspects nearly as much as the linguistic, even though its primary object was spoken language.Since a person can easily recognize fast speech, even in the case of a foreign language where the meaning is unknown, it is assumed that the processing of the temporal aspects of speech undoubtedly involve non-linguistic and therefore language-independent activities. By concentrating on such processing that is independent of a give … More n language, we developed the basic technology toward a system with small overhead for language processing as well as simple extensibility to multiple languages. The major results follow.(I) Psychophysical level : An algorithm was developed to predict temporal reference points in a given speech by replicating the function of human auditory processing. This algorithm's most important benefit is its applicability to virtually unlimited language variations since it doesn't require any linguistic knowledge.(II) Linguistic level : An empirical study levealed that factors, which affect the perception of prosodic units, vary depending on the particular language's choice of units. This finding provides practical implications concerning how much weight should be placed on prosodic factors when designing effective foreign-language training methods.(III) Modeling : By integrating auditory functions derived from investigation at the psychophysical level, a mathematical model was implemented to automatically evaluate the naturalness of the speech of the English learners. The model's performance closely approximated the subjective evaluation of a native-speaking English instructor. This result not only suggests the importance of psychophysical factors in the adequateness or naturalness evaluation of speech but it also implies potential extensibility of the proposed model to multiple languages. Less
目的是找到人类检索语音时间结构的线索,并建立一种评估给定声音的时间适当性,可以为此目的复制Humanant的表现,这是三个基本的调查任务,是由三个基本的调查任务。在心理物理级别,E评估模型的E分解是该项目的一个区别特征。假设言语的时间方面无疑涉及非语言和与语言无关的活动。语言处理和简单的扩展性,主要的结果是人类的听觉处理特定语言的单位选择在设计有效的外国语言培训方法(III)时,数学模型被迫评估英国学习者的自然性。讲师的结果是atens的成像性因素。

项目成果

期刊论文数量(340)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Effects of auditory feedback in the practive phase of imitating a piano performance
听觉反馈在模仿钢琴演奏练习阶段的效果
Prosody generation for communicative speech synthesis.
用于交际语音合成的韵律生成。
音楽知覚研究用ツールSTRAIGHT&aimmatの機能
音乐感知研究工具STRAIGHT&aimmat的特点
Judgment of onset asynchrony of two tone components and its relation to the cochlear delay.
两个音调成分起病不同步的判断及其与耳蜗延迟的关系。
第二言語の音声学習-知覚と生成および処理階層間の相互作用-(招待講演)
第二语言语音学习——感知、产生和处理层之间的交互——(特邀演讲)
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

KATO Hiroaki其他文献

KATO Hiroaki的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('KATO Hiroaki', 18)}}的其他基金

Recognition of pathogen-derived sphingolipid in plants.
植物中病原体衍生鞘脂的识别。
  • 批准号:
    20K15528
  • 财政年份:
    2020
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Early-Career Scientists
Structural basis for discrimination between multi-drug exporters and lipid floppies
多种药物出口商和脂质软盘之间歧视的结构基础
  • 批准号:
    19K22495
  • 财政年份:
    2019
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Challenging Research (Exploratory)
Structural basis for optimization of molecular probe using P-glycoprotein in vivo imaging
P-糖蛋白体内成像优化分子探针的结构基础
  • 批准号:
    24659018
  • 财政年份:
    2012
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Challenging Exploratory Research
Protein structural data mining based on the Neighborhood Fragment Spectra representation
基于邻域片段谱表示的蛋白质结构数据挖掘
  • 批准号:
    22500130
  • 财政年份:
    2010
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Studies on stability of fission yeast heterochromatin and its regulators
裂殖酵母异染色质及其调控因子的稳定性研究
  • 批准号:
    21870024
  • 财政年份:
    2009
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Research Activity Start-up
High frequency and high field magnetic resonance in panoscopic- assembled rare earth magnets
全景组装稀土磁体中的高频高场磁共振
  • 批准号:
    20900111
  • 财政年份:
    2008
  • 资助金额:
    $ 31.62万
  • 项目类别:
A Modeling of Prosody Perception for Second Language Learning
第二语言学习的韵律感知建模
  • 批准号:
    20300069
  • 财政年份:
    2008
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Three-dimensional structural similarity search of proteins based on Geometrical Fragment Spectra
基于几何碎片谱的蛋白质三维结构相似性搜索
  • 批准号:
    19700139
  • 财政年份:
    2007
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Young Scientists (B)
Struchural biology of membrane protein trabsporters
膜蛋白转运蛋白的结构生物学
  • 批准号:
    17380066
  • 财政年份:
    2005
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Basic Study on Decentralized Society : role of State Governments in the USA and Provincial governments in Canada
去中心化社会的基础研究:美国州政府和加拿大省政府的作用
  • 批准号:
    16310166
  • 财政年份:
    2004
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)

相似海外基金

Studies of speech, image and natural language processing for multimodal spoken document retrieval
多模态语音文档检索的语音、图像和自然语言处理研究
  • 批准号:
    23K11216
  • 财政年份:
    2023
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Determining the mechanisms of spoken language processing delay for children with cochlear implants
确定人工耳蜗植入儿童口语处理延迟的机制
  • 批准号:
    10537470
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
Determining the mechanisms of spoken language processing delay for children with cochlear implants
确定人工耳蜗植入儿童口语处理延迟的机制
  • 批准号:
    10669599
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
Doctoral Dissertation Research: Determining the mechanisms of spoken language processing delay for children with cochlear implants
博士论文研究:确定人工耳蜗儿童口语处理延迟的机制
  • 批准号:
    2141399
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Standard Grant
Doctoral Dissertation Research: Examining the Role of Literacy on Predictive Processing during Spoken Language Comprehension
博士论文研究:检验读写能力在口语理解过程中预测处理中的作用
  • 批准号:
    2146232
  • 财政年份:
    2022
  • 资助金额:
    $ 31.62万
  • 项目类别:
    Standard Grant
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了