Mathematical modeling of the temporal aspects of speech based on human perceptual and cognitive mechanisms
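Basic information
- Grant number: 16200016
- Principal investigator:
- Amount: $316,200
- Host institution:
- Host institution country: Japan
- Project category: Grant-in-Aid for Scientific Research (A)
- Fiscal year: 2004
- Funding country: Japan
- Duration: 2004 to 2007
- Project status: Completed
- Source:
- Keywords: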
Project abstract
The purpose of this project was to find clues by which humans retrieve the temporal structure of speech, to understand how those clues are used, and to establish a quantitative method for evaluating the temporal adequateness or naturalness of a given speech sound that can replicate the performance of human judgment. For this purpose, three fundamental investigative tasks were carried out: a study at the psychophysical level, a study at the linguistic level, and the construction of an evaluation model. One distinguishing feature of this project is that it emphasized the psychophysical aspects nearly as much as the linguistic ones, even though its primary object was spoken language.

Since a person can easily recognize speech as fast, even in a foreign language whose meaning is unknown, it is assumed that the processing of the temporal aspects of speech involves non-linguistic, and therefore language-independent, activities. By concentrating on processing that is independent of a given language, we developed the basic technology toward a system with small overhead for language processing as well as simple extensibility to multiple languages. The major results follow.

(I) Psychophysical level: An algorithm was developed to predict temporal reference points in a given speech signal by replicating the function of human auditory processing. This algorithm's most important benefit is its applicability to virtually unlimited language variations, since it does not require any linguistic knowledge.

(II) Linguistic level: An empirical study revealed that the factors affecting the perception of prosodic units vary depending on the particular language's choice of units. This finding provides practical implications concerning how much weight should be placed on prosodic factors when designing effective foreign-language training methods.

(III) Modeling: By integrating auditory functions derived from the investigation at the psychophysical level, a mathematical model was implemented to automatically evaluate the naturalness of the speech of English learners. The model's performance closely approximated the subjective evaluation of a native-speaking English instructor. This result not only suggests the importance of psychophysical factors in evaluating the adequateness or naturalness of speech, but also implies potential extensibility of the proposed model to multiple languages.
Project outputs
Journal articles (340)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Effects of auditory feedback in the practice phase of imitating a piano performance
- DOI:
- Publication year: 2006
- Journal:
- Impact factor: 0
- Authors: Takahashi, Noriyuki
- Corresponding author: Noriyuki
Prosody generation for communicative speech synthesis.
- DOI:
- Publication year: 2005
- Journal:
- Impact factor: 0
- Authors: Sagisaka, Y., Kokenawa, Y., Tsuzaki, M., and Kato, H.
- Corresponding author: H.
Judgment of onset asynchrony of two tone components and its relation to the cochlear delay.
- DOI:
- Publication year: 2005
- Journal:
- Impact factor: 0
- Authors: Aiba, E., Matsui, T., and Tsuzaki, M.
- Corresponding author: M.
Other grants by KATO Hiroaki
Recognition of pathogen-derived sphingolipid in plants.
- Grant number: 20K15528
- Fiscal year: 2020
- Funding amount: $316,200
- Project category: Grant-in-Aid for Early-Career Scientists
Structural basis for discrimination between multi-drug exporters and lipid floppases
- Grant number: 19K22495
- Fiscal year: 2019
- Funding amount: $316,200
- Project category: Grant-in-Aid for Challenging Research (Exploratory)
Structural basis for optimization of molecular probe using P-glycoprotein in vivo imaging
- Grant number: 24659018
- Fiscal year: 2012
- Funding amount: $316,200
- Project category: Grant-in-Aid for Challenging Exploratory Research
Protein structural data mining based on the Neighborhood Fragment Spectra representation
- Grant number: 22500130
- Fiscal year: 2010
- Funding amount: $316,200
- Project category: Grant-in-Aid for Scientific Research (C)
Studies on stability of fission yeast heterochromatin and its regulators
- Grant number: 21870024
- Fiscal year: 2009
- Funding amount: $316,200
- Project category: Grant-in-Aid for Research Activity Start-up
High frequency and high field magnetic resonance in panoscopic-assembled rare earth magnets
- Grant number: 20900111
- Fiscal year: 2008
- Funding amount: $316,200
- Project category:
A Modeling of Prosody Perception for Second Language Learning
- Grant number: 20300069
- Fiscal year: 2008
- Funding amount: $316,200
- Project category: Grant-in-Aid for Scientific Research (B)
Three-dimensional structural similarity search of proteins based on Geometrical Fragment Spectra
- Grant number: 19700139
- Fiscal year: 2007
- Funding amount: $316,200
- Project category: Grant-in-Aid for Young Scientists (B)
Structural biology of membrane protein transporters
- Grant number: 17380066
- Fiscal year: 2005
- Funding amount: $316,200
- Project category: Grant-in-Aid for Scientific Research (B)
Basic Study on Decentralized Society: role of State Governments in the USA and Provincial Governments in Canada
- Grant number: 16310166
- Fiscal year: 2004
- Funding amount: $316,200
- Project category: Grant-in-Aid for Scientific Research (B)
Similar overseas grants
Studies of speech, image and natural language processing for multimodal spoken document retrieval
- Grant number: 23K11216
- Fiscal year: 2023
- Funding amount: $316,200
- Project category: Grant-in-Aid for Scientific Research (C)
Determining the mechanisms of spoken language processing delay for children with cochlear implants
- Grant number: 10537470
- Fiscal year: 2022
- Funding amount: $316,200
- Project category:
Determining the mechanisms of spoken language processing delay for children with cochlear implants
- Grant number: 10669599
- Fiscal year: 2022
- Funding amount: $316,200
- Project category:
Doctoral Dissertation Research: Determining the mechanisms of spoken language processing delay for children with cochlear implants
- Grant number: 2141399
- Fiscal year: 2022
- Funding amount: $316,200
- Project category: Standard Grant
Doctoral Dissertation Research: Examining the Role of Literacy on Predictive Processing during Spoken Language Comprehension
- Grant number: 2146232
- Fiscal year: 2022
- Funding amount: $316,200
- Project category: Standard Grant