Large Bocabularly Spoken Word Recognition system Using Phonemic Segmentation Units

使用音素分割单元的大型口语单词识别系统

基本信息

  • 批准号:
    05555102
  • 负责人:
  • 金额:
    $ 6.34万
  • 依托单位:
  • 依托单位国家:
    日本
  • 项目类别:
    Grant-in-Aid for Developmental Scientific Research (B)
  • 财政年份:
    1993
  • 资助国家:
    日本
  • 起止时间:
    1993 至 1994
  • 项目状态:
    已结题

项目摘要

In this research project, we substantiated that the speech recognition method based on the phonemic segmentation and phoneme labeling was very effective for the large vocabulary spoken word recognition, and we developed a high performance large vocabulary spoken word recognition system using the phonemic segmentation units.This spoken word recognition system is composed of the following subsystems : an acoustic analysis subsystem, phonemic segmentation units, phoneme labeling subsystem and word matching subsystem. one ofproblem of this word recognition system errors in the phonemic segmentation and phoneme labeling. We tried to improve the system in the phonemic segmentation and phoneme labeling.Trough this research project, we got the following good results.(1) We realized a high performance automatic phonemic segmentation unit for speaker and context independent Japanese speech recognition system. We substantiated that this segmentation unit was effective for the large vocabulary word recognition.(2) We developed a higher performance large vocabulary spoken word recognition system using the phonemic segmentation unit and phoneme labeling system.Experiments were carried out using the dictionaries of 1845 words and 4915 words to evaluate the system. The word recognition rates for the first candidate were found to be 96.5% and 94.5% for 1845 word and 4915 word dictionaries respectively. An estimated recognition rate for 20000 word dictionary was approximately 90%.(3) We proposed the parallel phonemic segmentation method in order to achieve a higher word recognition rate. Using the parallel phonemic segmentation unit, we obtained 1 or 2% higher recognition rate for 4915 word dictionay. We also proposed the parallel phoneme labeling method, and substantiated the method is very effective for realizing a higher recognition rate.
In this research project, we substantiated that the speech recognition method based on the phonemic segmentation and phoneme labeling was very effective for the large vocabulary spoken word recognition, and we developed a high performance large vocabulary spoken word recognition system using the phonemic segmentation units.This spoken word recognition system is composed of the following subsystems : an acoustic analysis subsystem, phonemic segmentation units, phoneme labeling subsystem and word匹配子系统。音调分割和音素标记中此单词识别系统错误的问题。我们试图改善音素分割和音素标签中的系统。陷入研究项目,我们得到了以下良好的结果。(1)我们意识到了扬声器和上下文独立的日本语音识别系统的高性能自动音素分割单元。我们证实了该分割单元对大型词汇识别识别是有效的。(2)我们使用语音分割单元和音素标记系统开发了更高的性能大型词汇式口语识别系统。使用1845个单词和4915个单词来评估该系统。发现第一个候选人的一词识别率分别为1845 Word和4915个单词词典的96.5%和94.5%。 20000个单词词典的估计识别率约为90%。(3)我们提出了平行的音素分割方法,以达到更高的单词识别率。使用平行的语音分割单元,我们获得了4915个单词dictionay的识别率提高1或2%。我们还提出了平行的音素标记方法,并证实该方法对于实现更高的识别率非常有效。

项目成果

期刊论文数量(25)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
C.Furuich, K.Aizawa and S.Imai: ""Automatic Phonemic Segmentation System of English Continuous Speech by Using Speaker-Independent Features" (in Japanese)" Trans.IEICE. J78-A[3]. 295-304 (1995)
C.Furuich、K.Aizawa 和 S.Imai:“使用与说话人无关的功能实现英语连续语音的自动音素分割系统”(日语)” Trans.IEICE。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
今井 聖: "信号処理工学" コロナ社(テレビジョン学会編), 201 (1993)
今井清:《信号处理工程》Coronasha(电视协会编),201(1993)
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
今井 聖: "並列音素ラベリング(PPL)方式による話者独立単語音声認識システム" 電子情報通信学会論文誌. J77-A. 143-152 (1994)
Kiyoshi Imai:“使用并行音素标记 (PPL) 方法的独立于说话人的单词语音识别系统”,电子、信息和通信工程师学会汇刊 J77-152 (1994)。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
L.Hu and S.Imai: ""Tone Recognition for Continuous Mandarin Speech" (in Japanese)" Trans.IEICE. J78-A[7]. (1995)
L.Hu 和 S.Imai:““连续普通话语音的声调识别”(日语)” Trans.IEICE。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
T.Kanno, T.Kobayashi and S.Imai: ""On the Use of a priori Information in Generalized Cepstral Modeling of Degraded Speech" (in Japanese)" Trans.IEICE. J77-A[7]. 945-953 (1994)
T.Kanno、T.Kobayashi 和 S.Imai:“关于在退化语音的广义倒谱建模中使用先验信息”(日语)”Trans.IEICE。
  • DOI:
  • 发表时间:
  • 期刊:
  • 影响因子:
    0
  • 作者:
  • 通讯作者:
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

IMAI Satoshi其他文献

IMAI Satoshi的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('IMAI Satoshi', 18)}}的其他基金

Phonemic Segmentation and Word Recognition for Dialogue Level Continuous Speech
对话级连续语音的音位分割和单词识别
  • 批准号:
    01420028
  • 财政年份:
    1989
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for General Scientific Research (A)
Study on a Speech Recognition-Synthesis System Based on Mel Cepstral Acoustic Processing and Multi-Level Knowledge Processing.
基于梅尔倒谱声学处理和多层次知识处理的语音识别合成系统研究。
  • 批准号:
    61460131
  • 财政年份:
    1986
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for General Scientific Research (B)

相似海外基金

ハイパーサーミア・シミュレーションに向けた大規模並列連成計算
热疗模拟的大规模并行耦合计算
  • 批准号:
    21K04200
  • 财政年份:
    2021
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
実環境の音声信号に適した統計モデリングおよび最適化アルゴリズムの開発
开发适合真实世界音频信号的统计建模和优化算法
  • 批准号:
    18J14238
  • 财政年份:
    2018
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for JSPS Fellows
To realize high-accuacy hyper-thermia simulation by parallel coupled analysis
通过并行耦合分析实现高精度热疗模拟
  • 批准号:
    17H03256
  • 财政年份:
    2017
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
Study on finite element methods for wave problems in unbounded domains and development of associated FEM softwares
无界域波浪问题的有限元方法研究及相关有限元软件的开发
  • 批准号:
    23540127
  • 财政年份:
    2011
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for Scientific Research (C)
Designing an ultra-hispeed search engine for big data of spoken documents
语音文档大数据超高速搜索引擎的设计
  • 批准号:
    22300060
  • 财政年份:
    2010
  • 资助金额:
    $ 6.34万
  • 项目类别:
    Grant-in-Aid for Scientific Research (B)
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了