Development for speech interface for form -based in formation access services on Web

基于表单的Web信息访问服务语音接口的开发

基本信息

批准号：
13558033
负责人：
NAKAGAWA Seiichi
金额：
$ 4.29万
依托单位：
Toyohashi University Technology
依托单位国家：
日本
项目类别：
Grant-in-Aid for Scientific Research (B)
财政年份：
2001
资助国家：
日本
起止时间：
2001 至 2003
项目状态：
已结题

项目摘要

While some speech interface systems have been developed for accessing Web resources, they are limited for accessing some specific contents and they don't provide a universal interface for arbitrary information retrieval services on the WWW. We propose an interactive speech user interface system, which could be applied to many form-based information retrieval services of the WVVW. In particular, our system was implemented based on a client-server, a Web proxy-centered architecture and employed an information extraction and language processing of HTML documents for providing a general-purpose interface for many form-based WWW contents. We also performed some experiments by 12 subjects for the comparison of the usability under different usage conditions. As a result, the proposed system attained comparative and higher expected usability measures over the pen-touch input method under the condition of an ideal speech recognition performance, and could be expected to achieve the effectivenes … More s or the superiority over a pen touch-only interface in terms of the usability as their usage condition approaches to a realistic PDA usage condition.We also proposed an. interface for a name input based on speech recognition using syllable-based N-gram and a word dictionary, which was frequently required to input into form-based web pages. User first utters a name and then chooses the correct word/syllables by pen touch from word/syllable candidates which were obtained from speech recognition. Name utterance is hard to recognize accurately because of the large vocabulary size, so the system uses continuous syllable recognition with syllable-based N-gram and isolated word recognition with a dictionary containing frequent words. The user can find the correct the answer from word candidates or syllable sequence candidates at a rate of 82-86%, and can input correct name at a rate of 94-96% with syllable selection from the syllable lattice. Some subjects used this interface and felt that it was useful. Less

尽管已经开发了一些语音接口系统用于访问Web资源，但它们限制用于访问某些特定内容，并且没有为www上的任意信息检索服务提供通用界面。我们提出了一个交互式语音用户界面系统，该系统可以应用于WVVW的许多基于表单的信息检索服务。特别是，我们的系统是基于客户端服务器，以Web代理为中心的体系结构实现的，并采用了HTML文档的信息提取和语言处理，以为许多基于表单的WWW内容提供通用界面。我们还通过12名受试者进行了一些实验，以比较不同使用条件下的可用性。结果，在理想的语音识别性能的条件下，提出的系统在笔触输入方法上实现了比较和更高的预期可用性度量，并且可以预期可以实现有效性……在使用ANERACIST PDA使用条件下，我们的使用条件方法更高的是纯可用性的唯一性界面。基于语音识别的名称输入的接口使用基于Sylable的n-gram和一个单词词典，通常需要将其输入基于表单的网页。用户首先说明一个名称，然后从单词/sylable候选者中选择正确的单词/sylables，这些单词/sylables是从语音识别中获得的。由于词汇大小较大，因此很难准确地识别名称话语，因此系统使用基于音节的N-gram的连续音节识别和带有经常包含单词的字典的孤立单词识别。用户可以以82-86％的速率找到正确的答案或音节序列的答案，并且可以以94-96％的速度输入正确的名称，并从音节晶格中选择音节。一些受试者使用了此界面，并认为它很有用。较少的