RI: Large:Collaborative Research: Richer Representations for Machine Translation
RI:大型:协作研究:更丰富的机器翻译表示
基本信息
- 批准号:0910611
- 负责人:
- 金额:$ 54万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2009
- 资助国家:美国
- 起止时间:2009-09-01 至 2013-08-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Research in machine translation of human languages has made substantial progress recently, and surface patterns gleaned automatically from online bilingual texts work remarkably well for some language pairs. However, for many language pairs, the output of even the best systems is garbled, ungrammatical, and difficult to interpret. Chinese-to-English systems need particular improvement, despite the importance of this language pair, while English-to-Chinese translation, equally important for communication between individuals, is rarely studied. This project develops methods for automatically learning correspondences between Chinese and English at a semantic rather than surface level, allowing machine translation to benefit from recent work in semantic analysis of text and natural language generation. One part of this work determines what types of semantic analysis of source language sentences can best inform a translation system, focusing on analyzing dropped arguments, co-reference links, and discourse relations between clauses. These linguistic phenomena must generally be made more explicit when translating from Chinese to English. A second part of the work integrates natural language generation into statistical machine translation, leveraging generation technology to determine sentence boundaries, ordering of constituents, and production of function words that translation systems tend to get wrong. A third part develops and compares algorithms for training and decoding machine translation models defined on semantic representations. All of this research exploits newly-developed linguistic resources for semantic analysis of both Chinese and English. The ultimate benefits of improved machine translation technology are easier access to information and easier communication between individuals. This in turn leads to increased opportunities for trade, as well as better understanding between cultures. This project's systems for both Chinese-to-English and English-to-Chinese are developed with the expectation that the approaches will be applied to other language pairs in the future.
人类语言机器翻译的研究最近取得了实质性进展,从在线双语文本自动收集的表面模式对于某些语言对来说效果非常好。然而,对于许多语言对来说,即使是最好的系统的输出也是乱码、不符合语法且难以解释的。尽管这种语言对很重要,但汉译英系统需要特别改进,而对于个人之间的交流同样重要的英译汉翻译却很少被研究。该项目开发了在语义而非表面层面自动学习中文和英文之间对应关系的方法,使机器翻译能够从文本语义分析和自然语言生成的最新工作中受益。这项工作的一部分确定了哪些类型的源语言句子语义分析可以最好地为翻译系统提供信息,重点分析删除的论点、共指链接和子句之间的话语关系。在汉译英时,这些语言现象通常必须更加明确。该工作的第二部分将自然语言生成集成到统计机器翻译中,利用生成技术来确定句子边界、成分排序以及翻译系统容易出错的功能词的生成。第三部分开发并比较了用于训练和解码基于语义表示定义的机器翻译模型的算法。所有这些研究都利用新开发的语言资源进行中文和英语的语义分析。改进的机器翻译技术的最终好处是更容易获取信息和人与人之间的交流。这反过来又带来了贸易机会的增加以及文化之间更好的理解。该项目开发了汉英和英汉系统,期望这些方法将来能够应用于其他语言对。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Daniel Gildea其他文献
Daniel Gildea的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Daniel Gildea', 18)}}的其他基金
RI: Small: Cache transition systems for sentence understanding and generation
RI:小型:用于句子理解和生成的缓存转换系统
- 批准号:
1813823 - 财政年份:2018
- 资助金额:
$ 54万 - 项目类别:
Standard Grant
EAGER: Collaborative Research: Scaling Up Discriminative Learning for Natural Language Understanding and Translation
EAGER:协作研究:扩大自然语言理解和翻译的判别学习
- 批准号:
1446996 - 财政年份:2014
- 资助金额:
$ 54万 - 项目类别:
Standard Grant
CAREER: Semantics for Statistical Machine Translation
职业:统计机器翻译语义
- 批准号:
0546554 - 财政年份:2006
- 资助金额:
$ 54万 - 项目类别:
Continuing Grant
相似国自然基金
大环超分子对有机污染物及其降解中间体的自由基激发与诱导机制
- 批准号:52370168
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
基于fMRI大尺度时变网络变异性的个体ERP波形预测研究
- 批准号:82372084
- 批准年份:2023
- 资助金额:48 万元
- 项目类别:面上项目
开发区跨界合作网络的形成机理与区域效应:以三大城市群为例
- 批准号:42301183
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
抵挡汤早期干预抑制外膜滋养血管新生减轻血管钙化延缓2型糖尿病大血管病变发生的作用机制研究
- 批准号:82374247
- 批准年份:2023
- 资助金额:49 万元
- 项目类别:面上项目
利用衬底轨道过滤效应构筑大能隙二维拓扑绝缘体的研究
- 批准号:12304199
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
相似海外基金
Collaborative Research: RI:Medium:MoDL:Mathematical and Conceptual Understanding of Large Language Models
合作研究:RI:Medium:MoDL:大型语言模型的数学和概念理解
- 批准号:
2211779 - 财政年份:2022
- 资助金额:
$ 54万 - 项目类别:
Standard Grant
Collaborative Research: RI:Medium:MoDL:Mathematical and Conceptual Understanding of Large Language Models
合作研究:RI:Medium:MoDL:大型语言模型的数学和概念理解
- 批准号:
2211780 - 财政年份:2022
- 资助金额:
$ 54万 - 项目类别:
Standard Grant
Collaborative Research: RI: Medium: A Rigorous, General Framework for Tractable Learning of Large-Scale DAGs from Data
协作研究:RI:Medium:从数据中轻松学习大规模 DAG 的严格通用框架
- 批准号:
1956330 - 财政年份:2020
- 资助金额:
$ 54万 - 项目类别:
Continuing Grant
Collaborative Research: RI: Medium: A Rigorous, General Framework for Tractable Learning of Large-Scale DAGs from Data
协作研究:RI:Medium:从数据中轻松学习大规模 DAG 的严格通用框架
- 批准号:
1955532 - 财政年份:2020
- 资助金额:
$ 54万 - 项目类别:
Continuing Grant
RI: Medium: Collaborative Research: Incorporating Biologically-Motivated Circuit Motifs into Large-Scale Deep Neural Network Models of the Brain
RI:中:协作研究:将生物驱动的电路基序纳入大脑的大规模深度神经网络模型
- 批准号:
1704938 - 财政年份:2017
- 资助金额:
$ 54万 - 项目类别:
Standard Grant