Professional to Plain Language Neural Translation: A Path Toward Actionable Health Information
专业到通俗语言的神经翻译:通向可行健康信息的道路
基本信息
- 批准号:10349319
- 负责人:
- 金额:$ 19.04万
- 依托单位:
- 依托单位国家:美国
- 项目类别:
- 财政年份:2022
- 资助国家:美国
- 起止时间:2022-03-01 至 2024-02-29
- 项目状态:已结题
- 来源:
- 关键词:BlindedCOVID-19 pandemicCharacteristicsClinicalComplexComprehensionConsumptionCustomDatabasesEnsureEvaluationFaceGeneral PopulationGenerationsHealthHealth ProfessionalHealth behaviorHealthcareHumanKnowledgeLanguageLearningLiteratureManualsMechanicsMedicalMethodsModelingNatural Language ProcessingParticipantPatient RecruitmentsPeer ReviewPerformanceProceduresReadabilityReaderReadingResearchSourceStructureSystemTextTrainingTranslatingTranslationsTreatment outcomeVocabularyWorkbasecomputer generateddeep learningdeep learning modelhealth literacyimprovedimproved outcomeknowledge baseliteracymachine translationmodel buildingmulti-task learningneural modelneural networknoveloptimal treatmentsrelating to nervous systemstemsystematic reviewtransfer learning
项目摘要
Health literacy is key to making well-informed health decisions that improve outcomes. However, while the peer-
reviewed clinical literature contains valuable information to guide health decisions, it is generally written for an
audience of healthcare professionals. Even in the context of good general literacy, medical jargon and the
complex structure of professional language make this information especially hard to interpret. While efforts
have been made to summarize some of this literature in plain language to make it accessible to the general
public, these efforts depend on human expertise. This approach cannot scale to match the rapid pace at which
new findings emerge in the literature. Thus, there is an urgent unmet need for automated methods to enhance
the accessibility of the canonical biomedical literature to the general public. This problem can be framed as a
type of translation problem, between the language of healthcare professionals, and that of healthcare
consumers. The proposed research builds on recent advances in deep learning stemming from neural sequence-
to-sequence models, which were originally evaluated in machine translation tasks. In our recent work, we
showed these models can be effectively adapted to the task of translating between abstracts in the Cochrane
Database of Systematic Reviews (CDSR) and corresponding professionally-authored plain language
summaries. The resulting automatically-generated summaries outperformed those from other models in their
alignment with professionally-authored summaries. Furthermore, in a pilot user evaluation in which participants
were blinded as to summary provenance, they were generally judged favorably to their expert-authored
counterparts. In the proposed research we will develop this line of research further, by evaluating the utility of
additional pre-training and auxiliary fine-tuning tasks as a means to improve the quality of generated summaries.
We will also customize the models concerned to enhance their factual accuracy and readability using novel
auxiliary training objectives and post-processing procedures. We will evaluate our methods as compared with
robust baseline models in system-centric evaluations of content alignment with reference summaries, readability
and factual correctness. Using Mechanical Turk, we will conduct user-centric evaluations of the ease with which
summaries from best-performing models can be understood, as compared with CDSR expert-authored plain
language summaries. These evaluations will consider both perceived interpretability, and actual comprehension,
with the latter evaluated using sets of multiple choice questions to probe comprehension, recall and learning. In
doing so, the proposed research will advance the state-of-the-art in automated simplification and summarization
of the biomedical literature for consumption by the general public.
健康素养是做出明智的健康决策以改善结果的关键。但是,而同行
回顾的临床文献包含有价值的信息来指导健康决策,通常是为
医疗保健专业人员的受众。即使在良好的一般识字率的背景下,医疗术语和
专业语言的复杂结构使此信息特别难以解释。努力
已经用简单的语言总结了一些文献,以使其通用访问
公众,这些努力取决于人类的专业知识。这种方法无法扩展以匹配快速的步伐
文献中出现了新发现。因此,紧急未满足的自动化方法可以增强
规范生物医学文学对公众的可访问性。这个问题可以被构成
翻译问题的类型,医疗保健专业人员的语言与医疗保健的语言之间
消费者。拟议的研究是基于从神经序列的最新进展的基础
最初在机器翻译任务中评估的序列模型。在我们最近的工作中,我们
表明这些模型可以有效地适应于Cochrane中摘要之间翻译的任务
系统评价数据库(CDSR)和相应的专业纯语言
摘要。由此产生的自动生成的摘要的表现优于其他模型的摘要
与专业作品的摘要保持一致。此外,在参与者的试点用户评估中
对于摘要出处而言,他们对他们的专家作者有利。
同行。在拟议的研究中,我们将通过评估效用
其他预培训和辅助微调任务是提高生成摘要质量的一种手段。
我们还将自定义有关模型,以增强其事实准确性和使用新颖性
辅助培训目标和后处理程序。我们将评估我们的方法
参考摘要,可读性的内容对齐方式评估中以系统为中心的基线模型
和事实正确性。使用机械土耳其人,我们将对以用户为中心的评估来进行轻松的评估
与CDSR专家纯正的Plain相比,可以理解表现最佳模型的摘要
语言摘要。这些评估将考虑可解释性和实际理解力,
使用后者使用一组多项选择问题进行评估,以探究理解,回忆和学习。在
这样做,拟议的研究将推进自动简化和汇总的最新研究
公众消费的生物医学文献。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Trevor Cohen其他文献
Trevor Cohen的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Trevor Cohen', 18)}}的其他基金
DeconDTN: Deconfounding Deep Transformer Networks for Clinical NLP
DeconDTN:为临床 NLP 解构深度 Transformer 网络
- 批准号:
10626888 - 财政年份:2022
- 资助金额:
$ 19.04万 - 项目类别:
Professional to Plain Language Neural Translation: A Path Toward Actionable Health Information
专业到通俗语言的神经翻译:通向可行健康信息的道路
- 批准号:
10579898 - 财政年份:2022
- 资助金额:
$ 19.04万 - 项目类别:
DeconDTN: Deconfounding Deep Transformer Networks for Clinical NLP
DeconDTN:为临床 NLP 解构深度 Transformer 网络
- 批准号:
10467107 - 财政年份:2022
- 资助金额:
$ 19.04万 - 项目类别:
DeconDTN: Deconfounding Deep Transformer Networks for Clinical NLP
DeconDTN:为临床 NLP 解构深度 Transformer 网络
- 批准号:
10711315 - 财政年份:2022
- 资助金额:
$ 19.04万 - 项目类别:
Computerized assessment of linguistic indicators of lucidity in Alzheimer's Disease dementia
阿尔茨海默病痴呆症语言清醒度指标的计算机化评估
- 批准号:
10093304 - 财政年份:2020
- 资助金额:
$ 19.04万 - 项目类别:
Using Biomedical Knowledge to Identify Plausible Signals for Pharmacovigilance
利用生物医学知识识别药物警戒的合理信号
- 批准号:
8914098 - 财政年份:2013
- 资助金额:
$ 19.04万 - 项目类别:
Using Biomedical Knowledge to Identify Plausible Signals for Pharmacovigilance
利用生物医学知识识别药物警戒的合理信号
- 批准号:
8727094 - 财政年份:2013
- 资助金额:
$ 19.04万 - 项目类别:
Encoding Semantic Knowledge in Vector Space for Biomedical Information
在生物医学信息的向量空间中编码语义知识
- 批准号:
8138564 - 财政年份:2010
- 资助金额:
$ 19.04万 - 项目类别:
Encoding Semantic Knowledge in Vector Space for Biomedical Information
在生物医学信息的向量空间中编码语义知识
- 批准号:
7977263 - 财政年份:2010
- 资助金额:
$ 19.04万 - 项目类别:
相似海外基金
Clinical Trial Readiness for Children 0-5 years with Congenital Muscular Dystrophy Secondary to LAMA2 Mutations
0-5 岁 LAMA2 突变继发先天性肌营养不良症儿童的临床试验准备情况
- 批准号:
10686586 - 财政年份:2023
- 资助金额:
$ 19.04万 - 项目类别:
Professional to Plain Language Neural Translation: A Path Toward Actionable Health Information
专业到通俗语言的神经翻译:通向可行健康信息的道路
- 批准号:
10579898 - 财政年份:2022
- 资助金额:
$ 19.04万 - 项目类别:
Addressing disparities in ASD diagnosis using a direct-to-home telemedicine tool: Evaluation of diagnostic accuracy, psychometric properties, and family engagement
使用直接到家远程医疗工具解决 ASD 诊断中的差异:评估诊断准确性、心理测量特性和家庭参与度
- 批准号:
10277413 - 财政年份:2021
- 资助金额:
$ 19.04万 - 项目类别: