AGILE: A Cloud Approach to Automatic Gene Expression Pattern Recognition and Annotation Over Large-Scale Images

AGILE:大规模图像上自动基因表达模式识别和注释的云方法

基本信息

  • 批准号:
    BB/K004077/1
  • 负责人:
  • 金额:
    $ 14.1万
  • 依托单位:
  • 依托单位国家:
    英国
  • 项目类别:
    Research Grant
  • 财政年份:
    2012
  • 资助国家:
    英国
  • 起止时间:
    2012 至 无数据
  • 项目状态:
    已结题

项目摘要

Modern biomedical research makes significant use of large datasets. Cloud computing is emerging as a cost-effective solution by providing virtual computers and storage disks on demand to store and process massive data efficiently without large upfront costs.Despite some progress made, the use of cloud computing in the biomedical research is still at the very early stage. There exist various concerns on how to best utilise the cloud for accelerating large-scale biomedical applications. Especially, can a biomedical application be directly migrated to the cloud without requiring any modification? How to develop a cloud-based biomedical application? What are the performance and the cost of an application in the cloud? Are the performance and the cost acceptable? Do we have optimal methods to keep both performance and the overall cost of applications within the acceptable range in the cloud?This project will develop a cloud approach for a real biomedical data intensive task for effective gene expression pattern recognition and annotation over large-scale image data through addressing the concerns above. This task is chosen largely for its importance in the biomedical research. This type of intensive data-analysis task is increasingly common in the biomedical sciences. This particular task concerns developmental anatomy of mouse embryo: it is of great interest to identify gene interactions and networks that are associated with developmental and physiological functions in the embryo by using anatomical annotation. The gene expression pattern recognition and annotation represents labelling embryo images with anatomical terms for mouse development. If an image is tagged with a term, it means the corresponding anatomical component shows expression of that gene. Currently, this task is mainly taken manually by domain experts. However, with the availability of the vast amount of data, a manual annotation is expensive and time consuming. Additionally, the manual annotation may also produce the inconsistency of labels across images introduced by the human annotators as it proves to be highly subjective. To alleviate issues with the manual annotation, we have employed data mining techniques to automatically identify an anatomical component in the embryo image and annotate the image using the provided terms. As this task involves the use of very large-scale images, we intend to exploit cloud computing for this task to address the massive data problems.It is expected that the successful completion of this project will provide a typical exemplar for accessing and exploiting cloud computing technologies to analyse large-scale image-based biomedical data. An important, and novel, aspect of this proposal is that the major concerns that limit the more widespread use of cloud computing for biomedical applications will be addressed. The theoretical component of the work aims to provide (1) a practical user-friendly biomedical data-mining tool based on the cloud for effective gene expression pattern recognition and annotation and (2) a set of standard services (e.g. image processing algorithms, data mining algorithms) and a novel automatic data reuse mechanism for performance enhancement and cost reduction, which can be reused and plugged into the class of similar biomedical applications.
现代生物医学研究大量使用大型数据集。云计算正在成为一种经济高效的解决方案,它通过按需提供虚拟计算机和存储磁盘来有效地存储和处理海量数据,而无需大量的前期成本。尽管取得了一些进展,但云计算在生物医学研究中的使用仍然处于起步阶段。早期。关于如何最好地利用云来加速大规模生物医学应用存在各种担忧。尤其是生物医学应用可以不做任何修改直接迁移到云端吗?如何开发基于云的生物医学应用?云中应用程序的性能和成本是多少?性能和成本是否可以接受?我们是否有最佳方法来将应用程序的性能和总体成本保持在可接受的范围内?该项目将为真正的生物医学数据密集型任务开发一种云方法,以便在大规模图像上进行有效的基因表达模式识别和注释数据通过解决上述问题。选择这项任务主要是因为它在生物医学研究中的重要性。这种类型的密集数据分析任务在生物医学科学中越来越常见。这项特殊任务涉及小鼠胚胎的发育解剖学:通过使用解剖注释来识别与胚胎发育和生理功能相关的基因相互作用和网络非常有意义。基因表达模式识别和注释代表用小鼠发育的解剖学术语标记胚胎图像。如果图像标有术语,则意味着相应的解剖成分显示该基因的表达。目前,该任务主要由领域专家手动完成。然而,由于存在大量数据,手动注释既昂贵又耗时。此外,手动注释还可能导致人类注释者引入的图像之间的标签不一致,因为它被证明是高度主观的。为了缓解手动注释的问题,我们采用数据挖掘技术来自动识别胚胎图像中的解剖成分,并使用提供的术语对图像进行注释。由于该任务涉及超大规模图像的使用,我们打算利用云计算来解决该任务的海量数据问题。预计该项目的成功完成将为云计算的接入和利用提供典型范例分析大规模基于图像的生物医学数据的技术。该提案的一个重要且新颖的方面是,限制云计算在生物医学应用中更广泛使用的主要问题将得到解决。这项工作的理论部分旨在提供(1)一个基于云的实用的用户友好的生物医学数据挖掘工具,用于有效的基因表达模式识别和注释;(2)一套标准服务(例如图像处理算法、数据挖掘算法)和一种新颖的自动数据重用机制,用于增强性能和降低成本,可以重用并插入到类似的生物医学应用程序中。

项目成果

期刊论文数量(5)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Automatic data reuse for accelerating data intensive applications in the Cloud
A genetic algorithm enhanced automatic data flow management solution for facilitating data intensive applications in the cloud
Enhancing Parallelism of Data-Intensive Bioinformatics Applications
增强数据密集型生物信息学应用的并行性
  • DOI:
    10.1109/eurosim.2013.93
  • 发表时间:
    2013
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Xie Z
  • 通讯作者:
    Xie Z
Augmented Petri Net Cost Model for Optimisation of Large Bioinformatics Workflows Using Cloud
  • DOI:
    10.1109/ems.2013.35
  • 发表时间:
    2013-11
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Zheng Xie;Liangxiu Han;R. Baldock
  • 通讯作者:
    Zheng Xie;Liangxiu Han;R. Baldock
Parallel data intensive applications using MapReduce: a data mining case study in biomedical sciences
  • DOI:
    10.1007/s10586-014-0405-9
  • 发表时间:
    2015-03
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Liangxiu Han;Hwee Yong Ong
  • 通讯作者:
    Liangxiu Han;Hwee Yong Ong
{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

数据更新时间:{{ journalArticles.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ monograph.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ sciAawards.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ conferencePapers.updateTime }}

{{ item.title }}
  • 作者:
    {{ item.author }}

数据更新时间:{{ patent.updateTime }}

Liangxiu Han其他文献

Dual Attention Multi-Instance Deep Learning for Alzheimer’s Disease Diagnosis With Structural MRI
使用结构 MRI 进行阿尔茨海默病诊断的双重关注多实例深度学习
  • DOI:
    10.1109/tmi.2021.3077079
  • 发表时间:
    2021-05
  • 期刊:
  • 影响因子:
    10.6
  • 作者:
    Wenyong Zhu;Liang Sun;Jiashuang Huang;Liangxiu Han;Daoqiang Zhang
  • 通讯作者:
    Daoqiang Zhang
Analyzing Gene Expression Imaging Data in Developmental Biology
分析发育生物学中的基因表达成像数据
  • DOI:
    10.1002/9781118540343.ch16
  • 发表时间:
    2013
  • 期刊:
  • 影响因子:
    2.1
  • 作者:
    Liangxiu Han;Jano van Hemert;I. Overton;Paolo Besana;R. Baldock
  • 通讯作者:
    R. Baldock
Supervised Hyperalignment for Multisubject fMRI Data Alignment
用于多主体 fMRI 数据对齐的监督超对齐
The self-adaptation to dynamic failures for efficient virtual organization formations in grid computing context
网格计算环境下高效虚拟组织形成的动态故障自适应
  • DOI:
  • 发表时间:
    2009
  • 期刊:
  • 影响因子:
    0
  • 作者:
    Liangxiu Han
  • 通讯作者:
    Liangxiu Han
A new approach to journal co-citation matrix construction based on the number of co-cited articles in journals
基于期刊共被引文章数构建期刊共被引矩阵的新方法
  • DOI:
    10.1007/s11192-019-03141-9
  • 发表时间:
    2019
  • 期刊:
  • 影响因子:
    3.9
  • 作者:
    Lijun Yang;Liangxiu Han;N. Liu
  • 通讯作者:
    N. Liu

Liangxiu Han的其他文献

{{ item.title }}
{{ item.translation_title }}
  • DOI:
    {{ item.doi }}
  • 发表时间:
    {{ item.publish_year }}
  • 期刊:
  • 影响因子:
    {{ item.factor }}
  • 作者:
    {{ item.authors }}
  • 通讯作者:
    {{ item.author }}

{{ truncateString('Liangxiu Han', 18)}}的其他基金

Synergising Process-Based and Machine Learning Models for Accurate and Explainable Crop Yield Prediction along with Environmental Impact Assessment
协同基于流程和机器学习模型,实现准确且可解释的作物产量预测以及环境影响评估
  • 批准号:
    BB/Y513763/1
  • 财政年份:
    2024
  • 资助金额:
    $ 14.1万
  • 项目类别:
    Research Grant
EYE-SCREEN-4-DPN: Development of an innovative Intelligent EYE imaging solution for SCREENing of Diabetic Peripheral Neuropathy
EYE-SCREEN-4-DPN:开发创新的智能眼部成像解决方案,用于筛查糖尿病周围神经病变
  • 批准号:
    EP/X013707/1
  • 财政年份:
    2023
  • 资助金额:
    $ 14.1万
  • 项目类别:
    Research Grant
UK-China Agritech Challenge: CropDoc - Precision Crop Disease Management for Farm Productivity and Food Security
中英农业科技挑战赛:CropDoc - 精准作物病害管理,提高农业生产力和粮食安全
  • 批准号:
    BB/S020969/1
  • 财政年份:
    2019
  • 资助金额:
    $ 14.1万
  • 项目类别:
    Research Grant
EPIC: An automated diagnostic tool for Potato Late Blight disease detection from images
EPIC:一种从图像检测马铃薯晚疫病的自动化诊断工具
  • 批准号:
    BB/R019983/1
  • 财政年份:
    2018
  • 资助金额:
    $ 14.1万
  • 项目类别:
    Research Grant

相似国自然基金

基于天气雷达回波参量的雷暴云电荷结构反演方法研究
  • 批准号:
    42305085
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
车载三维点云道路要素单体化分割理论与方法研究
  • 批准号:
    42371343
  • 批准年份:
    2023
  • 资助金额:
    49 万元
  • 项目类别:
    面上项目
海面舰船目标InISAR三维密集点云精确成像方法研究
  • 批准号:
    62301396
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
顾及云绝热率变化的气溶胶-海洋低云交互评估方法研究
  • 批准号:
    42301402
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目
LiDAR点云分布变化场景下三维目标跟踪的鲁棒与域泛化方法研究
  • 批准号:
    62301562
  • 批准年份:
    2023
  • 资助金额:
    30 万元
  • 项目类别:
    青年科学基金项目

相似海外基金

Genomic and Bioinformatic Core
基因组和生物信息学核心
  • 批准号:
    10556539
  • 财政年份:
    2023
  • 资助金额:
    $ 14.1万
  • 项目类别:
Molecular and Computational Tools for Identifying Somatic Mosaicism in Human Tissues
识别人体组织中体细胞镶嵌的分子和计算工具
  • 批准号:
    10661147
  • 财政年份:
    2023
  • 资助金额:
    $ 14.1万
  • 项目类别:
AnVIL Clinical Environment for Innovation and Translation (ACE-IT)
AnVIL 创新与转化临床环境 (ACE-IT)
  • 批准号:
    10747551
  • 财政年份:
    2023
  • 资助金额:
    $ 14.1万
  • 项目类别:
A comprehensive platform for low-cost screening and image-guided photodynamic therapy (PDT) of pre-malignant and malignant oral lesions in low resource settings
一个综合平台,用于在资源匮乏的环境中对癌前和恶性口腔病变进行低成本筛查和图像引导光动力治疗 (PDT)
  • 批准号:
    10648426
  • 财政年份:
    2023
  • 资助金额:
    $ 14.1万
  • 项目类别:
A high throughput multiplexed pipeline for models of Alzheimer’s Disease
用于阿尔茨海默病模型的高通量多重管道
  • 批准号:
    10766665
  • 财政年份:
    2023
  • 资助金额:
    $ 14.1万
  • 项目类别:
{{ showInfoDetail.title }}

作者:{{ showInfoDetail.author }}

知道了