Flexible and Efficient Text Storage, Management, and Retrieval
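Basic information
- Grant number: RGPIN-2016-03659
- Principal investigator:
- Amount: $18,900
- Host institution:
- Host institution country: Canada
- Program: Discovery Grants Program - Individual
- Fiscal year: 2020
- Funding country: Canada
- Period: 2020-01-01 to 2021-12-31
- Project status: Completed
- Source:
- Keywords: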
Project abstract
Applications relying on text must be built upon solid foundations of data organization similar to those that underlie conventional database systems. Thus the central long-term objective of the research program is to support document storage and management by applying sound database principles to this domain of information. The challenge is to discover how the complexity of text, with its intricate structure and diversity of expression, can be efficiently and effectively accessed and managed. In the short term, this objective will be addressed through the design and implementation of improved search engines.
The research program will include three related investigations. I will explore problems of general-purpose search engines, including aspects specific to reference texts, where nested structure is central to locating data. For example, I will investigate how the mechanism of database views can be best applied to describe, design, and implement an index that provides efficient search, including support for phrasal queries. I will also examine aspects specific to providing optometry researchers with access to Electronic Medical Records, where free-text fields include many spelling errors and where ensuring patient privacy is mandatory. In the third thrust, I will concentrate on aspects specific to mathematics information retrieval, where the relative positioning of symbols is at least as important as the particular symbols used. The inadequacy of existing solutions leads to the design of improved algorithms, which I then analyze, implement, and evaluate. Standard benchmarks provide the basis for comparative evaluations, but often additional test collections must be developed to examine specific aspects of the problems from unexplored perspectives. Graduate students will participate in all aspects of this research, resulting in highly qualified personnel (HQP) well prepared to contribute to this area.
The centrality of documents in recording and preserving information suggests that this research remains timely and crucial to economic growth in Canada. The integration of information retrieval and database management is well recognized as important to Canadian business and could be equally important in promoting Canadian culture. More specifically, the reliance on text search has become a universal need, with search engines playing critical roles in gathering information internet-wide, from across one or more enterprises, or from one's own personal data collections. In addition, other text applications are found throughout business and government. Publishers, data providers (e.g., via the World Wide Web), and organizations that rely on any form of text-dominated knowledge base for conducting their internal and external business will benefit from specific tools that arise from this research as well as from the theory, which will provide a framework for designing their text management systems.
Project outputs
Journal articles (0)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Other grants held by Tompa, Frank
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2021
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2019
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2018
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2017
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2016
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible, efficient text databases
- Grant number: 9292-2010
- Fiscal year: 2015
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible, efficient text databases
- Grant number: 9292-2010
- Fiscal year: 2013
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible, efficient text databases
- Grant number: 9292-2010
- Fiscal year: 2012
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible, efficient text databases
- Grant number: 9292-2010
- Fiscal year: 2011
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible, efficient text databases
- Grant number: 9292-2010
- Fiscal year: 2010
- Amount: $18,900
- Program: Discovery Grants Program - Individual
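Similar National Natural Science Foundation of China (NSFC) grants
Research on high-efficiency multi-channel power combining technology based on terahertz traveling-wave tube amplifiers
- Grant number: 62371102
- Approval year: 2023
- Amount: 490,000 CNY
- Program: General Program
Accurate prediction and evaluation of nitrogen-efficient wheat phenotypes based on coupling proximal remote sensing with models
- Grant number: 32371990
- Approval year: 2023
- Amount: 500,000 CNY
- Program: General Program
Research on high-efficiency micro-power rectification mechanisms under the synergy of multi-source ambient energy
- Grant number: 62301348
- Approval year: 2023
- Amount: 300,000 CNY
- Program: Young Scientists Fund
Efficient and intelligent cloud-edge-device collaborative scheduling mechanisms for mobile edge networks
- Grant number: 62302343
- Approval year: 2023
- Amount: 300,000 CNY
- Program: Young Scientists Fund
Research on SPH models and efficient algorithms for deformation and flow of saturated sand considering pile-soil-water coupling effects
- Grant number: 12302257
- Approval year: 2023
- Amount: 300,000 CNY
- Program: Young Scientists Fund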
Similar overseas grants
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2021
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2019
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2018
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2017
- Amount: $18,900
- Program: Discovery Grants Program - Individual
Flexible and Efficient Text Storage, Management, and Retrieval
- Grant number: RGPIN-2016-03659
- Fiscal year: 2016
- Amount: $18,900
- Program: Discovery Grants Program - Individual