CAREER: From Data to Knowledge: Extracting and Utilizing Concept Graphs in Online Environments
职业:从数据到知识:在线环境中提取和利用概念图
基本信息
- 批准号:1652674
- 负责人:
- 金额:$ 49.99万
- 依托单位:
- 依托单位国家:美国
- 项目类别:Continuing Grant
- 财政年份:2017
- 资助国家:美国
- 起止时间:2017-06-01 至 2017-12-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
Knowledge bases today are central to the successful utilization of information available in the large and growing amounts of digital data on the Web. Such technologies have started to unleash a transformation of Web search from a keyword match to discovery, learning, and creativity, which are crucial to promoting the goal of knowledge discovery. Unfortunately, the search for information remains inherently difficult for significant portions of the Web such as the Scholarly Web, which contains many millions of scientific documents. For example, PubMed has over 20 million documents, whereas Google Scholar is estimated to have more than 100 million. Open-access digital libraries such as CiteSeerX, which acquire freely-available research articles from the Web, witness an increase in their document collections as well. Despite attractive advancements by scholarly search portals, semantic search technologies that "understand" complex concepts and their relations and can systematically satisfy users' intricate information needs have yet to be investigated on the Scholarly Web. The goal of this project is to design solutions to make information more accessible and comprehensible to Scholarly Web users in particular, and Web users in general, and to help them discover knowledge more effectively and efficiently. The approach taken will be to develop an integrated framework, focusing on the extraction and utilization of scholarly knowledge graphs in online scholarly environments. Educationally, this work will involve: training of graduate, undergraduate, and high-school students, particularly encouraging the participation of women and underrepresented groups in the research efforts; curriculum development and integration of research into courses taught by the PI; exposure of students to industry and international experiences; and education for the general public. The project will target the following research objectives: (1) explore the construction of scholarly knowledge graphs that combine evidence from multiple resources in an open information extraction framework; (2) design and develop novel algorithms for the detection and analysis of interesting and previously unknown connections between concepts, in order to enforce knowledge discovery on the Scholarly Web; and (3) investigate the utility of scholarly knowledge graphs in a question answering system. The results of this research will be integrated into the CiteSeerX digital library (http://citeseerx.ist.psu.edu). The software, tools, and benchmark datasets, which will be developed during the course of this project will be made publicly available. All findings will be shared with the research community through publications in academic journals and presented in Information Retrieval, Text Mining and Natural Language Processing conferences. For further information, see the project web page: http://www.cse.unt.edu/~ccaragea/skg.html.
当今的知识库是成功利用网络上大量数字数据中可用信息的核心。这些技术已经开始释放从关键字匹配到发现,学习和创造力的Web搜索转换,这对于促进知识发现的目标至关重要。不幸的是,对于大部分网络(例如学术网络),对信息的搜索仍然固有地困难,其中包含数百万个科学文档。例如,PubMed拥有超过2000万个文件,而Google Scholar估计拥有超过1亿个文件。从网络中获取可自由获取的研究文章的开放式数字图书馆,例如Citeseerx,也见证了其文档藏品的增加。尽管学术搜索门户网站取得了诱人的进步,但语义搜索技术“了解”复杂的概念及其关系,并且可以系统地满足用户的复杂信息需求,尚未在学术网络上进行调查。该项目的目的是设计解决方案,以使信息更容易访问,尤其是学术网络用户,以及一般的Web用户,并帮助他们更有效,更有效地发现知识。采用的方法是开发一个集成的框架,重点是在线学术环境中学术知识图的提取和利用。在教育上,这项工作将涉及:培训研究生,本科生和高中生,特别是鼓励妇女和人数不足的群体参与研究工作;课程开发和整合在PI教授的课程中;学生接触行业和国际经验;和对公众的教育。该项目将针对以下研究目标:(1)探索在开放信息提取框架中结合了来自多个资源的证据的学术知识图的构建; (2)设计和开发新颖的算法,以检测和分析概念之间有趣且以前未知的联系,以便在学术网络上实施知识发现; (3)在问题回答系统中研究学术知识图的实用性。这项研究的结果将集成到Citeseerx数字图书馆(http://citeseerx.ist.psu.edu)中。将在该项目过程中开发的软件,工具和基准数据集将公开使用。所有发现将通过学术期刊的出版物与研究社区共享,并在信息检索,文本挖掘和自然语言处理会议中介绍。有关更多信息,请参见项目网页:http://www.cse.unt.edu/~ccaragea/skg.html。
项目成果
期刊论文数量(0)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
数据更新时间:{{ journalArticles.updateTime }}
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Cornelia Caragea其他文献
Metadata Repository
元数据存储库
- DOI:
10.1007/978-0-387-39940-9_3058 - 发表时间:
2009 - 期刊:
- 影响因子:0
- 作者:
Cornelia Caragea;Vasant G Honavar;P. Boncz;P. Larson;S. Dietrich;Gonzalo Navarro;B. Thuraisingham;Yan Luo;Ouri E. Wolfson;S. Beitzel;Eric C. Jensen;O. Frieder;C. Jensen;N. Tradisauskas;E. Munson;A. Wun;K. Goda;Stephen E. Fienberg;Jiashun Jin;Guimei Liu;Nick Craswell;T. Pedersen;Cesare Pautasso;M. Moro;S. Manegold;B. Carminati;Marina Blanton;S. Bouchenak;Noël de Palma;Wei Tang;C. Quix;M. Jeusfeld;R. K. Pon;David J. Buttler;W. Meng;P. Zezula;Michal Batko;Vlastislav Dohnal;J. Domingo;Denilson Barbosa;I. Manolescu;Jeffrey Xu Yu;E. Cecchet;Vivien Quéma;Xifeng Yan;G. Santucci;D. Zeinalipour;Panos K. Chrysanthis;A. Deshpande;Carlos Guestrin;S. Madden;C. Leung;R. H. Güting;Amarnath Gupta;Heng Tao Shen;G. Weikum;Ramesh Jain;J. Yu;P. Ciaccia;K. Candan;M. Sapino;C. Meghini;F. Sebastiani;U. Straccia;F. Nack;V. S. Subrahmanian;Maria Vanina Martinez;D. Reforgiato;T. Westerveld;M. Sebillo;G. Vitiello;M. De Marsico;K. Voruganti;C. Parent;S. Spaccapietra;C. Vangenot;E. Zimányi;Prasan Roy;S. Sudarshan;E. Puppo;Peer Kröger;M. Renz;H. Schuldt;Solmaz Kolahi;A. Unwin;W. Cellary - 通讯作者:
W. Cellary
Scientific Keyphrase Identification and Classification by Pre-Trained Language Models Intermediate Task Transfer Learning
通过预训练语言模型进行科学的关键词识别和分类中间任务迁移学习
- DOI:
- 发表时间:
2020 - 期刊:
- 影响因子:0
- 作者:
Seoyeon Park;Cornelia Caragea - 通讯作者:
Cornelia Caragea
Semantic Tokenizer for Enhanced Natural Language Processing
用于增强自然语言处理的语义分词器
- DOI:
10.48550/arxiv.2304.12404 - 发表时间:
2023 - 期刊:
- 影响因子:0
- 作者:
Sandeep Mehta;Darpan Shah;Ravindra Kulkarni;Cornelia Caragea - 通讯作者:
Cornelia Caragea
A Group-Based Personalized Model for Image Privacy Classification and Labeling
基于群体的个性化图像隐私分类和标签模型
- DOI:
10.24963/ijcai.2017/552 - 发表时间:
2017 - 期刊:
- 影响因子:0
- 作者:
Haoti Zhong;A. Squicciarini;David J. Miller;Cornelia Caragea - 通讯作者:
Cornelia Caragea
MEDLINE/ PubMed
MEDLINE/PubMed
- DOI:
10.1007/978-0-387-39940-9_3039 - 发表时间:
2004 - 期刊:
- 影响因子:3.8
- 作者:
Cornelia Caragea;V. Honavar;P. Boncz;P. Larson;S. Dietrich;Gonzalo Navarro;Bhavani Thuraisingham;Yan Luo;Ouri E. Wolfson;S. Beitzel;Eric C. Jensen;Ophir Frieder;Christian S. Jensen;N. Tradisauskas;Ethan V. Munson;A. Wun;K. Goda;Stephen E. Fienberg;Jiashun Jin;Guimei Liu;Nick Craswell;T. Pedersen;Cesare Pautasso;M. Moro;S. Manegold;B. Carminati;Marina Blanton;Sara Bouchenak;Noël de Palma;Wei Tang;Christoph Quix;M. Jeusfeld;R. K. Pon;David J. Buttler;W. Meng;P. Zezula;Michal Batko;Vlastislav Dohnal;J. Domingo;Denilson Barbosa;Ioana Manolescu;Jeffrey Xu Yu;Emmanuel Cecchet;Vivien Quéma;Xifeng Yan;G. Santucci;D. Zeinalipour;Panos K. Chrysanthis;Amol Deshpande;Carlos Guestrin;Samuel Madden;Carson Kai;R. H. Güting;Amarnath Gupta;Heng Tao Shen;G. Weikum;Ramesh Jain;Jeffrey Xu Yu;Paolo Ciaccia;K. Candan;M. Sapino;C. Meghini;F. Sebastiani;U. Straccia;F. Nack;V. S. Subrahmanian;Maria Vanina Martinez;D. Reforgiato;T. Westerveld;M. Sebillo;G. Vitiello;Maria De Marsico;K. Voruganti;C. Parent;S. Spaccapietra;Christelle Vangenot;Esteban Zimányi;Prasan Roy;S. Sudarshan;E. Puppo;Peer Kröger;Matthias Renz;H. Schuldt;Solmaz Kolahi;A. Unwin;W. Cellary - 通讯作者:
W. Cellary
Cornelia Caragea的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Cornelia Caragea', 18)}}的其他基金
CHS: Small: Collaborative Research: Automating Relevance and Trust Detection in Social Media Data for Emergency Response
CHS:小型:协作研究:自动化社交媒体数据中的相关性和信任检测以进行紧急响应
- 批准号:
1903963 - 财政年份:2018
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
TWC: Small: Collaborative: Towards Privacy Preserving Online Image Sharing
TWC:小型:协作:实现隐私保护在线图像共享
- 批准号:
1903714 - 财政年份:2018
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: CiteSeerX: Toward Sustainable Support of Scholarly Big Data
CRI:CI-SUSTAIN:协作研究:CiteSeerX:迈向学术大数据的可持续支持
- 批准号:
1853919 - 财政年份:2018
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
CRI: CI-SUSTAIN: Collaborative Research: CiteSeerX: Toward Sustainable Support of Scholarly Big Data
CRI:CI-SUSTAIN:协作研究:CiteSeerX:迈向学术大数据的可持续支持
- 批准号:
1823292 - 财政年份:2018
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
BIGDATA: IA: Collaborative Research: Domain Adaptation Approaches for Classifying Crisis Related Data on Social Media
大数据:IA:协作研究:社交媒体上危机相关数据分类的领域适应方法
- 批准号:
1741353 - 财政年份:2018
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
CAREER: From Data to Knowledge: Extracting and Utilizing Concept Graphs in Online Environments
职业:从数据到知识:在线环境中提取和利用概念图
- 批准号:
1802358 - 财政年份:2017
- 资助金额:
$ 49.99万 - 项目类别:
Continuing Grant
III: Small: Collaborative Research: Keyphrase Extraction in Document Networks
III:小:协作研究:文档网络中的关键词提取
- 批准号:
1813571 - 财政年份:2017
- 资助金额:
$ 49.99万 - 项目类别:
Continuing Grant
BIGDATA: IA: Collaborative Research: Domain Adaptation Approaches for Classifying Crisis Related Data on Social Media
大数据:IA:协作研究:社交媒体上危机相关数据分类的领域适应方法
- 批准号:
1802284 - 财政年份:2017
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
TWC: Small: Collaborative: Towards Privacy Preserving Online Image Sharing
TWC:小型:协作:实现隐私保护在线图像共享
- 批准号:
1814255 - 财政年份:2017
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
CHS: Small: Collaborative Research: Automating Relevance and Trust Detection in Social Media Data for Emergency Response
CHS:小型:协作研究:自动化社交媒体数据中的相关性和信任检测以进行紧急响应
- 批准号:
1814271 - 财政年份:2017
- 资助金额:
$ 49.99万 - 项目类别:
Standard Grant
相似国自然基金
知识和数据协同驱动的车联网关键技术研究
- 批准号:62371309
- 批准年份:2023
- 资助金额:53 万元
- 项目类别:面上项目
数据与知识双驱动的抗体分子智能设计方法研究
- 批准号:62372204
- 批准年份:2023
- 资助金额:51 万元
- 项目类别:面上项目
知识与数据混合驱动的含缺陷点阵结构不确定性分析与优化方法研究
- 批准号:12302149
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
数据-知识融合驱动的半导体硅单晶质量监控与批次学习控制方法研究
- 批准号:62303376
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
基于浅水方程知识和数据融合驱动的城市内涝高效精细模拟及其可迁移性研究
- 批准号:52379010
- 批准年份:2023
- 资助金额:51 万元
- 项目类别:面上项目
相似海外基金
The Influence of Lifetime Occupational Experience on Cognitive Trajectories Among Mexican Older Adults
终生职业经历对墨西哥老年人认知轨迹的影响
- 批准号:
10748606 - 财政年份:2024
- 资助金额:
$ 49.99万 - 项目类别:
Climate Change Effects on Pregnancy via a Traditional Food
气候变化通过传统食物对怀孕的影响
- 批准号:
10822202 - 财政年份:2024
- 资助金额:
$ 49.99万 - 项目类别:
Executive functions in urban Hispanic/Latino youth: exposure to mixture of arsenic and pesticides during childhood
城市西班牙裔/拉丁裔青年的执行功能:童年时期接触砷和农药的混合物
- 批准号:
10751106 - 财政年份:2024
- 资助金额:
$ 49.99万 - 项目类别: