FreeTxt: supporting bilingual free-text survey and questionnaire data analysis
FreeTxt:支持双语自由文本调查和问卷数据分析
基本信息
- 批准号:AH/W004844/1
- 负责人:
- 金额:$ 10.28万
- 依托单位:
- 依托单位国家:英国
- 项目类别:Research Grant
- 财政年份:2022
- 资助国家:英国
- 起止时间:2022 至 无数据
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
In a modern consumer-led culture, obtaining and responding to qualitative feedback (i.e. often free-text comments/written feedback) is embedded in the professional practice of many walks of life.Surveys are used, for example, in staff development, professional training, product design and testing, and in various forms of service provision across the public and private sector. Surveys and questionnaires often produce a combination of quantitative and qualitative forms of data. Quantitative forms, such as rating scales (e.g. likert scale responses), multiple choice questions and rank order questions can be numerated (i.e. quantified) with ease, the analysis of which can be conducted in a systematic and often automated way. By contrast, more qualitative questions, which prompt open ended, free-text comment responses, or, in the context of the tourism and heritage sector, written feedback from exhibitions, events and/or historical sites on social media channels or websites including Trip Advisor and Trust Pilot, pose a more difficult challenge for the analyst. Tackling written, text-based feedback often requires a more labour-intensive and manual approach to analysis. Compounding this challenge is where feedback is presented in both English and Welsh, as is often the case in Wales, with Wales representing the largest bilingual community in the UK. The successful analysis of bilingual data relies on the workforce having the appropriate linguistic expertise to process it.While a range of sophisticated digital tools for the analysis of text-based data are available, particularly for researchers working in academia, in marketing and public relations contexts etc., many of the digital resources used are not necessarily affordable, quick and easy to use, and/or accessible to non-expert users. Specifically, these tools currently do not fully support the task of systematically processing free-text responses in Welsh.This project aims to bridge this gap by building the novel 'FreeTxt' toolkit which is designed to support the analysis and visualisation of multiple forms of open-ended, free-text data in both English and Welsh. FreeTxt will draw on existing open-source bilingual corpus-based utilities and methodologies, repackaging these and taking them in a new direction so that they are relevant to new audiences/user-groups. We will work closely with project partners Cadw and National Trust Wales to co-design, co-construct and test FreeTxt to ensure that the resource is fit-for-purpose and fairly and consistently meets the needs of Welsh and English-language responses.Existing tools that we will draw on include those developed as part of the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - The National Corpus of Contemporary Welsh). This includes CorCenCC's semantic (i.e. meaning based categorisations of individual words and phrases) and part of speech (POS - i.e. grammar-based categorisations of individual words and phrases - e.g. nouns, verbs) taggers and tagsets for Welsh language, and corpus functionalities for the querying of language, amongst others. These tools will be integrated into a user-friendly, online interface that users can paste/upload their texts into, to search for patterns of meaning that emerge in survey responses and feedback; to see which words are most often used in relation to a given theme, place, topic; to understand what visitors particularly enjoyed about a service or attraction, and what they think could be improved.The final version of the tool will be made freely-available and will be adaptable in terms of who can use it and when. It will contain generic analysis features that enable it to be used by any public and/or professional company and institution dealing with varying datasets of qualitative survey data and will be of relevance to academic researchers analysing and visualising survey data. The accessibility and usability of this tool will help provide a direct route to potential impact.
在现代消费者主导的文化中,获取和回应定性反馈(即通常是自由文本评论/书面反馈)已融入各行各业的专业实践中。例如,调查可用于员工发展、专业培训、产品设计和测试,以及公共和私营部门各种形式的服务提供。调查和问卷通常会产生定量和定性形式的数据组合。定量形式,例如评级量表(例如李克特量表反应)、多项选择题和排名顺序问题可以轻松地进行计数(即量化),并且可以以系统且通常自动化的方式进行分析。相比之下,更多的定性问题会引发开放式、自由文本评论回复,或者在旅游和遗产部门的背景下,来自社交媒体渠道或网站(包括 Trip Advisor)上的展览、活动和/或历史遗址的书面反馈和 Trust Pilot,给分析师带来了更艰巨的挑战。处理基于文本的书面反馈通常需要更加劳动密集型的手动分析方法。使这一挑战更加复杂的是,反馈以英语和威尔士语提供,就像威尔士的情况一样,威尔士代表着英国最大的双语社区。双语数据的成功分析依赖于具有适当语言专业知识的劳动力来处理它。虽然可以使用一系列复杂的数字工具来分析基于文本的数据,特别是对于学术界、营销和公共关系领域的研究人员来说等等,所使用的许多数字资源不一定是负担得起的、快速且易于使用和/或非专家用户可以访问的。具体来说,这些工具目前并不完全支持系统处理威尔士语自由文本响应的任务。该项目旨在通过构建新颖的“FreeTxt”工具包来弥补这一差距,该工具包旨在支持多种形式的开放文本的分析和可视化。 - 英语和威尔士语结尾的自由文本数据。 FreeTxt 将利用现有的基于开源双语语料库的实用程序和方法,重新包装它们并将它们引向新的方向,以便它们与新的受众/用户组相关。我们将与项目合作伙伴 Cadw 和 National Trust Wales 密切合作,共同设计、共同构建和测试 FreeTxt,以确保该资源适合用途,并公平、一致地满足威尔士语和英语回答的需求。我们将利用的工具包括作为 CorCenCC 项目一部分开发的工具(Corpws Cenedlaethol Cymraeg Cyfoes - 当代威尔士语国家语料库)。这包括 CorCenCC 的语义(即单个单词和短语的基于含义的分类)和词性(POS - 即单个单词和短语的基于语法的分类 - 例如名词、动词)威尔士语的标记器和标记集,以及威尔士语的语料库功能语言查询等。这些工具将集成到一个用户友好的在线界面中,用户可以将文本粘贴/上传到其中,以搜索调查回复和反馈中出现的含义模式;查看与给定主题、地点、话题相关的最常用的单词;了解游客特别喜欢某个服务或景点的哪些方面,以及他们认为可以改进的地方。该工具的最终版本将免费提供,并将根据谁可以使用它以及何时使用进行调整。它将包含通用分析功能,使其能够被处理不同定性调查数据数据集的任何公共和/或专业公司和机构使用,并将与分析和可视化调查数据的学术研究人员相关。该工具的可访问性和可用性将有助于提供实现潜在影响的直接途径。
项目成果
期刊论文数量(2)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
FreeTxt: a corpus-based qualitative text analysis toolkit (conference presentation)
FreeTxt:基于语料库的定性文本分析工具包(会议演示)
- DOI:
- 发表时间:2022
- 期刊:
- 影响因子:0
- 作者:Knight D
- 通讯作者:Knight D
FreeTxt: A corpus-based qualitative analysis toolkit
FreeTxt:基于语料库的定性分析工具包
- DOI:
- 发表时间:2023
- 期刊:
- 影响因子:0
- 作者:Knight D
- 通讯作者:Knight D
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Dawn Knight其他文献
Multimodal Corpora
多模态语料库
- DOI:
10.1007/978-3-030-46216-1_16 - 发表时间:
2020 - 期刊:
- 影响因子:0
- 作者:
Dawn Knight;S. Adolphs - 通讯作者:
S. Adolphs
Building a spoken corpus: what are the basics?
构建口语语料库:基础知识是什么?
- DOI:
10.4324/9780203856949-12 - 发表时间:
2010 - 期刊:
- 影响因子:0
- 作者:
S. Adolphs;Dawn Knight - 通讯作者:
Dawn Knight
PriPA: A Tool for Privacy-Preserving Analytics of Linguistic Data
PriPA:语言数据隐私保护分析工具
- DOI:
- 发表时间:
2022 - 期刊:
- 影响因子:0
- 作者:
Jérémie Clos;Emma Mcclaughlin;Pepita Barnard;Elena Nichele;Dawn Knight;Derek McAuley;S. Adolphs - 通讯作者:
S. Adolphs
I’m having a Spring Clear Out: A Corpus-based Analysis of e-transactional Discourse
我正在进行春季清理:基于语料库的电子交易话语分析
- DOI:
10.1093/applin/amv019 - 发表时间:
2017 - 期刊:
- 影响因子:3.6
- 作者:
Dawn Knight;Steve Walsh;S. Papagiannidis - 通讯作者:
S. Papagiannidis
1.3 Designing a National Corpus in a Minoritised Language
1.3 设计小语种国家语料库
- DOI:
- 发表时间:
2021 - 期刊:
- 影响因子:0
- 作者:
Dawn Knight;Steve Morris;T. Fitzpatrick - 通讯作者:
T. Fitzpatrick
Dawn Knight的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Dawn Knight', 18)}}的其他基金
Interactional variation online: harnessing emerging technologies in the digital humanities to analyse online discourse in different workplace contexts
在线互动变化:利用数字人文中的新兴技术来分析不同工作场所环境中的在线话语
- 批准号:
AH/W001608/1 - 财政年份:2021
- 资助金额:
$ 10.28万 - 项目类别:
Research Grant
Corpws Cenedlaethol Cymraeg Cyfoes (The National Corpus of Contemporary Welsh): A community driven approach to linguistic corpus construction
Corpws Cenedlaethol Cymraeg Cyfoes(当代威尔士语国家语料库):社区驱动的语言语料库建设方法
- 批准号:
ES/M011348/1 - 财政年份:2016
- 资助金额:
$ 10.28万 - 项目类别:
Research Grant
相似国自然基金
基于HPT轴“GnRH1-FSH-SCs”通路探讨龟鹿补肾丸“补肾”提高睾丸支持细胞数量和屏障功能改善雄性生殖能力的机制研究
- 批准号:82304904
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
支持深度伪造检测的精细化高效训练集生成方法研究
- 批准号:62372423
- 批准年份:2023
- 资助金额:51 万元
- 项目类别:面上项目
农业支持保护补贴对种植大户非粮化行为的影响机理与政策优化研究
- 批准号:72304270
- 批准年份:2023
- 资助金额:30 万元
- 项目类别:青年科学基金项目
牧区婚俗改革及其福利效应研究:基于代际支持视角
- 批准号:72363027
- 批准年份:2023
- 资助金额:28 万元
- 项目类别:地区科学基金项目
支持细胞Mettl16功能异常在睾丸发育障碍中的作用与机制研究
- 批准号:82371606
- 批准年份:2023
- 资助金额:50 万元
- 项目类别:面上项目
相似海外基金
The role and effects of Bilingual Learning Assistants in supporting multilingual learners in schools
双语学习助理在支持学校多语言学习者方面的作用和效果
- 批准号:
2737845 - 财政年份:2022
- 资助金额:
$ 10.28万 - 项目类别:
Studentship
Preparing and Supporting Bilingual STEM Teachers in Puerto Rico
为波多黎各的双语 STEM 教师提供准备和支持
- 批准号:
1950139 - 财政年份:2020
- 资助金额:
$ 10.28万 - 项目类别:
Continuing Grant
Mother Tongue and Inherited Language to Nurture Language that Connect Parents and Children: Supporting Academic Learning and Publishing Bilingual Creative Picture Books
母语与传承语言,培育连接亲子的语言:支持学术学习,出版双语创意绘本
- 批准号:
19K13245 - 财政年份:2019
- 资助金额:
$ 10.28万 - 项目类别:
Grant-in-Aid for Early-Career Scientists
Ecosystem for credentialing and supporting a bilingual provider workforce
用于认证和支持双语提供商劳动力的生态系统
- 批准号:
9201815 - 财政年份:2016
- 资助金额:
$ 10.28万 - 项目类别:
Ecosystem for credentialing and supporting a bilingual provider workforce
用于认证和支持双语提供商劳动力的生态系统
- 批准号:
9367539 - 财政年份:2016
- 资助金额:
$ 10.28万 - 项目类别: