Speech privacy protection by high-quality, invertible, and extendable speech anonymization and de-anonymization
通过高质量、可逆、可扩展的语音匿名化和去匿名化保护语音隐私
基本信息
- 批准号:21K17775
- 负责人:
- 金额:$ 2.91万
- 依托单位:
- 依托单位国家:日本
- 项目类别:Grant-in-Aid for Early-Career Scientists
- 财政年份:2021
- 资助国家:日本
- 起止时间:2021-04-01 至 2024-03-31
- 项目状态:已结题
- 来源:
- 关键词:
项目摘要
The second year's work consists of three parts: Part 1) Based on the previous year's work, the second VoicePrivacy challenge was organized by us and other universities. We defined new evaluation frameworks and conducted solid evaluations. In addition to many findings, we found that the new baseline, which was the research outcome of the previous year, outperformed the legacy baseline. We also saw submissions that outperformed the new baseline, which indicates the advancement of the research field brought by the VoicePrivacy challenge.Part 2) Based on the framework of the voice privacy challenge, we did a deep analysis of the common approaches to generating anonymized speaker identity representation (i.e., pseudo speaker embedding). Through a large-scale experiment, we identified good strategies to choose and assign the pseudo-speaker, including random gender selection and utterance-level anonymization. We also found that a simple percentile-based pitch conversion reduced the risk against the strongest (Semi-Informed) attacker. These findings were published in a top IEEE journal.Part 3) We followed the research plan and extended the language-independent speaker anonymization framework. Although the framework is language-independent, its performance degrades when processing unseen languages. We found that using multilingual training data for the waveform generator was helpful. We also proposed a correlation-alignment-based strategy to alleviate channel mismatch. Additionally, we extended the framework to hide gender information. Both works were published in top conferences.
第二年的工作由三个部分组成: 第1部分)在前一年工作的基础上,我们和其他大学组织了第二届VoicePrivacy挑战赛。我们定义了新的评估框架并进行了扎实的评估。除了许多发现之外,我们还发现新基线(即前一年的研究成果)优于旧基线。我们还看到了优于新基线的提交,这表明了语音隐私挑战带来的研究领域的进步。第2部分)基于语音隐私挑战的框架,我们对生成匿名说话人的常见方法进行了深入分析身份表示(即伪说话者嵌入)。通过大规模实验,我们确定了选择和分配伪说话者的良好策略,包括随机性别选择和话语级匿名化。我们还发现,简单的基于百分位数的音高转换降低了针对最强(半知情)攻击者的风险。这些研究结果发表在IEEE顶级期刊上。第3部分)我们按照研究计划扩展了与语言无关的说话者匿名化框架。尽管该框架与语言无关,但在处理不可见的语言时其性能会下降。我们发现对波形发生器使用多语言训练数据很有帮助。我们还提出了一种基于相关对齐的策略来缓解通道不匹配。此外,我们扩展了框架以隐藏性别信息。两部作品均在顶级会议上发表。
项目成果
期刊论文数量(7)
专著数量(0)
科研奖励数量(0)
会议论文数量(0)
专利数量(0)
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
- DOI:10.21437/odyssey.2022-16
- 发表时间:2022-02
- 期刊:
- 影响因子:0
- 作者:Hemlata Tak;M. Todisco;Xin Wang;Jee-weon Jung;J. Yamagishi;N. Evans
- 通讯作者:Hemlata Tak;M. Todisco;Xin Wang;Jee-weon Jung;J. Yamagishi;N. Evans
Benchmarking and challenges in security and privacy for voice biometrics
语音生物识别安全和隐私方面的基准测试和挑战
- DOI:
- 发表时间:2021
- 期刊:
- 影响因子:0
- 作者:Jean-Francois Bonastre; Hector Delgado; Nicholas Evans; Tomi Kinnunen; Kong Aik Lee; Xuechen Liu;Andreas Nautsch; Paul-Gauthier NoE;Jose Patino; Md Sahidullah; Brij Mohan Lal Srivastava;Massimiliano Todisco; Natalia Tomashenko; Emmanuel Vince
- 通讯作者:Emmanuel Vince
Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline
- DOI:10.1109/icassp49357.2023.10096749
- 发表时间:2022-11
- 期刊:
- 影响因子:0
- 作者:Paul-Gauthier Noé;Xiaoxiao Miao;Xin Wang;J. Yamagishi;J. Bonastre;D. Matrouf
- 通讯作者:Paul-Gauthier Noé;Xiaoxiao Miao;Xin Wang;J. Yamagishi;J. Bonastre;D. Matrouf
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
数据更新时间:{{ journalArticles.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ monograph.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ sciAawards.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ conferencePapers.updateTime }}
{{ item.title }}
- 作者:
{{ item.author }}
数据更新时间:{{ patent.updateTime }}
Wang Xin其他文献
Dual-Modality Imaging Probes with High Magnetic Relaxivity and Near-Infrared Fluorescence Based Highly Aminated Mesoporous Silica Nanoparticles
基于高氨基化介孔二氧化硅纳米颗粒的具有高磁弛豫率和近红外荧光的双模态成像探针
- DOI:
10.1155/2016/6502127 - 发表时间:
2016-05 - 期刊:
- 影响因子:0
- 作者:
Zhu Fei-Peng;Chen Guo-Tao;Wang Shou-Ju;Liu Ying;Tang Yu-Xia;Tian Ying;Wang Jian-Dong;Wang Chun-Yan;Wang Xin;Sun Jing;Teng Zhao-Gang(滕兆刚);Lu Guang-Ming - 通讯作者:
Lu Guang-Ming
Construction of a 2D Graphene-Like MoS2/C3N4 Heterojunction with Enhanced Visible-Light Photocatalytic Activity and Photoelectrochemical Activity
具有增强可见光光催化活性和光电化学活性的二维类石墨烯MoS2/C3N4异质结的构建
- DOI:
10.1002/chem.201503660 - 发表时间:
2016 - 期刊:
- 影响因子:0
- 作者:
Yan Jia;Chen Zhigang;Ji Haiyan;Liu Zheng;Wang Xin;Xu Yuanguo;She Xiaojie;Huang Liying;Xu Li;Xu Hui;Li Huaming - 通讯作者:
Li Huaming
Novel insights into redox system and the mechanism of redox regulation
对氧化还原系统和氧化还原调节机制的新见解
- DOI:
10.1007/s11033-016-4022-y - 发表时间:
2016-06 - 期刊:
- 影响因子:2.8
- 作者:
Wang Xin;Hai Chunxu - 通讯作者:
Hai Chunxu
Assessment of the SMAP Soil Emission Model and Soil Moisture Retrieval Algorithms for a Tibetan Desert Ecosystem
青藏高原沙漠生态系统 SMAP 土壤排放模型和土壤水分反演算法的评估
- DOI:
10.1109/tgrs.2018.2811318 - 发表时间:
2018-03 - 期刊:
- 影响因子:8.2
- 作者:
Zheng Donghai;van der Velde Rogier;Wen Jun;Wang Xin;Ferrazzoli Paolo;Schwank Mike;Colli;er Andreas;Bindlish Rajat;Su Zhongbo - 通讯作者:
Su Zhongbo
Illuminating the Function of the Hydroxyl Radical in the Brains of Mice with Depression Phenotypes by Two-Photon Fluorescence Imaging
通过双光子荧光成像阐明抑郁表型小鼠大脑中羟基自由基的功能
- DOI:
10.1002/anie.201901318 - 发表时间:
2019 - 期刊:
- 影响因子:0
- 作者:
Wang Xin;Li Ping;Ding Qi;Wu Chuanchen;Zhang Wen;Tang Bo - 通讯作者:
Tang Bo
Wang Xin的其他文献
{{
item.title }}
{{ item.translation_title }}
- DOI:
{{ item.doi }} - 发表时间:
{{ item.publish_year }} - 期刊:
- 影响因子:{{ item.factor }}
- 作者:
{{ item.authors }} - 通讯作者:
{{ item.author }}
{{ truncateString('Wang Xin', 18)}}的其他基金
One model for all sounds: fast and high-quality neural source-filter model for speech and non-speech waveform modeling
适用于所有声音的一种模型:用于语音和非语音波形建模的快速且高质量的神经源滤波器模型
- 批准号:
19K24371 - 财政年份:2019
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for Research Activity Start-up
Development of a One-Dimensional Shear Beam Model for Buildings Based on Nonlinear Wave Propagation Theory and Its Application to Damage Prediction
基于非线性波传播理论的建筑物一维剪力梁模型的建立及其在损伤预测中的应用
- 批准号:
17H04732 - 财政年份:2018
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for Young Scientists (A)
Development and applicability examination of a method to evaluate story-by-story damage of super high-rise buildings based on ambient noise measurement
基于环境噪声测量的超高层建筑逐层损伤评估方法开发及适用性检验
- 批准号:
15K20872 - 财政年份:2015
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for Young Scientists (B)
相似海外基金
Speech security on human-computer interaction
人机交互的语音安全
- 批准号:
22K21304 - 财政年份:2022
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for Research Activity Start-up
Language-independent speaker anonymization with multiple privacy-related attributes
具有多个隐私相关属性的独立于语言的说话人匿名化
- 批准号:
22K21319 - 财政年份:2022
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for Research Activity Start-up
Study on Audio Information Hiding Based on Human Auditory Perception with Phase Modulation
基于人类听觉感知的相位调制音频信息隐藏研究
- 批准号:
20J20580 - 财政年份:2020
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for JSPS Fellows
A comprehensive speaker anonymization method for speech data privacy protection
一种用于语音数据隐私保护的综合说话人匿名化方法
- 批准号:
20K19832 - 财政年份:2020
- 资助金额:
$ 2.91万 - 项目类别:
Grant-in-Aid for Early-Career Scientists