Speech privacy protection by high-quality, invertible, and extendable speech anonymization and de-anonymization

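Basic information

  • Grant number:
    21K17775
  • Principal investigator:
  • Amount:
    $29,100
  • Host institution:
  • Host institution country:
    Japan
  • Project category:
    Grant-in-Aid for Early-Career Scientists
  • Fiscal year:
    2021
  • Funding country:
    Japan
  • Project period:
    2021-04-01 to 2024-03-31
  • Project status:
    Completed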

Project abstract

The second year's work consists of three parts.
Part 1) Building on the previous year's work, we organized the second VoicePrivacy Challenge together with other universities. We defined new evaluation frameworks and conducted thorough evaluations. Among many findings, the new baseline, which was a research outcome of the previous year, outperformed the legacy baseline. Some submissions in turn outperformed the new baseline, which indicates that the VoicePrivacy Challenge has advanced the research field.
Part 2) Using the VoicePrivacy Challenge framework, we analyzed in depth the common approaches to generating an anonymized speaker identity representation (i.e., a pseudo-speaker embedding). Through a large-scale experiment, we identified effective strategies for choosing and assigning the pseudo-speaker, including random gender selection and utterance-level anonymization. We also found that a simple percentile-based pitch conversion reduced the risk posed by the strongest (Semi-Informed) attacker. These findings were published in a top IEEE journal.
Part 3) Following the research plan, we extended the language-independent speaker anonymization framework. Although the framework is language-independent, its performance degrades on unseen languages. We found that training the waveform generator on multilingual data helped. We also proposed a correlation-alignment-based strategy to alleviate channel mismatch. Additionally, we extended the framework to hide gender information. Both works were published at top conferences.
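As a rough illustration of what a percentile-based pitch conversion could look like, the sketch below maps each voiced F0 value to the target value at the same percentile (quantile mapping). This is an assumption about the general idea, not the project's published implementation; the function name `percentile_pitch_conversion`, the convention that unvoiced frames are marked by 0, and the synthetic F0 values are all hypothetical.

```python
import numpy as np


def percentile_pitch_conversion(f0, src_ref_f0, tgt_ref_f0, n_points=101):
    """Map voiced F0 values onto a target F0 distribution by matching percentiles.

    f0          : F0 contour (Hz) of the utterance to convert; 0 marks unvoiced
                  frames (an assumed convention).
    src_ref_f0  : F0 values (Hz) describing the source speaker's distribution.
    tgt_ref_f0  : F0 values (Hz) describing the (pseudo) target distribution.
    n_points    : number of percentile anchors between 0 and 100.
    """
    f0 = np.asarray(f0, dtype=float)
    src_ref_f0 = np.asarray(src_ref_f0, dtype=float)
    tgt_ref_f0 = np.asarray(tgt_ref_f0, dtype=float)

    # Percentile anchors are estimated from voiced frames only.
    q = np.linspace(0.0, 100.0, n_points)
    src_q = np.percentile(src_ref_f0[src_ref_f0 > 0], q)
    tgt_q = np.percentile(tgt_ref_f0[tgt_ref_f0 > 0], q)

    # Piecewise-linear mapping from source percentiles to target percentiles.
    # Values outside the source range are clamped to the target extremes.
    converted = np.zeros_like(f0)
    voiced = f0 > 0
    converted[voiced] = np.interp(f0[voiced], src_q, tgt_q)
    return converted


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic log-normal F0 values standing in for real pitch tracks.
    src_ref = np.exp(rng.normal(np.log(120.0), 0.15, size=2000))
    tgt_ref = np.exp(rng.normal(np.log(210.0), 0.20, size=2000))
    utterance = np.concatenate([np.zeros(5), src_ref[:20], np.zeros(5)])
    print(percentile_pitch_conversion(utterance, src_ref, tgt_ref))
```

Because the mapping is monotonic, the shape of the intonation contour is preserved while its range and median move toward the target distribution; unvoiced frames stay at 0.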

Project outcomes

Journal papers (7)
Monographs (0)
Research awards (0)
Conference papers (0)
Patents (0)
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation
  • DOI:
    10.21437/odyssey.2022-16
  • Publication date:
    2022-02
  • Journal:
  • Impact factor:
    0
  • Authors:
    Hemlata Tak; M. Todisco; Xin Wang; Jee-weon Jung; J. Yamagishi; N. Evans
  • Corresponding author:
    Hemlata Tak; M. Todisco; Xin Wang; Jee-weon Jung; J. Yamagishi; N. Evans
Benchmarking and challenges in security and privacy for voice biometrics
  • DOI:
  • Publication date:
    2021
  • Journal:
  • Impact factor:
    0
  • Authors:
    Jean-Francois Bonastre; Hector Delgado; Nicholas Evans; Tomi Kinnunen; Kong Aik Lee; Xuechen Liu; Andreas Nautsch; Paul-Gauthier Noé; Jose Patino; Md Sahidullah; Brij Mohan Lal Srivastava; Massimiliano Todisco; Natalia Tomashenko; Emmanuel Vince
  • Corresponding author:
    Emmanuel Vince
Tutorial on speaker anonymization (software)
  • DOI:
  • Publication date:
  • Journal:
  • Impact factor:
    0
  • Authors:
  • Corresponding author:
Language-independent speaker anonymization system
  • DOI:
  • Publication date:
  • Journal:
  • Impact factor:
    0
  • Authors:
  • Corresponding author:
Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline

Other publications by Wang Xin

Novel insights into redox system and the mechanism of redox regulation
  • DOI:
    10.1007/s11033-016-4022-y
  • Publication date:
    2016-06
  • Journal:
  • Impact factor:
    2.8
  • Authors:
    Wang Xin; Hai Chunxu
  • Corresponding author:
    Hai Chunxu
Dual-Modality Imaging Probes with High Magnetic Relaxivity and Near-Infrared Fluorescence Based Highly Aminated Mesoporous Silica Nanoparticles
  • DOI:
    10.1155/2016/6502127
  • Publication date:
    2016-05
  • Journal:
  • Impact factor:
    0
  • Authors:
    Zhu Fei-Peng; Chen Guo-Tao; Wang Shou-Ju; Liu Ying; Tang Yu-Xia; Tian Ying; Wang Jian-Dong; Wang Chun-Yan; Wang Xin; Sun Jing; Teng Zhao-Gang (滕兆刚); Lu Guang-Ming
  • Corresponding author:
    Lu Guang-Ming
Numerical study on the breakup mechanisms and characteristics of liquid sheets
Transformation of arsenic during realgar tailings stabilization using ferrous sulfate in a pilot-scale treatment
  • DOI:
    10.1016/j.scitotenv.2019.02.289
  • Publication date:
    2019
  • Journal:
  • Impact factor:
    9.8
  • Authors:
    Wang Xin; Zhang He; Wang Linling; Chen Jing; Xu Shiqi; Hou Huijie; Shi Yao; Zhang Jingdong; Ma Miao; Tsang Daniel C W; Crittenden John C
  • Corresponding author:
    Crittenden John C
Thermodynamic and transport properties of spiro-(1,1')-bipyrrolidinium tetrafluoroborate and acetonitrile mixtures: A molecular dynamics study
  • DOI:
    10.1088/1674-1056/25/6/066102
  • Publication date:
    2016
  • Journal:
  • Impact factor:
    1.7
  • Authors:
    Zhang Qingyin; Xie Peng; Wang Xin; Yu Xuewen; Shi Zhiqiang; Zhao Shihua
  • Corresponding author:
    Zhao Shihua

Other grants by Wang Xin

One model for all sounds: fast and high-quality neural source-filter model for speech and non-speech waveform modeling
  • Grant number:
    19K24371
  • Fiscal year:
    2019
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for Research Activity Start-up
Development of a One-Dimensional Shear Beam Model for Buildings Based on Nonlinear Wave Propagation Theory and Its Application to Damage Prediction
  • Grant number:
    17H04732
  • Fiscal year:
    2018
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for Young Scientists (A)
Development and applicability examination of a method to evaluate story-by-story damage of super high-rise buildings based on ambient noise measurement
  • Grant number:
    15K20872
  • Fiscal year:
    2015
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for Young Scientists (B)

Similar overseas grants

Speech security on human-computer interaction
  • Grant number:
    22K21304
  • Fiscal year:
    2022
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for Research Activity Start-up
Language-independent speaker anonymization with multiple privacy-related attributes
  • Grant number:
    22K21319
  • Fiscal year:
    2022
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for Research Activity Start-up
Study on Audio Information Hiding Based on Human Auditory Perception with Phase Modulation
  • Grant number:
    20J20580
  • Fiscal year:
    2020
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for JSPS Fellows
A comprehensive speaker anonymization method for speech data privacy protection
  • Grant number:
    20K19832
  • Fiscal year:
    2020
  • Funding amount:
    $29,100
  • Project category:
    Grant-in-Aid for Early-Career Scientists