有限像素下仿生视觉假体图像语义翻译研究

结题报告

项目介绍

AI项目解读

基本信息

批准号：
61806190
项目类别：
青年科学基金项目
资助金额：
26.0万
负责人：
夏轩
依托单位：
深圳市人工智能与机器人研究院
学科分类：
F0609.认知与神经科学启发的人工智‍能
结题年份：
2021
批准年份：
2018
项目状态：
已结题
起止时间：
2019-01-01 至2021-12-31

项目参与者：
李南；朱红梅；黄勤劲；邱松德；沈骏；
关键词：
人工视网膜视觉计算理论视觉修复客体识别图形背景分离

项目摘要

The pixels-limited bionic visual prosthesis can hardly reflect the "real" of the physical world, so the current bionic visual prosthesis can not satisfy the needs of human visual perception perfectly. Studies have shown that the difficulty of pixels-limited bionic visual prosthesis image processing lies in how to keep the image semantic information best after downsampling, and the traditional image processing technology can't handle it. Artificial intelligence technology is the best way to realize image semantic translation now, so in order to solve the problem, this topic propose the translation of biomimetic visual prosthesis image semantics by generative adversarial nets. It can achieve the pixel reconstruction of human, object and space structure, transforms the physical "real" to the sensory "real", so that humans can acquire conceptual knowledge easier. This topic will establish a conceptual mapping network by generative adversarial nets, achieve the semantic translation of pixels-limited bionic visual prosthesis image and get rid of the information loss problem caused by the downsampling, develop a control method of image semantic and improve the quality of training samples, establish a set of semantic consistency evaluation indicator and improve the quality of the generated samples, improve the intelligent level of bionic visual prosthesis and enhance the action ability and the social ability of blind patients.

有限像素的仿生视觉假体图像难以反映物理世界的“真实”，所以当前仿生视觉假体无法健全地满足人类视觉感知的需求。研究表明，有限像素下的仿生视觉假体图像处理的难点在于如何在降采样后仍能最佳地保留图像的语义信息，而传统的图像处理技术无法做到这一点。鉴于人工智能技术是当前实现图像语义翻译的最佳手段，所以为了解决该问题，本课题提出使用生成式对抗网络翻译仿生视觉假体图像的语义。该翻译过程将人、物、空间结构进行像素化的重建，把物理的“真实”转换为感觉的“真实”，从而让人类更方便地获取概念性知识。本课题预计将使用生成式对抗网络建立一个概念映射网络，实现有限像素下仿生视觉假体图像的语义翻译，解决降采样造成的信息丢失问题；开发一种图像语义控制方法，提高网络训练样本的质量；建立一套语义一致性评价指标，提高网络生成样本的质量；提升仿生视觉假体的智能化水平，增强失明患者的行动能力与社交能力。

结项摘要

为了提升仿生视觉假体的智能化水平，增强失明患者的行动能力与社交能力，本项目致力于对有限像素下仿生视觉假体图像的语义翻译进行研究。本研究产生了多项理论成果与技术成果：.1.我们揭示了在有限像素下，仿生视觉假体图像存在两种语义损失：空间语义损失和频谱语义损失。针对这两种损失，提出了语义一致性评价标准，及其相应的基于生成对抗网络的图像语义翻译训练策略，提高了生成样本的图像质量，并通过心理物理学实验进行了证明了我们的评价标准和训练策略的有效性.2.使用生成式对抗网络成功建立了概念映射网络，名为F2Pnet (face to Pixel networks)的人脸像素化网络。该网络能够在保持语义不变性和可理解性的前提下，将高分辨率的人脸转换为低分辨率的仿生视觉假体像素人脸，实现了有限像素下仿生视觉假体人脸图像的语义翻译.3.建立了第一个为仿生视觉假体应用设计的像素人脸数据库，命名为AIRS- PFD。它包含1278个像素大小为25×23的人脸图像，以及各种带有标签的特征和表情。在此基础上，我们通过心理物理实验建立了该数据库的人类基准，并将所有数据开放给世界各地的研究者.4.将概念映射网络在树莓派4B上进行了部署并达到了17.7fps的实时性要求.本研究产出期刊论文5篇（SCI 收录3篇，EI收录1篇，SCI投稿1篇），EI收录国际会议文章4 篇，提交申请专利4项。培养两名硕士生，一名已毕业，一名在读。.项目直接经费26万元，已使用248033.11元，结余11966.89元，各项支出与预算基本相符。项目结余经费将用于发表论文、著作所产生的费用和知识产权事务费用。

项目成果

期刊论文数量（4）

专著数量（0）

科研奖励数量（0）

会议论文数量（3）

专利数量（4）

Influence of common feature points distribution on uncertainty of 3D coordinate transformation parameter estimation

共同特征点分布对3D坐标变换参数估计不确定性的影响

DOI：
10.13695/j.cnki.12-1222/o3.2019.04.007
发表时间：
2019
期刊：
Journal of Chinese Inertial Technology
影响因子：
--
作者：
Zhu Chengguang;Zhao Jiankang;Long Haihui;Xia Xuan;Wang Hongyu
通讯作者：
Wang Hongyu

Stereovision-based initial pose estimation relative to non-cooperative space target

相对于非合作空间目标的基于立体视觉的初始姿态估计

DOI：
10.1049/iet-rsn.2019.0476
发表时间：
2020
期刊：
IET Radar Sonar and Navigation
影响因子：
1.7
作者：
Zhu Chengguang;Zhao Jiankang;Wang Hongyu;Long Haihui;Xia Xuan
通讯作者：
Xia Xuan

GAN-based anomaly detection: A review

基于 GAN 的异常检测：回顾

DOI：
10.1016/j.neucom.2021.12.093
发表时间：
--
期刊：
Neurocomputing
影响因子：
6
作者：
Xuan Xia;Xizhou Pan;Nan Li;Xing He;Lin Ma;Xiaoguang Zhang;Ning Ding
通讯作者：
Ning Ding

Self-Attention-Masking Semantic Decomposition and Segmentation for Facial Attribute Manipulation

用于面部属性操作的自注意力掩蔽语义分解和分割

DOI：
10.1109/access.2020.2974239
发表时间：
2020
期刊：
IEEE Access
影响因子：
3.9
作者：
Xia Xuan;Yu Fengqi;Li Nan;Qu Yansong;Zhang Jiajia;Zhu Chengguang
通讯作者：
Zhu Chengguang

数据更新时间：{{ journalArticles.updateTime }}

DOI：
{{ item.doi || "--"}}
发表时间：
{{ item.publish_year || "--" }}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor || "--"}}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

数据更新时间：{{ journalArticles.updateTime }}

作者：
{{ item.authors }}

数据更新时间：{{ monograph.updateTime }}

作者：
{{ item.authors }}

数据更新时间：{{ sciAawards.updateTime }}

作者：
{{ item.authors }}

数据更新时间：{{ conferencePapers.updateTime }}

作者：
{{ item.authors }}

数据更新时间：{{ patent.updateTime }}

其他文献

基于DSP的主动视觉系统

DOI：
--
发表时间：
2012
期刊：
机器人
影响因子：
--
作者：
夏轩;刘华平;许伟明;孙富春
通讯作者：
孙富春

基于MapReduce的轨迹压缩并行化方法

DOI：
--
发表时间：
2017
期刊：
计算机应用
影响因子：
--
作者：
吴家皋;夏轩;刘林峰
通讯作者：
刘林峰

基于DSP的主动视觉系统

DOI：
--
发表时间：
2012
期刊：
机器人
影响因子：
--
作者：
夏轩;刘华平;许伟明;孙富春;XIA Xuan~(1;2);LIU Huaping~2;XU Weiming~1;SUN Fuch;2.Tsinghua National Laboratory for Information Sci
通讯作者：
2.Tsinghua National Laboratory for Information Sci

其他文献

DOI：
{{ item.doi || "--" }}
发表时间：
{{ item.publish_year || "--"}}
期刊：
{{ item.journal_name }}
影响因子：
{{ item.factor || "--" }}
作者：
{{ item.authors }}
通讯作者：
{{ item.author }}

内容获取失败，请点击重试

重试

联系客服

开始分析

查看分析示例

此项目为已结题，我已根据课题信息分析并撰写以下内容，帮您拓宽课题思路：

会员权益说明：

有限像素下仿生视觉假体图像语义翻译研究

基本信息

项目摘要

结项摘要

项目成果

其他文献

其他文献

AI项目摘要

AI项目思路

AI技术路线图

相似国自然基金

相似海外基金

AI项目解读示例

AI项目摘要：

AI项目思路：

AI技术路线图