With the development of the mobile Internet and communication technologies, multimedia communication has become a major need of the national information industry and is widely used in video conferencing, live streaming, telemedicine, remote monitoring, and distance education. However, high-volume multimedia communication services are constrained by network bandwidth. This paper introduces media computing into the communication system, establishes a new research paradigm for multimedia communication, and develops new multimedia encoding and transmission methods aimed at improving the quality of experience (QoE), thereby fundamentally easing the demand on network bandwidth. QoE is the subjective evaluation that an information receiver gives, in light of their own expectations, to the relevant performance of the objective information carrier; it is a criterion for communication quality distinct from quality of service (QoS). This paper introduces QoE evaluation criteria, which fall into user-based methods and objective-parameter-based methods. User-based methods analyze and infer the user's perception from subjective scores or from measurements of relevant physiological and psychological indicators; objective-parameter-based methods evaluate QoE by applying a subjective correction to objective service indicators. This paper reviews multimedia encoding methods, chiefly waveform-based and content-based encoding. The former encodes any video signal effectively without analyzing its content, as in a series of video coding standards; the latter identifies objects and related regions in a video sequence and encodes them. This paper also describes new video transmission methods for the 5G + AI (artificial intelligence) era, covering video services such as multi-view video coding, 4K and 8K video coding, 3D stereoscopic video, point clouds, light fields, AR (augmented reality), and VR (virtual reality).