基于多任务学习的无参考超分辨图像质量评估-AET-电子技术应用

基于多任务学习的无参考超分辨图像质量评估

信息技术与网络安全 8期

刘锡泽1，李志龙2，何欣泽3，范红1

(1.东华大学信息科学与技术学院，上海201620； 2.OPPO研究院，上海200030；3.上海大学通信与信息工程学院，上海200444)

摘要： 图像超分辨率重建旨在从低分辨率图像中恢复其对应的高分辨率图像，是计算机视觉中的经典问题。为改进传统超分辨图像质量评价方法与人眼感知不一致的问题，提出一种基于多任务学习的超分辨图像质量评估网络。网络采用多任务学习的方式，分别学习图像的局部频率特征与质量分数，其中局部频率特征用来辅助网络进行图像质量分数的回归，提高分数预测的准确性和泛化能力。另外，在网络中加入协调注意力模块，进一步增强了模型的预测能力。实验结果表明，所提出的算法在QADS数据集上的SROCC、PLCC等指标优于目前先进的无参考超分辨图像质量评价方法。

关键词： 超分辨图像质量评估多任务学习局部频率特征协调注意力模块

中图分类号： TP391
文献标识码： A
DOI： 10.19358/j.issn.2096-5133.2021.08.010
引用格式：刘锡泽，李志龙，何欣泽，等. 基于多任务学习的无参考超分辨图像质量评估[J].信息技术与网络安全，2021，40(8)：60-64.

No reference super resolution image quality assessment based on multi-task learning

Liu Xize1，Li Zhilong2，He Xinze3，Fan Hong1

(1.College of Information Science and Technology，Donghua University，Shanghai 201620，China； 2.OPPO Research Institute，Shanghai 200030，China； 3.College of Communication and Information Engineering，Shanghai University，Shanghai 200444，China)

Abstract： Image super-resolution reconstruction is to recover the corresponding high resolution images from low resolution images, which is a classic problem in computer vision. In order to improve the inconsistency between traditional super-resolution image quality evaluation methods and visual perception, a super-resolution image quality evaluation network based on multi-task learning is proposed. The network adopts a multi-task learning method to learn the local frequency features and quality scores of the image respectively. The local frequency features are used to assist the network in the regression of the image quality scores to improve the accuracy and generalization ability of score prediction. In addition, adding coordinate attention blocks to the network to further enhance the predictive ability of the model. The experimental results show that the SROCC and PLCC of the proposed algorithm on the QADS dataset are better than the current advanced no-reference super-resolution image quality evaluation methods.

Key words : super-resolution image quality assessment；multi-task learning；local frequency features；coordinate

0 引言

单幅图像超分辨率重建(Single Image Super-Resolution Reconstruction，SISR)是图像复原的一种，其通过信号处理或者图像处理的方法，将低分辨率(Low-Resolution，LR)图像转化为高分辨率(High-Resolution，HR)图像[1]。目前，SISR被广泛应用在医学影像、遥感图像、视频监控等领域当中。近年来，许多SISR算法相继被提出，因此需要一种可靠的方式来衡量各种算法重建图像的质量好坏。

最可靠的图像质量评估方式是主观评分，但这种方式需要耗费大量的人力和时间，所以往往使用客观评价指标来对超分辨(Super-Resolution，SR)图像进行质量评估。最常用的图像客观评价指标是峰值信噪比(Peak Signal-to-Noise Ratio，PSNR)和结构相似度(Structural Similarity，SSIM)。但在SISR领域中，这两个指标与人眼感知的一致性较低[2]。因此研究者们提出了一系列基于人类视觉系统(Human Visual System，HVS)的图像质量评估算法，如信息保真度(Information Fidelity Criterion，IFC)[3]、特征相似度(Feature Similarity，FSIM)[4]等算法，在图像质量评估数据库中的性能超过了PSNR、SSIM等传统算法。

本文详细内容请下载：http://www.chinaaet.com/resource/share/2000003726

作者信息：

刘锡泽1，李志龙2，何欣泽3，范红1

(1.东华大学信息科学与技术学院，上海201620；

2.OPPO研究院，上海200030；3.上海大学通信与信息工程学院，上海200444)

原创声明：此内容为AET网站原创，未经授权禁止转载。

相关内容