r"""PIQE metric implementation.

Paper:
    N. Venkatanath, D. Praneeth, Bh. M. Chandrasekhar, S. S. Channappayya, and S. S. Medasani.
    "Blind Image Quality Evaluation Using Perception Based Features",
    In Proceedings of the 21st National Conference on Communications (NCC).
    Piscataway, NJ: IEEE, 2015.

References:
    - Matlab: https://www.mathworks.com/help/images/ref/piqe.html
    - Python: https://github.com/michael-rutherford/pypiqe

This PyTorch implementation by: Chaofeng Chen (https://github.com/chaofengc)
"""

import torch

from pyiqa.utils.color_util import to_y_channel
from pyiqa.matlab_utils import symm_pad
from pyiqa.archs.func_util import normalize_img_with_guass
from pyiqa.utils.registry import ARCH_REGISTRY


def piqe(
    img: torch.Tensor,
    block_size: int = 16,
    activity_threshold: float = 0.1,
    block_impaired_threshold: float = 0.1,
    window_size: int = 6,
) -> tuple:
    """Calculate the Perception-based Image Quality Evaluator (PIQE) score of an input image.

    Args:
        img (torch.Tensor): Input image tensor of shape (B, C, H, W).
        block_size (int, optional): Size of the blocks used for processing. Defaults to 16.
        activity_threshold (float, optional): Variance threshold above which a block is considered spatially active. Defaults to 0.1.
        block_impaired_threshold (float, optional): Threshold below which an edge-segment standard deviation marks a block as impaired. Defaults to 0.1.
        window_size (int, optional): Size of the window used for edge-segment analysis. Defaults to 6.

    Returns:
        tuple: (score, noticeable_artifacts_mask, noise_mask, activity_mask), where score has shape (B,)
        and lies roughly in [0, 100]; lower scores indicate better perceptual quality.
    """
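    # Processing pipeline (following the MATLAB / pypiqe references):
    #   1. Convert an RGB input to its luminance channel.
    #   2. Rescale to [0, 255] and symmetrically pad to a multiple of block_size.
    #   3. Apply Gaussian-weighted local mean/std (MSCN-like) normalization.
    #   4. Classify each spatially active block as artifact-impaired and/or noisy.
    #   5. Pool the per-block distortion scores into one quality score per image.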

    # RGB to gray conversion
    if img.shape[1] == 3:
        img = to_y_channel(img, out_data_range=1, color_space='yiq')

    # Scale the input image to the range [0, 255], following the reference implementation
    img = torch.round(255 * (img / torch.max(img.flatten(1), dim=-1)[0].reshape(img.shape[0], 1, 1, 1)))

    # Symmetric pad so that the image size is divisible by block_size
    bsz, _, height, width = img.shape
    col_pad = (block_size - width % block_size) % block_size
    row_pad = (block_size - height % block_size) % block_size
    img = symm_pad(img, (0, col_pad, 0, row_pad))

    # Normalize the image to zero mean and ~unit std using a circularly-symmetric
    # Gaussian weighting function sampled out to 3 standard deviations
    img_normalized = normalize_img_with_guass(img, padding='replicate')

    # Preallocation for masks and per-image scores
    noticeable_artifacts_mask = torch.zeros_like(img_normalized, dtype=torch.bool)
    noise_mask = torch.zeros_like(img_normalized, dtype=torch.bool)
    activity_mask = torch.zeros_like(img_normalized, dtype=torch.bool)
    score = torch.zeros(bsz, device=img.device)

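    # A block is "spatially active" if its variance exceeds activity_threshold.
    # Active blocks are further tested for noticeable artifacts (low-variance edge
    # segments) and for Gaussian noise (center-surround deviation criterion);
    # distorted blocks contribute to the score weighted by (1 - variance) for
    # artifacts and by variance for noise.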
    nsegments = block_size - window_size + 1
    # Start of block by block processing
    for b in range(0, bsz):
        NHSA = 0
        dist_block_scores = 0
        for i in range(0, height, block_size):
            for j in range(0, width, block_size):

                # Weights initialization
                WNDC = WNC = 0

                # Compute block variance
                block = img_normalized[b, 0, i:i + block_size, j:j + block_size]
                block_var = torch.var(block, unbiased=True)

                # Considering spatially prominent blocks
                if block_var > activity_threshold:
                    activity_mask[b, 0, i:i + block_size, j:j + block_size] = True
                    WHSA = 1
                    NHSA += 1

                    # Analyze block for noticeable artifacts
                    block_impaired = notice_dist_criterion(
                        block, nsegments, block_size - 1, window_size, block_impaired_threshold, block_size
                    )

                    if block_impaired:
                        WNDC = 1
                        noticeable_artifacts_mask[b, 0, i:i + block_size, j:j + block_size] = True

                    # Analyze block for Gaussian noise distortions
                    block_sigma, block_beta = noise_criterion(block, block_size - 1, block_var)

                    if block_sigma > 2 * block_beta:
                        WNC = 1
                        noise_mask[b, 0, i:i + block_size, j:j + block_size] = True

                    # Pooling / distortion assignment
                    dist_block_scores += WHSA * WNDC * (1 - block_var) + WHSA * WNC * block_var

        # Quality score computation
        # C is a positive constant; it is included to prevent numerical instability
        C = 1
        score[b] = ((dist_block_scores + C) / (C + NHSA)) * 100

    noticeable_artifacts_mask = noticeable_artifacts_mask[..., :height, :width]
    noise_mask = noise_mask[..., :height, :width]
    activity_mask = activity_mask[..., :height, :width]

    return score, noticeable_artifacts_mask, noise_mask, activity_mask


def noise_criterion(block, block_size, block_var):
    """Function to analyze a block for Gaussian noise distortions."""
    # Compute block standard deviation
    block_sigma = torch.sqrt(block_var)
    # Compute ratio of center and surround standard deviation
    cen_sur_dev = cal_center_sur_dev(block, block_size)
    # Relation between center-surround deviation and the block standard deviation
    block_beta = torch.abs(block_sigma - cen_sur_dev) / torch.max(block_sigma, cen_sur_dev)
    return block_sigma, block_beta


def cal_center_sur_dev(block, block_size):
    """Function to compute the center-surround deviation of a block."""
    # Block center
    center1 = (block_size + 1) // 2
    center2 = center1 + 1
    center = torch.cat((block[..., center1 - 1], block[..., center2 - 1]), dim=0)

    # Block surround
    block = torch.cat((block[..., :center1 - 1], block[..., center1:]), dim=-1)
    block = torch.cat((block[..., :center2 - 1], block[..., center2:]), dim=-1)

    # Compute standard deviation of block center and block surround
    center_std = torch.std(center, unbiased=True)
    surround_std = torch.std(block, unbiased=True)
    # Ratio of center and surround standard deviation
    cen_sur_dev = center_std / surround_std
    # Check for NaN (zero surround deviation) and fall back to zero
    if torch.isnan(cen_sur_dev):
        cen_sur_dev = torch.zeros_like(cen_sur_dev)
    return cen_sur_dev


def notice_dist_criterion(block, nsegments, block_size, window_size, block_impaired_threshold, N):
    """Analyze a block for noticeable artifacts by checking its edge segments for low variance."""
    # Top edge of the block
    top_edge = block[0, :]
    seg_top_edge = segment_edge(top_edge, nsegments, block_size, window_size)

    # Right edge of the block
    right_side_edge = block[:, N - 1]
    seg_right_side_edge = segment_edge(right_side_edge, nsegments, block_size, window_size)

    # Bottom edge of the block
    down_side_edge = block[N - 1, :]
    seg_down_side_edge = segment_edge(down_side_edge, nsegments, block_size, window_size)

    # Left edge of the block
    left_side_edge = block[:, 0]
    seg_left_side_edge = segment_edge(left_side_edge, nsegments, block_size, window_size)

    # Compute the standard deviation of segments on the left, right, top and bottom edges of the block
    seg_top_edge_std_dev = torch.std(seg_top_edge, dim=1, unbiased=True)
    seg_right_side_edge_std_dev = torch.std(seg_right_side_edge, dim=1, unbiased=True)
    seg_down_side_edge_std_dev = torch.std(seg_down_side_edge, dim=1, unbiased=True)
    seg_left_side_edge_std_dev = torch.std(seg_left_side_edge, dim=1, unbiased=True)

    # The block is flagged as impaired if any edge segment has a standard deviation
    # below block_impaired_threshold.
    block_impaired = 0
    for seg_index in range(seg_top_edge.shape[0]):
        if (
            (seg_top_edge_std_dev[seg_index] < block_impaired_threshold)
            or (seg_right_side_edge_std_dev[seg_index] < block_impaired_threshold)
            or (seg_down_side_edge_std_dev[seg_index] < block_impaired_threshold)
            or (seg_left_side_edge_std_dev[seg_index] < block_impaired_threshold)
        ):
            block_impaired = 1
            break

    return block_impaired


def segment_edge(block_edge, nsegments, block_size, window_size):
    """Split a block edge into overlapping segments of `window_size` contiguous pixels."""
    segments = torch.zeros(nsegments, window_size, device=block_edge.device)
    for i in range(nsegments):
        # Sliding window: window_size is incremented each iteration so that the
        # slice [i:window_size] always covers window_size pixels starting at i.
        segments[i, :] = block_edge[i: window_size]
        if window_size <= (block_size + 1):
            window_size += 1
    return segments


@ARCH_REGISTRY.register()
class PIQE(torch.nn.Module):
    """
    PIQE module.

    Args:
        x (torch.Tensor): Input tensor of shape (B, C, H, W).

    Returns:
        torch.Tensor: PIQE score.
    """

    def __init__(self) -> None:
        super().__init__()
        self.results = None

    def get_masks(self):
        assert self.results is not None, 'Please calculate the PIQE score first.'
        return {
            'noticeable_artifacts_mask': self.results[1],
            'noise_mask': self.results[2],
            'activity_mask': self.results[3],
        }

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        self.results = piqe(x)
        return self.results[0]
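

# A minimal usage sketch: compute the PIQE score and the distortion masks for a
# random image. In practice the input should be a (B, C, H, W) float tensor with
# values in [0, 1].
if __name__ == '__main__':
    x = torch.rand(1, 3, 256, 256)
    model = PIQE()
    score = model(x)
    masks = model.get_masks()
    print(score, {k: v.shape for k, v in masks.items()})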