Algorithm description

This section provides a brief overview of the DTCWT-based fusion algorithm as used in this project. Its input is a set of images, \(\mathcal{I}\).

Alignment

This step aligns each input image to a template image using a single global translation. The central image of \(\mathcal{I}\) is selected as the template image \(T\). For each image \(I \in \mathcal{I}\):

  1. Compute the cross-correlation image \(C = (I \cdot w) \star (T \cdot w)\), where \(w\) is a two-dimensional Hamming window, \(\cdot\) denotes pixel-wise multiplication and \(\star\) is the cross-correlation operator. Normalise this cross-correlation, \(C \rightarrow C / (w \star w)\), where \(/\) denotes element-wise division.
  2. Find the location of the maximum of \(C\) and compute the corresponding translational shift. The maximum is found ignoring an apron around the edge of the image, which avoids spurious matches arising from small overlap regions.
  3. Warp \(I\) according to that translation.

Combine all aligned images into the set of aligned images, \(\mathcal{I}_a\).
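The translation estimate above can be implemented with FFT-based cross-correlation. The sketch below is a minimal NumPy/SciPy rendering of the three steps; the function name, the apron width and the interpolation settings are illustrative assumptions rather than the project's actual code.

    import numpy as np
    from scipy import ndimage

    def align_to_template(image, template, apron=32):
        """Align *image* to *template* with a single global translation (sketch)."""
        h, w = template.shape
        window = np.outer(np.hamming(h), np.hamming(w))

        # Step 1: windowed cross-correlation via the FFT, normalised by the
        # autocorrelation of the window itself.
        C = np.fft.ifft2(np.fft.fft2(image * window) *
                         np.conj(np.fft.fft2(template * window))).real
        W = np.fft.ifft2(np.abs(np.fft.fft2(window)) ** 2).real
        C /= np.maximum(W, 1e-8)

        # Step 2: find the peak, ignoring an apron around the edge so that
        # small overlap regions cannot dominate the match.
        C = np.fft.fftshift(C)
        C[:apron, :] = C[-apron:, :] = -np.inf
        C[:, :apron] = C[:, -apron:] = -np.inf
        peak_y, peak_x = np.unravel_index(np.argmax(C), C.shape)
        dy, dx = peak_y - h // 2, peak_x - w // 2

        # Step 3: warp (translate) the image so that it matches the template.
        return ndimage.shift(image, (-dy, -dx), order=1, mode='nearest')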

Registration

This step locally warps each aligned image to best match the template image \(T\) used above. For each image \(I \in \mathcal{I}_a\):

  1. Compute the local affine warp mapping \(I\) to \(T\) as described in [1, 2].
  2. Warp \(I\) according to the registration.

Combine all registered images into the set of registered images, \(\mathcal{I}_r\).
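The dtcwt Python library provides a registration module implementing the algorithm of [1, 2], so this step can be sketched against that API. The number of transform levels and the interpolation method below are illustrative assumptions, not necessarily the project's settings.

    import dtcwt
    import dtcwt.registration as registration

    def register_to_template(image, template, nlevels=6):
        """Locally warp *image* onto *template* (sketch using dtcwt.registration)."""
        transform = dtcwt.Transform2d()
        src = transform.forward(image, nlevels=nlevels)
        ref = transform.forward(template, nlevels=nlevels)

        # Step 1: estimate the local affine parameters mapping image -> template.
        avecs = registration.estimatereg(src, ref)

        # Step 2: resample the image according to the estimated local warp.
        return registration.warp(image, avecs, method='bilinear')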

Fusion

This step combines all images in \(\mathcal{I}_r\) into a single fused image. The fusion is performed in the wavelet domain and is based on the technique in [3]. The overall lowpass image is computed by taking the mean of the lowpass images corresponding to each image in \(\mathcal{I}_r\). Letting \(\theta^{(i)}_{d,\ell,j}\) denote the \(j\)-th highpass coefficient in direction \(d\) at level \(\ell\) of the DTCWT transform of the \(i\)-th image in \(\mathcal{I}_r\), we can construct the fused wavelet coefficients \(\theta_{d,\ell,j}\) in the following way:

  1. Compute \(\Theta_{d,\ell,j} = \sum_{i} \theta^{(i)}_{d,\ell,j}\) and then \(\phi_{d,\ell,j} = \Theta_{d,\ell,j} / \left| \Theta_{d,\ell,j} \right|\). These unit-magnitude complex numbers represent the average phase of corresponding wavelet coefficients over all registered images.
  2. Form the set \(\mathcal{T}_{d,\ell,j} = \left\{ \left| \theta^{(1)}_{d,\ell,j} \right|, \left| \theta^{(2)}_{d,\ell,j} \right|, \ \dots\ , \left| \theta^{(N)}_{d,\ell,j} \right| \right\}\) for the \(N\) images in \(\mathcal{I}_r\). Select \(T_{d, \ell, j}\) from this set using a selection heuristic. The current implementation supports three: the mean value, the maximum value, or the maximum value after 2-sigma outliers are removed. Which strategy works best may depend on the input imagery.
  3. Compute \(\theta_{d,\ell,j} = T_{d, \ell, j} \, \phi_{d,\ell,j}\).
  4. Inverse DTCWT to form the fused image \(I_f\).
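A compact sketch of these four steps using NumPy and the dtcwt library follows. The strategy names and the exact form of the 2-sigma clipping are assumptions made for illustration; the project's implementation may differ in detail.

    import numpy as np
    import dtcwt

    def fuse(images, nlevels=4, strategy='max'):
        """Fuse registered images in the DTCWT domain (sketch of steps 1-4)."""
        transform = dtcwt.Transform2d()
        pyramids = [transform.forward(im, nlevels=nlevels) for im in images]

        # Overall lowpass image: mean of the per-image lowpass images.
        lowpass = np.mean([p.lowpass for p in pyramids], axis=0)

        fused_highpasses = []
        for level in range(nlevels):
            # Stack corresponding complex coefficients: shape (N, H, W, 6).
            coeffs = np.stack([p.highpasses[level] for p in pyramids])

            # Step 1: unit-magnitude average phase over all registered images.
            theta_sum = coeffs.sum(axis=0)
            phi = theta_sum / np.maximum(np.abs(theta_sum), 1e-12)

            # Step 2: select a magnitude from the per-image magnitudes.
            mags = np.abs(coeffs)
            if strategy == 'mean':
                T = mags.mean(axis=0)
            elif strategy == 'max':
                T = mags.max(axis=0)
            else:  # maximum after 2-sigma outliers are removed (assumed rule)
                mu, sd = mags.mean(axis=0), mags.std(axis=0)
                T = np.where(mags <= mu + 2 * sd, mags, 0.0).max(axis=0)

            # Step 3: recombine the selected magnitude with the average phase.
            fused_highpasses.append(T * phi)

        # Step 4: inverse DTCWT of the fused coefficients.
        return transform.inverse(dtcwt.Pyramid(lowpass, tuple(fused_highpasses)))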

Shrinkage

The wavelet coefficients of the fused image \(I_f\) were selected to maximise sharpness, which may cause noise to be incorrectly preserved in the output image. A wavelet coefficient shrinkage method based on that in [4] is therefore applied to give the final fused and denoised image.
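The shrinkage in [4] is model-based; as a much simpler illustration of the general idea, the sketch below soft-thresholds the magnitude of each complex coefficient while preserving its phase. It is a stand-in for, not a reproduction of, the method of [4].

    import numpy as np

    def soft_shrink(highpass, threshold):
        """Shrink complex DTCWT coefficient magnitudes toward zero (soft threshold).

        A simple stand-in for the shrinkage in [4]: magnitudes are reduced by
        *threshold* and floored at zero; the coefficient phase is unchanged.
        """
        mag = np.abs(highpass)
        scale = np.maximum(mag - threshold, 0.0) / np.maximum(mag, 1e-12)
        return highpass * scale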

References

  1. Pang, Derek, Huizhong Chen, and Sherif Halawa. “Efficient video stabilization with dual-tree complex wavelet transform.” EE368 Project Report, Spring 2010.
  2. Chen, Huizhong, and Nick Kingsbury. “Efficient registration of nonrigid 3-D bodies.” IEEE Transactions on Image Processing 21.1 (2012): 262-272.
  3. Anantrasirichai, Nantheera, et al. “Atmospheric Turbulence Mitigation using Complex Wavelet-based Fusion.” IEEE Transactions on Image Processing (2013).
  4. Loza, Artur, et al. “Non-Gaussian model-based fusion of noisy images in the wavelet domain.” Computer Vision and Image Understanding 114.1 (2010): 54-65.