DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data

General remarks

Nighttime light (NTL) remote sensing observations serve as a unique proxy for quantitatively assessing progress toward a range of Sustainable Development Goals (SDGs), supporting applications such as poverty estimation, sustainable urban development, and carbon emission monitoring. However, existing NTL observations often suffer from pervasive degradation and inconsistency, limiting their utility for computing the indicators defined by the SDGs.

In this research, we present a novel task: reconstructing high-resolution nighttime light images from multi-modality data. To support this endeavor, we introduce DeepLightMD, a comprehensive dataset comprising data from five heterogeneous sensors, offering fine spatial resolution and rich spectral information at a national scale. Additionally, we present DeepLightSR, a calibration-aware method that bridges spatially heterogeneous modalities in multi-modality super-resolution (SR).

Dataset Description

DeepLightMD is a national-scale multi-modality SR dataset consisting of training, validation, and test splits. The training split contains 18,032 pairs of clips, and the validation split contains 2,254 pairs. To support evaluation at both local and regional scales, the test split contains 2,254 pairs of clips plus three province-level samples covering Anhui, Beijing, and Taiwan. Each pair of clips contains a low-resolution (LR) NTL image (256 × 256 pixels), a high-resolution (HR) NTL image (2048 × 2048 pixels), a DMO image (2048 × 2048 pixels, 7 bands), a DEM (digital elevation model, 2048 × 2048 pixels), and an ISP map (2048 × 2048 pixels).
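
The NTL pair implies an 8× upscaling factor (2048 / 256 = 8). For concreteness, here is a minimal Python sketch that checks one sample pair against these documented shapes; the field names and the channel-first layout are illustrative assumptions, not the official schema of the released dataset:

    import numpy as np

    # Expected array shapes (channels, height, width) for one DeepLightMD
    # sample pair; the field names are illustrative, not an official schema.
    EXPECTED_SHAPES = {
        "lr_ntl": (1, 256, 256),     # low-resolution NTL input
        "hr_ntl": (1, 2048, 2048),   # high-resolution NTL target
        "dmo":    (7, 2048, 2048),   # 7-band DMO auxiliary input
        "dem":    (1, 2048, 2048),   # DEM auxiliary input
        "isp":    (1, 2048, 2048),   # ISP auxiliary supervision
    }

    def validate_sample(sample):
        """Assert that every modality in a sample pair has the documented shape."""
        for key, shape in EXPECTED_SHAPES.items():
            arr = np.asarray(sample[key])
            assert arr.shape == shape, f"{key}: expected {shape}, got {arr.shape}"

    # Smoke test with placeholder arrays:
    dummy = {k: np.zeros(s, dtype=np.float32) for k, s in EXPECTED_SHAPES.items()}
    validate_sample(dummy)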

Method Description

The DeepLightSR framework partitions the multi-modality data into four distinct categories: main-modality input (LR NTL), auxiliary-modality inputs (DMO and DEM), main-modality supervision (HR NTL), and auxiliary-modality supervision (ISP).
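
A minimal PyTorch sketch of how these four roles could enter a single training step follows; the model interface, the channel-wise stacking of the auxiliary inputs, the L1 losses, and the collapse of the multiscale outputs to a single scale are all assumptions made for illustration, not the published DeepLightSR design:

    import torch
    import torch.nn as nn

    def training_step(model: nn.Module, batch: dict,
                      alpha: float = 1.0, beta: float = 0.1) -> torch.Tensor:
        """One loss computation over the four modality roles.

        batch keys (illustrative): 'lr_ntl' (main input), 'dmo' and 'dem'
        (auxiliary inputs), 'hr_ntl' (main supervision), 'isp' (auxiliary
        supervision).
        """
        # The auxiliary inputs share the HR grid, so they can be stacked
        # channel-wise: (B, 7, 2048, 2048) + (B, 1, 2048, 2048) -> (B, 8, ...).
        aux_in = torch.cat([batch["dmo"], batch["dem"]], dim=1)

        # Assumed interface: the model maps (main input, auxiliary inputs)
        # to (SR NTL prediction, ISP prediction).
        sr_ntl, isp_pred = model(batch["lr_ntl"], aux_in)

        main_loss = nn.functional.l1_loss(sr_ntl, batch["hr_ntl"])  # main supervision
        aux_loss = nn.functional.l1_loss(isp_pred, batch["isp"])    # auxiliary supervision
        return alpha * main_loss + beta * aux_loss

Here alpha and beta play the same role as the loss weights α and β introduced in the objective below.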

DeepLightSR can be optimized with supervision from the HR NTL image N_H ∈ ℝ^(rh × rw) and auxiliary supervision from the ISP map M_ISP. Here, r denotes the upscaling ratio of the SR task, and h and w denote the height and width of the LR NTL image N_L, respectively. The comprehensive objective for training DeepLightSR combines a main loss ℱ1 over the multiscale SR outputs with an auxiliary ISP loss, where Θ denotes the parameters of DeepLightSR, n and m denote the number of training samples and multiscale outputs, respectively, N_{G,m} ∈ ℝ^(rh × rw) is the SR NTL image evaluated by ℱ1, and α and β are the weights of the two loss functions and of the multiscale tasks.
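
Written out in LaTeX, a plausible form of this objective, assuming ℱ1 compares each multiscale SR output against N_H, a second loss ℱ2 compares the predicted ISP map M̂_ISP against M_ISP, and each scale j carries a weight α_j (these assumptions are inferred from the symbol definitions above, not taken from the original equation):

    \Theta^{*} = \arg\min_{\Theta} \frac{1}{n} \sum_{i=1}^{n}
      \Bigl[ \sum_{j=1}^{m} \alpha_{j}\,
        \mathcal{F}_{1}\bigl(N_{G,j}^{(i)},\, N_{H}^{(i)}\bigr)
      + \beta\,
        \mathcal{F}_{2}\bigl(\hat{M}_{\mathrm{ISP}}^{(i)},\, M_{\mathrm{ISP}}^{(i)}\bigr)
      \Bigr]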

The code and datasets are under technical review and will be made available soon.