High Performance Computing

From geohpgc
Revision as of 08:48, 5 July 2024 by Zy (talk | contribs) (→‎XX: add swlcm)
Jump to navigation Jump to search

The HPC group currently has two PhD students, Yi Zhao and Jinxiao Zhang. Our research primarily focuses on enabling real-world applications to leverage the powerful computational capabilities of heterogeneous architectures, including Nvidia GPUs and domestic supercomputers. This encompasses algorithm design suitable for distributed and large-scale platforms, data compression, application refactoring and performance optimization across different architectures.

Large Scale Land Cover Mapping

SW-LCM: A Scalable and Weakly-supervised Land Cover Mapping Method on a New Sunway Supercomputer

截屏2024-07-05 下午4.42.35.png

Zhao Y, Zheng J, Fu H, et al. SW-LCM: a scalable and weakly-supervised land cover mapping method on a new Sunway supercomputer[C] 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 2023: 657-667.

截屏2024-07-05 下午4.42.50.png

High-resolution land cover mapping (LCM) is an important application for studying and understanding the change of the earth surface. While deep learning (DL) methods demonstrate great potential in analyzing satellite images, they largely depend on massive high-quality labels. This paper proposes SWLCM, a Scalable and Weakly-supervised two-stage Land Cover Mapping method on a new Sunway Supercomputer. Our method consists of a k-means clustering module as a first stage, and an iterative deep learning module as a second stage. With the kmeans module providing a good enough starting point (taking inaccurate results as noisy labels), the deep learning module improves the classification results in an iterative way, without any labelling efforts required for processing large scenarios. To achieve efficiency for country-level land cover mapping, we design a customized data partition scheme and an on-the-fly assembly for k-means. Through careful parallelization and optimization, our k-means module scales to 98,304 computing nodes (over 38 million cores), and provides a sustained performance of 437.56 PFLOPS, in a real LCM task of the entire region of China; the iterative updating part scales to 24,576 nodes, with a performance of 11 PFLOPS. We produce a 10-m resolution land cover map of China, with an accuracy of 83.5% (10-class) or 73.2% (25-class), 7% to 8% higher than best existing products, paving ways for finer land surveys to support sustainability-related applications.