Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images

ICCV 2025

Shunya Nagashima and Komei Sugiura

Keio University, Japan

Deep Space Weather Model accurately and reliably predicts solar flares by capturing long-range spatio-temporal dependencies and fine-grained features in multi-channel solar image series.

Abstract

Accurate and reliable solar flare prediction is crucial for mitigating potential disruptions to critical infrastructure, yet it remains a significant challenge. Existing methods based on heuristic physical features often lack representation learning from solar images, whereas end-to-end learning approaches struggle to model long-range temporal dependencies in solar images.

In this study, we propose Deep Space Weather Model (Deep SWM), which is based on multiple deep state space models for handling both ten-channel solar images and long-range spatio-temporal dependencies. Deep SWM also features a sparse masked autoencoder, a novel pretraining strategy that employs a two-phase masking approach to preserve crucial regions such as sunspots while compressing spatial information.

Furthermore, we built FlareBench, a new public benchmark for solar flare prediction covering a full 11-year solar activity cycle, to validate our method.

Our method outperformed baseline methods, and even human experts, on standard metrics of both performance and reliability.

Multi-Wavelength Solar Observations

Exploring the Sun's dynamic nature through different wavelengths, revealing the complex interactions between plasma, magnetic fields, and solar phenomena

Channel          Observes                   Band / note
AIA 94 Å         Hot corona & solar flares  Extreme Ultraviolet (EUV)
AIA 131 Å        Flaring regions            Extreme Ultraviolet (EUV)
AIA 171 Å        Quiet corona & loops       Extreme Ultraviolet (EUV)
AIA 193 Å        Corona & hot spots         Extreme Ultraviolet (EUV)
AIA 211 Å        Active regions             Extreme Ultraviolet (EUV)
AIA 304 Å        Chromosphere & transition  Extreme Ultraviolet (EUV)
AIA 335 Å        Active regions             Extreme Ultraviolet (EUV)
AIA 1600 Å       Upper photosphere          Ultraviolet (UV)
AIA 4500 Å       Photosphere                Visible Light
HMI Magnetogram  Solar magnetic field       High-resolution

Data sources: AIA level 1 images in nine wavelengths (EUV: 94Å, 131Å, 171Å, 193Å, 211Å, 304Å, 335Å; UV: 1600Å; Visible: 4500Å) and high-resolution (1K) magnetograms from the HMI, obtained from JSOC.

Overview

Problem Setting

We tackle the challenging task of predicting the class of the largest solar flare within a 24-hour horizon using full-disk multi-wavelength solar images. This is formulated as a multi-class classification problem with significant real-world implications for space weather forecasting.

Flare class  Peak X-ray flux I [W/m²]
X            I > 10^-4
M            10^-5 < I ≤ 10^-4
C            10^-6 < I ≤ 10^-5
Below C      I ≤ 10^-6

Correspondence between flare classes and peak X-ray flux intensities.
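As a minimal sketch of how a training label could be derived from the thresholds listed above, the Python snippet below maps a peak flux to a class and then takes the largest flare in a 24-hour window. The function names, the event format (pairs of peak time and peak flux), and the label "below-C" for the lowest class are assumptions made for this illustration; they are not taken from the paper or from FlareBench.

```python
from datetime import datetime, timedelta

# Lower bounds in W/m^2, copied from the flux ranges above. X, M, and C are
# the standard GOES class names; "below-C" is a placeholder for this sketch.
THRESHOLDS = [(1e-4, "X"), (1e-5, "M"), (1e-6, "C")]


def flare_class(peak_flux):
    """Map a peak X-ray flux (W/m^2) to a class using strict lower bounds."""
    for lower_bound, label in THRESHOLDS:
        if peak_flux > lower_bound:
            return label
    return "below-C"


def label_for_window(events, issue_time, horizon_hours=24):
    """Class of the largest flare peaking in (issue_time, issue_time + horizon].

    `events` is a list of (peak_time, peak_flux) pairs (hypothetical format).
    A window with no events is assigned the lowest class.
    """
    end = issue_time + timedelta(hours=horizon_hours)
    fluxes = [flux for t, flux in events if issue_time < t <= end]
    return flare_class(max(fluxes, default=0.0))


if __name__ == "__main__":
    t0 = datetime(2022, 1, 1)
    events = [
        (t0 + timedelta(hours=3), 3e-6),   # C-class, inside the window
        (t0 + timedelta(hours=20), 2e-5),  # M-class, inside the window
        (t0 + timedelta(hours=30), 5e-4),  # X-class, outside the window
    ]
    print(label_for_window(events, t0))  # prints "M"
```

Note that a flux of exactly 10^-4 falls in class M here, because the upper bound of each range is inclusive.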

Model Architecture

We propose Deep SWM, a novel architecture extending deep state-space models for classifying the maximum solar flare class within a 24-hour horizon, utilizing HMI and multi-wavelength AIA images.
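For readers unfamiliar with state-space models, the sketch below shows the textbook single-channel linear recurrence that such models build on. It is not the Deep SWM architecture: real deep state-space layers use vector-valued states and learned parameters, and the names `ssm_scan`, `a`, `b`, and `c` are chosen here purely for illustration.

```python
def ssm_scan(inputs, a, b, c):
    """Run a scalar linear state-space model over a sequence.

    State update:  x_k = a * x_{k-1} + b * u_k
    Readout:       y_k = c * x_k
    """
    state = 0.0
    outputs = []
    for u in inputs:
        state = a * state + b * u
        outputs.append(c * state)
    return outputs


# A single impulse at the first step, followed by 99 steps of silence.
impulse = [1.0] + [0.0] * 99

slow = ssm_scan(impulse, a=0.99, b=1.0, c=1.0)  # decays slowly: long memory
fast = ssm_scan(impulse, a=0.50, b=1.0, c=1.0)  # decays quickly: short memory

if __name__ == "__main__":
    print(slow[-1], fast[-1])
```

With `a` close to 1 the impulse is still clearly visible after 100 steps, while with `a = 0.5` it has effectively vanished. This dependence of memory length on the state transition is one intuition for why state-space models are attractive for long input sequences.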

The novelties of our proposed method are as follows:

Solar Spatial Encoder (SSE)

The SSE comprises the Depth-wise Channel Selective Module (DCSM) and the Spatio-Temporal State-Space Module (ST-SSM). The DCSM selectively weights multi-wavelength image channels to emphasize features relevant to solar events, while the ST-SSM efficiently captures long-range spatio-temporal dependencies in the solar images.

Long-range Temporal SSM (LT-SSM)

The LT-SSM extends deep state-space models to model temporal dependencies longer than the solar rotation period, operating on the intermediate features obtained from the pretraining stage. This allows it to efficiently capture the long-range relationships that are crucial for solar flare prediction.

Sparse MAE

A pretraining strategy tailored for solar images that extends the Masked Autoencoder (MAE). Sparse MAE addresses the challenge of sparse, yet crucial, information regions in solar images (e.g., sunspots) using a novel two-phase masking approach. This ensures that these crucial regions are less likely to be completely masked during pretraining, leading to improved intermediate feature representations.
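The exact two phases of Sparse MAE are not spelled out on this page, so the following is only a simplified stand-in for the idea of variance-aware masking, not the authors' algorithm. It assumes each patch is a flat list of pixel values, protects the `alpha` fraction of patches with the highest standard deviation (a rough proxy for structures such as sunspots), and randomly masks a `rho` fraction of the rest. The function name and both parameters' defaults are hypothetical.

```python
import random
from statistics import pstdev


def two_phase_mask(patches, alpha=0.25, rho=0.5, seed=0):
    """Return a list of booleans, True where a patch is masked.

    Phase 1: rank patches by pixel standard deviation and protect the top
             `alpha` fraction from masking.
    Phase 2: randomly mask a `rho` fraction of the remaining patches.
    This is an illustrative simplification, not the published method.
    """
    n = len(patches)
    ranked = sorted(range(n), key=lambda i: pstdev(patches[i]), reverse=True)
    n_protected = max(1, round(alpha * n))
    protected = set(ranked[:n_protected])

    candidates = [i for i in range(n) if i not in protected]
    rng = random.Random(seed)
    masked = set(rng.sample(candidates, round(rho * len(candidates))))
    return [i in masked for i in range(n)]


if __name__ == "__main__":
    # Eight 2x2 patches: two with strong contrast, six uniform.
    patches = [[1.0] * 4 for _ in range(8)]
    patches[0] = [0.0, 10.0, 0.0, 10.0]
    patches[5] = [0.0, 5.0, 0.0, 5.0]
    print(two_phase_mask(patches))
```

In this simplified version the protected patches are never masked, which is stricter than the "less likely to be completely masked" behaviour described above; a closer approximation might mask them with a lower probability instead.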

Quantitative Results

Our method outperforms all baseline approaches across all metrics and even surpasses human expert performance, demonstrating the effectiveness of our approach for solar flare prediction.

Table 1: Comparison of our method with state-of-the-art approaches and human experts. Higher values are better for all metrics.

Method                    Test period                        GMGS↑        BSS≥M↑        TSS≥M↑
Flare Transformer w/o PF  2014-2017 (4 years)                0.220±0.116  -1.770±0.225  0.198±0.371
DeFN-R                    2014-2015 (2 years)                0.302±0.055  0.036±0.982   0.279±0.162
CNN-LSTM                  2019-12-01 – 2022-11-30 (3 years)  0.315±0.166  0.272±0.259   0.330±0.306
DeFN                      2014-2015 (2 years)                0.375±0.141  0.022±0.782   0.413±0.150
Flare Transformer         2014-2017 (4 years)                0.503±0.059  0.082±0.974   0.530±0.112
Ours                      2019-12-01 – 2022-11-30 (3 years)  0.582±0.032  0.334±0.299   0.543±0.074
Human experts             2000-2015 (16 years)               0.48         0.16          0.50
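To make two of the metrics in Table 1 concrete, the sketch below computes the true skill statistic (TSS) and the Brier skill score (BSS) for a binary "≥M or not" target, using their standard textbook definitions. The paper's exact evaluation protocol (for example, its choice of reference forecast for BSS) may differ, and the function names are chosen for this illustration. GMGS is omitted because it depends on a scoring matrix derived from class frequencies.

```python
def tss(y_true, y_pred):
    """True skill statistic: hit rate minus false-alarm rate, in [-1, 1].

    Assumes both classes occur in `y_true`; otherwise a division by zero
    is raised.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    tn = sum(1 for t, p in zip(y_true, y_pred) if not t and not p)
    return tp / (tp + fn) - fp / (fp + tn)


def bss(y_true, prob):
    """Brier skill score relative to a climatological (base-rate) forecast.

    Assumes the reference is the sample base rate of `y_true`.
    """
    n = len(y_true)
    base_rate = sum(y_true) / n
    brier = sum((p - t) ** 2 for t, p in zip(y_true, prob)) / n
    brier_ref = sum((base_rate - t) ** 2 for t in y_true) / n
    return 1.0 - brier / brier_ref


if __name__ == "__main__":
    y_true = [1, 1, 0, 0]
    print(tss(y_true, [1, 0, 0, 0]))          # prints 0.5
    print(bss(y_true, [0.5, 0.5, 0.5, 0.5]))  # prints 0.0
```

Under this definition, a BSS of 0 means the probabilistic forecast is no better than always predicting the base rate, and a negative BSS, such as the value reported for Flare Transformer w/o PF, means it is worse than that reference.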

Qualitative Results

Reconstruction results obtained from the baseline MAE (ρ=0.5) and our proposed Sparse MAE. Rows (a), (b), (c), (d), (e), and (f) show 94 Å, 171 Å, 304 Å, 1600 Å, 4500 Å AIA, and HMI images, respectively, captured three hours before an upcoming X-class flare. Columns (i), (ii), (iii), (iv), and (v) present the original image, the baseline reconstruction, a visualization of patches with the top α% highest standard deviation highlighted, the spatial-level masking of the Sparse MAE, and the reconstruction of the Sparse MAE, respectively.

BibTeX

@inproceedings{nagashima2025deepswm,
  title={Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images},
  author={Shunya Nagashima and Komei Sugiura},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2025}
}