Add self-supervised 3D-Var-based AI data assimilation #194

SOHAMPAL23 · 2026-01-05T15:34:06Z

Pull Request

Add self-supervised AI-based data assimilation using 3D-Var loss

Description

This PR introduces a self-supervised AI-based data assimilation prototype
The implementation replaces supervised learning with a physics-based 3D-Var cost function, allowing the model to learn directly from:
Sparse/noisy observations

Scope

Focused on demonstrating the feasibility and correctness of the self-supervised assimilation approach
Intended as an experimental framework, not a full operational replacement

Checklist

Implements the 3D-Var objective as the training loss
Supports both forecast-based and cold-start first-guess states
Modular design for experimenting with different observation operators and error assumptions
Suitable as a foundation for extending to real-world variables

for more information, see https://pre-commit.ci

SOHAMPAL23 · 2026-01-06T09:09:16Z

hey @jacobbieker Can you review this

jacobbieker

Please don't include AI-generated code or documentation.

Also, this PR is way too large, it needs to be split up.

jacobbieker · 2026-01-06T09:19:41Z

graph_weather/models/aurora/model.py



 class SelfAttentionLayer(nn.Module):
+    """Self-attention layer for processing point cloud features.


Please split these just doc changes into a separate PR, this is a huge PR that is hard to review and has a lot of unrelated changes. Do that, and then I can review just the implementation.

jacobbieker · 2026-01-06T09:21:14Z

graph_weather/models/data_assimilation.py

+    return H
+
+
+def generate_synthetic_data(batch_size=32, grid_size=(10, 10), num_channels=1):


This generation for testing shouldn't be in the model file, only in the unit tests

jacobbieker · 2026-01-06T09:24:06Z

graph_weather/models/training_loop.py

These model training loops, etc. stuff shouldn't be in the /models/ directory, but in a subdirectory, like /models/data_assimilation/ or something similar

jacobbieker · 2026-01-06T09:25:23Z

graph_weather/models/visualization.py

These feel like more generic things for training that shouldn't be in a model PR. If you want to add more visualizations, feel free, but that should be separate.

graph_weather/data_assimilation_implementation.md

jacobbieker · 2026-01-06T09:29:54Z

graph_weather/models/evaluation.py

I don't think this evaluation should be included in this PR, a lot of the computation already exists somewhere in the repo, and is more generally useful. Please refactor or remove.

jacobbieker · 2026-01-06T09:30:29Z

graph_weather/models/data_assimilation.py

+        return analysis
+
+
+class SimpleDataAssimilationModel(nn.Module):


I would prefer just the model implementation from the paper, not a second, simpler one for this.

jacobbieker · 2026-01-06T09:31:17Z

graph_weather/models/data_assimilation.py

+
+        self.network = nn.Sequential(*layers)
+
+    def forward(self, background, observations):


In this repo, we want the models to take in points and output points, so they are all compatible. Please make this work with the same kind of interface as the GraphWeatherForecaster, or GenCast implementations.

jacobbieker · 2026-01-06T09:31:37Z

graph_weather/data/assimilation_dataloader.py

+        return sample
+
+
+def create_synthetic_assimilation_dataset(


jacobbieker · 2026-01-06T09:33:02Z

graph_weather/data/assimilation_dataloader.py

This might be needed, but needs changes. The models being added should be compatible with taking in points and lat/lon locations, and so should this one. This might make it not necessary to have this.

graph_weather/data/__init__.py

for more information, see https://pre-commit.ci

SOHAMPAL23 · 2026-01-06T10:48:46Z

Sure @jacobbieker
Working on the splitting part of the pr and making other changes in the new PR

jacobbieker · 2026-01-16T09:14:32Z

Closing this as #196 is a duplicate

SOHAMPAL23 and others added 3 commits January 5, 2026 20:33

feat: add self-supervised 3D-Var-based AI data assimilation prototype

15abece

[pre-commit.ci] auto fixes from pre-commit.com hooks

0700a8e

for more information, see https://pre-commit.ci

updation

71c03e2

SOHAMPAL23 marked this pull request as draft January 5, 2026 15:47

SOHAMPAL23 mentioned this pull request Jan 5, 2026

[Paper] AI-based Data Assimilation #173

Open

SOHAMPAL23 and others added 5 commits January 5, 2026 23:39

updating the files

1344290

[pre-commit.ci] auto fixes from pre-commit.com hooks

b896607

for more information, see https://pre-commit.ci

precommit ruff checks

1643839

Merge branch 'main' of https://github.com/SOHAMPAL23/graph_weather

82cbaeb

[pre-commit.ci] auto fixes from pre-commit.com hooks

48a8111

for more information, see https://pre-commit.ci

SOHAMPAL23 marked this pull request as ready for review January 6, 2026 09:07

jacobbieker self-requested a review January 6, 2026 09:18

jacobbieker requested changes Jan 6, 2026

View reviewed changes

jacobbieker reviewed Jan 6, 2026

View reviewed changes

graph_weather/data/__init__.py Outdated Show resolved Hide resolved

SOHAMPAL23 and others added 2 commits January 6, 2026 16:13

deletion of the unwanted files

40b75e1

[pre-commit.ci] auto fixes from pre-commit.com hooks

c6b3791

for more information, see https://pre-commit.ci

Refactor import statements in __init__.py

ab16fe0

jacobbieker closed this Jan 16, 2026



		class SelfAttentionLayer(nn.Module):
		"""Self-attention layer for processing point cloud features.

		return H


		def generate_synthetic_data(batch_size=32, grid_size=(10, 10), num_channels=1):

		return analysis


		class SimpleDataAssimilationModel(nn.Module):


		self.network = nn.Sequential(*layers)

		def forward(self, background, observations):

Uh oh!

Add self-supervised 3D-Var-based AI data assimilation #194

Add self-supervised 3D-Var-based AI data assimilation #194

Uh oh!

Conversation

SOHAMPAL23 commented Jan 5, 2026

Pull Request

Description

Scope

Checklist

Uh oh!

SOHAMPAL23 commented Jan 6, 2026

Uh oh!

jacobbieker left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

SOHAMPAL23 commented Jan 6, 2026

Uh oh!

jacobbieker commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants