
Enable patchwise training and prediction #135

Open · wants to merge 124 commits into base: main

Conversation

davidwilby (Collaborator):

Hey @tom-andersson - at long last, the long-awaited patchwise training and prediction feature that @nilsleh and @MartinSJRogers have been working on.

This PR adds patching capabilities to DeepSensor during training and inference.

Training

Optional args patching_strategy, patch_size, stride and num_samples_per_date are added to TaskLoader.__call__.

There are two available patching strategies: random_window and sliding_window. The random_window option randomly selects points in the x1 and x2 extent as patch centroids, with the number of patches defined by the num_samples_per_date argument. The sliding_window option starts at the top left of the dataset and slides over the data from left to right and top to bottom using the user-defined patch_size and stride.
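
For illustration, a minimal sketch of how these arguments might be used when generating tasks (era5_ds and the date are placeholders, other TaskLoader.__call__ arguments are left at their defaults, and this is not lifted from the PR itself):

    from deepsensor.data import TaskLoader

    task_loader = TaskLoader(context=era5_ds, target=era5_ds)

    # Random patches: four 0.5 x 0.5 patches per date with random centroids
    tasks = task_loader(
        "2020-01-01",
        patching_strategy="random_window",
        patch_size=(0.5, 0.5),
        num_samples_per_date=4,
    )

    # Sliding window: patches laid out from the top left, left-to-right, top-to-bottom
    tasks = task_loader(
        "2020-01-01",
        patching_strategy="sliding_window",
        patch_size=(0.5, 0.5),
        stride=(0.25, 0.25),
    )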

TaskLoader.__call__ now contains additional conditional logic depending on the patching strategy selected. If no patching strategy is selected, task_generator() runs exactly as before. If random_window or sliding_window is selected, the bounding boxes for the patches are generated by the sample_random_window() or sample_sliding_window() method respectively. The bounding boxes are appended to the list bboxes and passed to task_generator().

Within task_generator(), after the sampling strategies are applied, the data is spatially sliced with the self.spatial_slice_variable() function for each bbox in bboxes.

When using a patching strategy, TaskLoader produces a list of tasks per date, rather than an individual task per date. A small change has been made to Task's summarise_str method to avoid an error when printing patched Tasks and to output more meaningful information.
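
Since each date now yields a list of Tasks rather than a single Task, a training loop iterates over the patches for each date. A rough sketch, where task_loader, trainer and train_dates stand in for whatever training setup you already use:

    # Sketch only: task_loader, trainer and train_dates are placeholders.
    for date in train_dates:
        patched_tasks = task_loader(
            date,
            patching_strategy="sliding_window",
            patch_size=(0.5, 0.5),
            stride=(0.25, 0.25),
        )
        # patched_tasks is a list of Tasks, one per patch for this date
        losses = trainer(patched_tasks)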

Inference

To run patchwise predictions, a new method called predict_patch() has been created in model.py. This method iterates through the patched tasks and applies the pre-existing predict() method to each one; predict() itself has not been changed. Within each iteration, prior to running predict(), the bounding box of the patch is unnormalised so that the patch's X_t can be passed to predict(). The patchwise predictions are stored in the list preds for subsequent stitching.

Only the sliding_window patching strategy can be used during inference, and the stride and patch size are defined when the user generates the test tasks in the task_loader() call. The data_processor must also be passed to the predict_patch() method to enable unnormalisation of the bbox coordinates in model.py.
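
A rough usage sketch of this inference flow (variable names are placeholders and exact signatures may differ slightly from the implementation in this PR):

    # Sketch only: era5_ds, data_processor and model are placeholders.
    test_tasks = task_loader(
        "2020-02-01",
        patching_strategy="sliding_window",
        patch_size=(0.6, 0.6),
        stride=(0.5, 0.5),
    )

    pred = model.predict_patch(
        test_tasks,
        X_t=era5_ds,                    # target grid at the original extent
        data_processor=data_processor,  # needed to unnormalise the patch bboxes
    )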

Once the list of patchwise predictions is generated, stitch_clipped_predictions() is used to form a prediction at the original X_t extent. Currently, each patchwise prediction is subset (clipped) so that there is no overlap between adjacent patches, and the patches are then merged using xr.combine_by_coords(). The modular nature of the code means there is scope for additional stitching strategies to be added after this PR, for example applying a weighting function to overlapping predictions. To ensure the patches are clipped by the correct amount, get_patch_overlap() calculates the overlap between adjacent patches. stitch_clipped_predictions() also contains code to handle patches at the edge or bottom of the dataset, where the overlap may be different.
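
To illustrate the clip-then-merge idea in isolation (this is not the PR's implementation: clip_overlap is a hypothetical helper, the pixel arithmetic is simplified, and the special-casing of edge patches is omitted):

    import xarray as xr

    def clip_overlap(patch_pred: xr.Dataset, overlap_px: tuple) -> xr.Dataset:
        """Trim half of the overlap from each side of an interior patch."""
        o1, o2 = overlap_px
        return patch_pred.isel(
            x1=slice(o1 // 2, patch_pred.sizes["x1"] - o1 // 2),
            x2=slice(o2 // 2, patch_pred.sizes["x2"] - o2 // 2),
        )

    # Clip every patchwise prediction, then merge them on their coordinates
    clipped = [clip_overlap(p, overlap_px) for p in patch_preds]
    stitched = xr.combine_by_coords(clipped, compat="no_conflicts")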

The output from predict_patch() is the same DeepSensor Prediction object produced by model.predict(), hence DeepSensor's plotting functionality can subsequently be used in the same way.

Documentation and Testing

New notebook(s) are added illustrating the usage of both patchwise training and prediction.

New tests are added to verify the new behaviour.

Limitations

  • Patchwise prediction does not currently support predicting at more than one timestamp - calling predict_patch with more than one date raises a NotImplementedError.
  • predict_patch is a new, distinct method due to all the pre-processing it needs to do; the patchwise behaviour may be better served as an option in predict - let me know what you think.
  • Patched tasks don't exactly follow the proportions from patch_size; e.g. for a 'square' patch with patch_size=(0.5, 0.5), the exact dimensions won't be perfectly square. This is accounted for in the stitching of patches, but it is slightly inelegant at the moment, so we may want to come back and find a more refined solution in the future.
  • In test_model.test_patchwise_prediction I've temporarily commented out the asserts checking for the correct prediction shape; these fail with the test datasets for now, but with real datasets the shapes are correct - see the patchwise_training_and_prediction.ipynb notebook.


@tom-andersson (Collaborator) left a comment:

Very exciting! Great to see this finally out for review! Thanks for kicking this off with a description of the feature, a new documentation page, and some unit tests.

As it's a very large PR, we'll probably have to go through a few review cycles. With that in mind, I've just skimmed and left a few high-level comments with the assumption that there will be some more iteration and tidying before I take another closer look.

Before sending back for review, please:

  1. Fix the failing unit tests. I think there is a type hint error.
  2. Generate the documentation locally and check it makes sense. See https://github.com/alan-turing-institute/deepsensor/blob/main/CONTRIBUTING.md#contributing-to-documentation

progress_bar: int = 0,
verbose: bool = False,
) -> Prediction:
"""Predict on a regular grid or at off-grid locations.
Collaborator:

Update the docstring to explain the patching procedure.

Collaborator Author:

325de6d

I'm also trying out using autodoc's versionadded directive here, in yesterday's community meeting users were keen to have new features highlighted in the docs. What do you think?

[screenshot: rendered docs page showing the versionadded admonition]

        .. versionadded:: 0.4.3
            :py:func:`predict_patchwise()` method.

## start with first patch top left hand corner at x1_min, x2_min
patch_list = []

# Todo: simplify these elif statements
Collaborator:

Can you do this in this PR? Would prefer something flatter and more modular rather than heavily indented elif statements.

deepsensor/model/model.py (resolved comment, outdated)
Comment on lines +1026 to +1038
"""
Do not remove the border for patches along the top and left of the dataset, and change the overlap size for the last patch in each row and column.

At the end of a row (when patch_x2_index = data_x2_index), to calculate the number of pixels to remove from the left hand side of the patch:
If x2 is ascending, subtract the previous patch's x2 max value from the current patch's x2 min value to get the bespoke overlap in column pixels.
To account for the clipping done to the previous patch, then subtract the patch_overlap value in pixels
to get the number of pixels to remove from the left hand side of the patch.

If x2 is descending, subtract the current patch's max x2 value from the previous patch's min x2 value to get the bespoke overlap in column pixels.
Then, as above, subtract the patch_overlap value in pixels to account for the clipping done to the previous patch,
giving the number of pixels to remove from the left hand side of the patch.

"""
Collaborator:

This would probably make more sense in the method docstring as part of a general description of the method, at the lowest indentation level.

]
return (x1_index, x2_index)

def stitch_clipped_predictions(
Collaborator:

It would probably make more sense to put this method in deepsensor.model.pred, since it does not use self, and just operates on Prediction objects. WDYT?

Collaborator Author:

Agreed. I had wondered where to separate out some of the methods used for patching; maybe the right compromise is to move this one.

Collaborator Author:

Thinking more on this, there are a number of methods specifically for stitching patches together, constituting a few hundred lines of code. A few options for organising them are: moving them into a dedicated module as functions; moving them into the pred module as functions outside of the Prediction class; moving them into the Prediction class as static methods; or moving them into a child class, e.g. PatchwisePrediction(Prediction). Which would you prefer?

)

## Cast prediction into DeepSensor.Prediction object.
# TODO: make this into a separate method.
Collaborator:

Would you like to do this in this PR, and add it to deepsensor.model.pred?

deepsensor/model/model.py (resolved comment, outdated)
Comment on lines +1115 to +1118
combined = {
var_name: xr.combine_by_coords(patches, compat="no_conflicts")
for var_name, patches in patches_clipped.items()
}
Collaborator:

Can you expose an argument for the method used to combine patches (currently only "remove_overlap" supported), or whatever you think is more appropriate? This will make it clearer how to add new combining methods (like weighted averaging) in future.

deepsensor/model/model.py (resolved comment, outdated)

# gridded predictions
assert all(isinstance(ds, xr.Dataset) for ds in pred.values())
# TODO come back to this, for artificial datasets here, shapes of predictions don't match inputs
Collaborator:

I would prefer if we get to the bottom of this and uncomment the test before submitting.
