Inference

There are two practical inference workflows in this repository:
- run the standalone script inference.py
- call PixelDiffusionConditional.predict_step(...) directly

DepthDif supports pixel-space configs (configs/px_space/*) and latent-workflow configs (configs/lat_space/*).
For latent workflow setup and command flow, see Autoencoder + Latent Diffusion.

Workflow 1: Use inference.py

inference.py is a configurable script for quick prediction sanity checks.

What it supports

  • load the config files and instantiate the model and datamodule
  • load a checkpoint (explicit override, or model.load_checkpoint / model.resume_checkpoint)
  • run from either:
    • a dataloader sample (MODE="dataloader")
    • a synthetic random batch (MODE="random")
  • optional capture of intermediate samples

Important script settings

At the top of inference.py, set:
- MODEL_CONFIG_PATH
- DATA_CONFIG_PATH
- TRAIN_CONFIG_PATH
- CHECKPOINT_PATH (or keep None to use model.load_checkpoint then model.resume_checkpoint)
- MODE, LOADER_SPLIT, DEVICE, INCLUDE_INTERMEDIATES
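The constants above might be filled in as follows. This is a minimal sketch: the constant names come from the section above, but the specific values (and the comments explaining them) are illustrative assumptions, not defaults shipped with the script.

```python
# Illustrative values for the constants at the top of inference.py.
# Paths follow the OSTIA + Argo disk setup described below; adjust to your run.
MODEL_CONFIG_PATH = "configs/px_space/model_config.yaml"
DATA_CONFIG_PATH = "configs/px_space/data_ostia_argo_disk.yaml"
TRAIN_CONFIG_PATH = "configs/px_space/training_config.yaml"

CHECKPOINT_PATH = None          # None -> fall back to model.load_checkpoint, then model.resume_checkpoint
MODE = "dataloader"             # "dataloader" or "random"
LOADER_SPLIT = "val"            # which dataloader split to draw the sample batch from
DEVICE = "cpu"                  # or "cuda"
INCLUDE_INTERMEDIATES = False   # capture intermediate denoising samples
```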

Note on default paths

The script constants should be set explicitly. In this repository, the actively used configs are:
- OSTIA + Argo disk setup: configs/px_space/model_config.yaml, configs/px_space/data_ostia_argo_disk.yaml, configs/px_space/training_config.yaml

Workflow 2: Direct predict_step

The model inference entry point is:
- PixelDiffusionConditional.predict_step(batch, batch_idx=0)

Minimum required batch keys:
- x
- x_valid_mask
- y_valid_mask

Common optional keys:
- eo
- x_valid_mask_1d
- land_mask
- coords
- date
- sampler
- clamp_known_pixels
- return_intermediates
- intermediate_step_indices

Returned outputs

predict_step returns a dictionary containing:
- y_hat: standardized prediction
- y_hat_denorm: temperature-denormalized prediction, masked to NaN where y_valid_mask==0
- denoise_samples: reverse samples (if requested)
- x0_denoise_samples: per-step x0 predictions (if requested)
- sampler: sampler used for prediction
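Because y_hat_denorm carries NaN wherever y_valid_mask is 0, downstream reductions should be NaN-aware. A minimal sketch, using a hand-built stand-in for the returned dict (the tensor values here are fabricated for illustration):

```python
import torch

# Stand-in for the dict returned by predict_step (values are illustrative).
pred = {
    "y_hat": torch.zeros(1, 1, 2, 2),
    "y_hat_denorm": torch.tensor([[[[20.0, float("nan")],
                                    [21.0, 22.0]]]]),   # NaN = masked-out pixel
    "sampler": "ddpm",
}

# Reduce with nanmean so masked (NaN) pixels are ignored.
valid_mean = torch.nanmean(pred["y_hat_denorm"])  # mean over valid pixels only
```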

Example (ostia_argo_disk config)

import torch  

from data.datamodule import DepthTileDataModule  
from data.dataset_ostia_argo_disk import OstiaArgoTiffDataset  
from models.difFF import PixelDiffusionConditional  

model_config = "configs/px_space/model_config.yaml"  
data_config = "configs/px_space/data_ostia_argo_disk.yaml"  
train_config = "configs/px_space/training_config.yaml"  
ckpt_path = "logs/<run>/best-epochXXX.ckpt"  

train_dataset = OstiaArgoTiffDataset.from_config(data_config, split="train")  
val_dataset = OstiaArgoTiffDataset.from_config(data_config, split="val")  
datamodule = DepthTileDataModule(dataset=train_dataset, val_dataset=val_dataset)  

model = PixelDiffusionConditional.from_config(  
    model_config_path=model_config,  
    data_config_path=data_config,  
    training_config_path=train_config,  
    datamodule=datamodule,  
)  

# Lightning checkpoints store weights under "state_dict"; raw state dicts load as-is
state = torch.load(ckpt_path, map_location="cpu")
state_dict = state.get("state_dict", state)
model.load_state_dict(state_dict, strict=False)
model.eval()  

batch = next(iter(datamodule.val_dataloader()))  
with torch.no_grad():  
    pred = model.predict_step(batch, batch_idx=0)  

y_hat = pred["y_hat"]  
y_hat_denorm = pred["y_hat_denorm"]  

Sampler Choice

The validation/inference sampler can be switched via the training config:
- training.validation_sampling.sampler: "ddpm" or "ddim"
- DDIM-specific controls:
  - ddim_num_timesteps
  - ddim_eta
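As a config fragment, the keys above might look like this. The exact nesting is inferred from the key names listed above, and the values are illustrative; check your actual training_config.yaml.

```yaml
# Assumed structure, inferred from the key names above.
training:
  validation_sampling:
    sampler: "ddim"         # or "ddpm"
    ddim_num_timesteps: 50  # illustrative value
    ddim_eta: 0.0           # 0.0 = deterministic DDIM sampling
```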

A sampler can also be injected per batch via batch["sampler"] when calling predict_step directly.