| --- |
| base_model: |
| - ibm-esa-geospatial/TerraMind-1.0-base |
| pipeline_tag: image-classification |
| tags: |
| - methane |
| - detection |
| - geospatial |
| - terramind |
| --- |
| |
| # FAST-EO Use Case 2 - Methane Detection |
|
|
| This repository contains data and code to reproduce experiments for fine-tuning **TerraMind-Base** to detect methane-related signatures in multispectral imagery. It includes multiple experiment variants and their corresponding datasets. |
|
|
| The file `Methane_benchmark_patches_summary_v3.xlsx` provides per-patch descriptions and defines the **fold splits** used to ensure non-overlapping partitions. Runner scripts use this Excel file to build train/val/test splits, typically reserving one fold for testing. |
|
|
| All scripts provide usage instructions via `--help` (or `-h`). |
|
|
| ## Requirements |
|
|
| - Install the TerraMind/Terratorch stack used by the project before running experiments. |
| - Ensure your environment has the required dependencies for data loading and training. |
|
|
| ## Experiments |
|
|
| ### Experiment 1: Fine-tuning on Methane Benchmark Dataset (MBD) |
|
|
| Fine-tune TerraMind-Base on the Methane Benchmark Dataset. The normalized dataset is provided in: |
|
|
| - `MBD_nan_S2_zscore/` |
|
|
| Training code is located in: |
|
|
| - `classification/` (includes dataset and dataloader classes) |
|
|
| ### Experiment 2: Fine-tuning on MBD with text captions |
|
|
| This experiment modifies the TerraMind-Base model to concatenate text-caption embeddings with visual embeddings. |
|
|
| - Caption embeddings are computed with `all-MiniLM-L6-v2`. |
| - Code and resources are in `classification_with_text/`. |
| - Original captions: `classification_with_text/MBD_text/` |
| - Precomputed embeddings: `combined_caption_embeddings.csv` |
|
|
| ### Experiment 3: Sentinel-2 with simulated atmospheric conditions |
|
|
| Evaluate generalization on Sentinel-2 data with simulated atmospheric conditions in both: |
| - Top-of-Atmosphere (TOA) |
| - Bottom-of-Atmosphere (BOA) |
|
|
| The model can be trained on this simulated data or used only for inference to test cross-domain robustness. |
|
|
| ### Experiment 4: Intuition-1 with simulated atmospheric conditions |
|
|
| Analogous to Experiment 3, but using Intuition-1 imagery with simulated atmospheric conditions (TOA and BOA). This experiment tests robustness under domain shift. |
|
|
| ### Experiment 5: Urban dataset without methane (false-positive stress test) |
|
|
| A control dataset containing only urban imagery without methane is used to check whether models learn methane-specific cues rather than urban signatures. The goal is to quantify false positives. |
|
|
| Scripts for loading and running inference on this dataset are provided in the repository. |
|
|
| ## Notes |
|
|
| - Use `--help` on each runner/training script to see available options. |
| - Keep fold definitions consistent with `Methane_benchmark_patches_summary_v3.xlsx` to ensure comparable results across experiments. |
|
|