KPLabs
/

TerraMind-Methane-Classification

Image Classification

Model card Files Files and versions

TerraMind-Methane-Classification / README.md

KPLabs's picture

Update README.md

3e10a2c verified 2 months ago

|

history blame contribute delete

2.81 kB

	---
	base_model:
	- ibm-esa-geospatial/TerraMind-1.0-base
	pipeline_tag: image-classification
	tags:
	- methane
	- detection
	- geospatial
	- terramind
	---

	# FAST-EO Use Case 2 - Methane Detection

	This repository contains data and code to reproduce experiments for fine-tuning TerraMind-Base to detect methane-related signatures in multispectral imagery. It includes multiple experiment variants and their corresponding datasets.

	The file `Methane_benchmark_patches_summary_v3.xlsx` provides per-patch descriptions and defines the fold splits used to ensure non-overlapping partitions. Runner scripts use this Excel file to build train/val/test splits, typically reserving one fold for testing.

	All scripts provide usage instructions via `--help` (or `-h`).

	## Requirements

	- Install the TerraMind/Terratorch stack used by the project before running experiments.
	- Ensure your environment has the required dependencies for data loading and training.

	## Experiments

	### Experiment 1: Fine-tuning on Methane Benchmark Dataset (MBD)

	Fine-tune TerraMind-Base on the Methane Benchmark Dataset. The normalized dataset is provided in:

	- `MBD_nan_S2_zscore/`

	Training code is located in:

	- `classification/` (includes dataset and dataloader classes)

	### Experiment 2: Fine-tuning on MBD with text captions

	This experiment modifies the TerraMind-Base model to concatenate text-caption embeddings with visual embeddings.

	- Caption embeddings are computed with `all-MiniLM-L6-v2`.
	- Code and resources are in `classification_with_text/`.
	- Original captions: `classification_with_text/MBD_text/`
	- Precomputed embeddings: `combined_caption_embeddings.csv`

	### Experiment 3: Sentinel-2 with simulated atmospheric conditions

	Evaluate generalization on Sentinel-2 data with simulated atmospheric conditions in both:
	- Top-of-Atmosphere (TOA)
	- Bottom-of-Atmosphere (BOA)

	The model can be trained on this simulated data or used only for inference to test cross-domain robustness.

	### Experiment 4: Intuition-1 with simulated atmospheric conditions

	Analogous to Experiment 3, but using Intuition-1 imagery with simulated atmospheric conditions (TOA and BOA). This experiment tests robustness under domain shift.

	### Experiment 5: Urban dataset without methane (false-positive stress test)

	A control dataset containing only urban imagery without methane is used to check whether models learn methane-specific cues rather than urban signatures. The goal is to quantify false positives.

	Scripts for loading and running inference on this dataset are provided in the repository.

	## Notes

	- Use `--help` on each runner/training script to see available options.
	- Keep fold definitions consistent with `Methane_benchmark_patches_summary_v3.xlsx` to ensure comparable results across experiments.