BioSCape

AGU 2024

View AGU 2024 Poster (PDF)

Please read below for more details on our motivation and methods.

Background

Additional Methods

Unit of analysis: We test our hypotheses using 500x500-m2 regions as our samples. At this region size, we have >3000 samples across three study sites spanning the GCFR. Data layers:

Classification Workflow: After evaluating several algorithms for their performance with our data, we settled on Random Forest, which we fit and evaluated multiple ways. First, we used a stratified random split for training, calibration and testing. We use a bootstrapping approach to understand uncertainty in the overall model accuracy. Second, we used spatial cross validation for training and testing, along with a random split for calibration. We use split conformal classification to generate a prediction set (and associated probabilities) for each pixel. While these approaches allow us to quantify epistemic uncertainty, we also quantify pixel-wise uncertainties to propagate through to our hypothesis testing (see below).

Statistical Approach for Causal Inference: We build off Van Cleemput et al. 2024, which identifies challenges and opportunities for causal inference from remotely sensed data, by integrating the following steps into our analyses: