Overview

Summary

WOODS has 2 model selection methods based on DomainBed (Gulrajani and Lopez-Paz, 2021):

Train Domain Validation: We split each trainin domain into training (in) and validation (out) subsets. We tran models using the training subset and choose the model maximizing the accuracy on the union of validation subsets. We then report the test accuracy of the chosen model on the test domain.
Test Domain Validation: We choose the model maximizing the accuracy on a validation set that followes the distribution of the test domain. We allow one query (the last checkpoint) per choice of hyperparameters, disallowing early stopping. This breaks the OOD generalization assumption that models does not have access to the test domain, but this metric is still insightful as to performance of different objectives.

summary-train-Results

summary-test-Results

Spurious Fourier

SF-IID-Results

SF-train-Results

SF-test-Results

Temporal Colored MNIST Sequences

tcmnist-seq-IID-Results

TCM_seq-train-Results

TCM_seq-test-Results

Temporal Colored MNIST Steps

tcmnist-step-IID-Results

TCM_step-train-Results

TCM_step-test-Results

CAP

CAP-IID-Results

CAP-train-Results

CAP-test-Results

SEDFx

SEDFx-IID-Results

sedfx-train-Results

sedfx-test-Results

PCL

PCL-IID-Results

PCL-train-Results

PCL-test-Results

HHAR

HHAR-IID-Results

HHAR-train-Results

HHAR-test-Results

LSA64

LSA64-IID-Results

LSA64-train-Results

LSA64-test-Results