We propose a general-purpose adversarial framework that learns phenotype-relevant features while explicitly removing population-related information from the latent representation.
📄 Paper
Adversarial Removal of Population Bias in Genomics Phenotype Prediction 2022 IEEE International Conference on Data Mining Workshops (ICDMW)
👉 View on IEEE Xplore
The framework consists of:
The encoder is trained to:
This ensures the learned representation contains phenotype information but not population bias.
| Method | MAE ↓ | MSE ↓ | PCC ↑ |
|---|---|---|---|
| Random Forest | 0.0719 | 0.0084 | 72.00% |
| Dual-CNN | 0.0742 | 0.0090 | 72.47% |
| RF + Linear Regression | 0.0706 | 0.0081 | 72.64% |
| Ours (Adversarial + MLP) | 0.0689 | 0.0077 | 74.17% |
Our adversarial framework improves:
Population stratification inflates associations and reduces generalization in: