Continual Learning for Adversarially Robust Medical AI Agents in Evolving Disease Landscapes
Keywords:
continual learning, adversarial robustness, medical artificial intelligence, disease evolution, system architecture, clinical decision support, model governance, fairnessAbstract
The deployment of artificial intelligence agents in clinical settings demands not only high diagnostic accuracy but also sustained robustness against adversarial perturbations and the capacity to adapt to constantly shifting disease landscapes. Medical AI systems, from diagnostic imaging classifiers to clinical decision support tools, are currently trained on static datasets that quickly become outdated as pathogens mutate, treatment protocols change, and population demographics evolve. This paper presents a comprehensive system-level analysis of continual learning frameworks designed to maintain adversarial robustness in medical AI agents operating under real-world constraints. We examine architectural trade-offs between plasticity for new knowledge acquisition and stability for retaining previously learned representations, particularly when those representations are vulnerable to adversarial attacks that can degrade patient safety. The discussion spans infrastructure requirements for online model updates, governance mechanisms for certifying deployed models after each retraining cycle, and the ethical implications of adaptive systems that must balance fairness across subpopulations while defending against malicious inputs. Through cross-domain comparisons with autonomous driving and cybersecurity, we illustrate how medical AI faces unique challenges due to high stakes, heterogeneous data distributions, and regulatory oversight. We propose a conceptual framework that integrates continual learning with adversarial training, uncertainty quantification, and human-in-the-loop validation. The paper further addresses sustainability concerns, including computational cost, energy consumption, and the need for decentralized data governance in federated learning topologies. Finally, we outline policy recommendations for regulatory bodies to ensure that adaptive medical AI agents remain both safe and equitable as disease landscapes continue to evolve.
References
1. Esteva, A., Kuprel, B., Novoa, R. A., Ko, J., Swetter, S. M., Blau, H. M., & Thrun, S. (2017). Dermatologist-level classification of skin cancer with deep neural networks. Nature, 542(7639), 115–118.
2. Topol, E. J. (2019). High-performance medicine: The convergence of human and artificial intelligence. Nature Medicine, 25(1), 44–56.
3. Zech, J. R., Badgeley, M. A., Liu, M., Costa, A. B., Titano, J. J., & Oermann, E. K. (2018). Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: A cross-sectional study. PLOS Medicine, 15(11), e1002683.
4. Finlayson, S. G., Bowers, J. D., Ito, J., Zittrain, J. L., Beam, A. L., & Kohane, I. S. (2019). Adversarial attacks on medical machine learning. Science, 363(6433), 1287–1289.
5. Kirkpatrick, J., Pascanu, R., Rabinowitz, N., Veness, J., Desjardins, G., Rusu, A. A., ... & Hadsell, R. (2017). Overcoming catastrophic forgetting in neural networks. Proceedings of the National Academy of Sciences, 114(13), 3521–3526.
6. Lopez-Paz, D., & Ranzato, M. (2017). Gradient episodic memory for continual learning. Advances in Neural Information Processing Systems, 30, 6467–6476.
7. Madry, A., Makelov, A., Schmidt, L., Tsipras, D., & Vladu, A. (2018). Towards deep learning models resistant to adversarial attacks. International Conference on Learning Representations.
8. Ma, X., Niu, Y., Gu, L., Wang, Y., Zhao, Y., Bailey, J., & Lu, F. (2021). Understanding adversarial attacks on deep learning based medical image analysis systems. Pattern Recognition, 110, 107641.
9. Finlayson, S. G., Beam, A. L., & van der Schaar, M. (2021). Adversarial attacks on medical machine learning: A visual explanation. The New England Journal of Medicine, 384(24), 2333–2335.
10. Athalye, A., Carlini, N., & Wagner, D. (2018). Obfuscated gradients give a false sense of security: Circumventing defenses to adversarial examples. International Conference on Machine Learning, 80, 274–283.
11. Rusu, A. A., Rabinowitz, N. C., Desjardins, G., Soyer, H., Kirkpatrick, J., Kavukcuoglu, K., ... & Hadsell, R. (2016). Progressive neural networks. arXiv preprint arXiv:1606.04671.
12. Shin, H., Lee, J. K., Kim, J., & Kim, J. (2017). Continual learning with deep generative replay. Advances in Neural Information Processing Systems, 30, 2990–2999.
13. French, R. M. (1999). Catastrophic forgetting in connectionist networks. Trends in Cognitive Sciences, 3(4), 128–135.
14. Tramer, F., Carlini, N., Brendel, W., & Madry, A. (2020). On adaptive attacks to adversarial example defenses. Advances in Neural Information Processing Systems, 33, 1633–1645.
15. Sculley, D., Holt, G., Golovin, D., Davydov, E., Phillips, T., Ebner, D., ... & Dennison, D. (2015). Hidden technical debt in machine learning systems. Advances in Neural Information Processing Systems, 28, 2503–2511.
16. Finn, C., Abbeel, P., & Levine, S. (2017). Model-agnostic meta-learning for fast adaptation of deep networks. International Conference on Machine Learning, 70, 1126–1135.
17. Rebuffi, S. A., Kolesnikov, A., Sperl, G., & Lampert, C. H. (2017). iCaRL: Incremental classifier and representation learning. IEEE Conference on Computer Vision and Pattern Recognition, 2001–2010.
18. Hu, S. (2026). Research on Security Enhancement Methods for Adversarial Robust Large Language Model Intelligent Agents for Medical Decision-Making Tasks. arXiv preprint arXiv:2605.08257.
19. Blanchard, P., El Mhamdi, E. M., Guerraoui, R., & Stainer, J. (2017). Machine learning with adversaries: Byzantine tolerant gradient descent. Advances in Neural Information Processing Systems, 30, 119–129.
20. Carlini, N., & Wagner, D. (2017). Adversarial examples are not easily detected: Bypassing ten detection methods. ACM Workshop on Artificial Intelligence and Security, 3–14.
21. Gal, Y., & Ghahramani, Z. (2016). Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. International Conference on Machine Learning, 48, 1050–1059.
22. Lakshminarayanan, B., Pritzel, A., & Blundell, C. (2017). Simple and scalable predictive uncertainty estimation using deep ensembles. Advances in Neural Information Processing Systems, 30, 6402–6413.
23. US Food and Drug Administration. (2019). Proposed regulatory framework for modifications to artificial intelligence/machine learning (AI/ML)-based software as a medical device (SaMD). FDA Discussion Paper.
24. D’Amour, A., Heller, K., Moldovan, D., Adlam, B., Alipanahi, B., Beutel, A., ... & Sculley, D. (2020). Underspecification presents challenges for credibility in modern machine learning. arXiv preprint arXiv:2011.03395.
25. Abadi, M., Chu, A., Goodfellow, I., McMahan, H. B., Mironov, I., Talwar, K., & Zhang, L. (2016). Deep learning with differential privacy. ACM SIGSAC Conference on Computer and Communications Security, 308–318.
26. Wang, J., Cao, Z., Parada, P., & Song, Y. (2020). Lifelong learning with a mobile edge computing framework for medical image analysis. IEEE Transactions on Mobile Computing, 21(2), 630–643.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 International Journal of Artificial Intelligence Research

This work is licensed under a Creative Commons Attribution 4.0 International License.
This article is published under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.



