Attention-Based Multisource Remote Sensing Fusion Using Hyperspectral and LiDAR Observations

Authors

  • Bruce Salonen Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, USA.
  • Ananya A. Chandra Department of Computer Science, Binghamton University, Binghamton, NY, USA.
  • Malcolm Neal Department of Computer Science, George Mason University, Fairfax, VA, USA.

Keywords:

attention mechanisms; hyperspectral imaging; LiDAR; multisource fusion; remote sensing infrastructure; socio-technical systems; deep learning governance

Abstract

The integration of hyperspectral imaging and Light Detection and Ranging observations represents a transformative capability in modern remote sensing, enabling simultaneous acquisition of high-dimensional spectral signatures and precise three-dimensional structural information. While conventional fusion methods have relied primarily on stacking feature vectors or applying linear transformations, these approaches often fail to capture the complex, non-linear interactions between spectral and spatial modalities. This paper presents a comprehensive analysis of attention-based architectures for multisource remote sensing fusion, examining their capacity to model long-range dependencies, prioritize salient features, and dynamically weight contributions from heterogeneous data streams. We systematically explore the architectural trade-offs associated with attention mechanisms, including self-attention, cross-attention, and multi-head configurations, within the context of hyperspectral and LiDAR data fusion. The study further investigates the implications of such systems for large-scale infrastructure deployment, emphasizing considerations of computational efficiency, energy consumption, data governance, and model robustness across varying geographic and operational conditions. Through a cross-domain analytical lens, we examine how attention-based fusion frameworks can be designed to support sustainable and equitable deployment in environmental monitoring, urban planning, and disaster response. The paper also addresses challenges related to spectral redundancy, spatial resolution disparities, and label scarcity, proposing governance-oriented strategies for training data curation and model validation. By situating technical innovations within broader socio-technical infrastructure systems, the analysis underscores the need for transparent, auditable, and fair fusion methodologies. We conclude that attention-based multisource fusion, when informed by rigorous structural and policy considerations, offers a robust pathway toward more accurate, resilient, and interpretable remote sensing systems.

References

1. Ghamisi, P., Rasti, B., Yokoya, N., Wang, Q., Hofle, B., Bruzzone, L., ... & Benediktsson, J. A. (2019). Multisource and multitemporal data fusion in remote sensing: A comprehensive review of the state of the art. IEEE Geoscience and Remote Sensing Magazine, 7(1), 6-39.

2. Bioucas-Dias, J. M., Plaza, A., Dobigeon, N., Parente, M., Du, Q., Gader, P., & Chanussot, J. (2012). Hyperspectral remote sensing data analysis and future challenges. IEEE Geoscience and Remote Sensing Magazine, 1(2), 6-36.

3. Hoffer, R. M., & Fleming, M. D. (2020). Airborne laser scanning for vegetation structure assessment: A review of methods and applications. Remote Sensing of Environment, 245, 111852.

4. Cao, X., Xu, L., Meng, D., Zhao, Q., & Xu, Z. (2021). Integration of hyperspectral and LiDAR data for land cover classification using deep learning: A critical review. ISPRS Journal of Photogrammetry and Remote Sensing, 176, 124-140.

5. Audebert, N., Le Saux, B., & Lefevre, S. (2018). Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks. ISPRS Journal of Photogrammetry and Remote Sensing, 140, 20-32.

6. Liao, W., Pizurica, A., Philips, W., & Pi, Y. (2015). A review of fusion frameworks for hyperspectral and LiDAR data integration. IEEE Geoscience and Remote Sensing Magazine, 3(3), 8-24.

7. Ha, V. K., Ho, Q. T., & Jung, H. (2020). Convolutional neural networks for land cover classification using fused hyperspectral and LiDAR data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 13, 4726-4738.

8. Xu, D., & Ouyang, S. (2021). Spatial-spectral alignment for multimodal remote sensing fusion using graph attention networks. IEEE Transactions on Geoscience and Remote Sensing, 59(11), 9361-9374.

9. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.

10. Hong, D., Gao, L., Yokoya, N., Yao, J., Chanussot, J., Du, Q., & Zhang, B. (2021). More diverse means better: Multimodal deep learning meets remote sensing imagery classification. IEEE Transactions on Geoscience and Remote Sensing, 59(5), 4340-4354.

11. Yang, J. X., Wang, J., Li, Z., Sui, C., Long, Z., & Zhou, J. (2025). HSLiNets: Evaluating Band Ordering Strategies in Hyperspectral and LiDAR Fusion. IEEE Geoscience and Remote Sensing Letters.

12. Qi, C. R., Su, H., Mo, K., & Guibas, L. J. (2017). PointNet: Deep learning on point sets for 3D classification and segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 652-660.

13. Chen, C., Li, S., & Qin, J. (2022). Cross-attention fusion network for multisource remote sensing data classification. IEEE Geoscience and Remote Sensing Letters, 19, 1-5.

14. Guo, M. H., Xu, T. X., Liu, J. J., Liu, Z. N., Jiang, P. T., Mu, T. J., ... & Hu, S. M. (2022). Attention mechanisms in computer vision: A survey. Computational Visual Media, 8(3), 331-368.

15. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., ... & Houlsby, N. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. International Conference on Learning Representations.

16. Voita, E., Talbot, D., Moiseev, F., Sennrich, R., & Titov, I. (2019). Analyzing multi-head self-attention: Specialized heads do the heavy lifting, the rest can be pruned. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 5797-5808.

17. Zhang, X., Sun, Y., Song, H., & Li, J. (2023). Resolution-adaptive fusion of hyperspectral and LiDAR data using transformer networks. IEEE Transactions on Geoscience and Remote Sensing, 61, 1-15.

18. Kitaev, N., Kaiser, Ł., & Levskaya, A. (2020). Reformer: The efficient transformer. International Conference on Learning Representations.

19. Jain, S., & Wallace, B. C. (2019). Attention is not explanation. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics, 3543-3556.

20. Tuia, D., Persello, C., & Bruzzone, L. (2016). Domain adaptation for the classification of remote sensing data: An overview of recent advances. IEEE Geoscience and Remote Sensing Magazine, 4(2), 41-57.

21. Strubell, E., Ganesh, A., & McCallum, A. (2019). Energy and policy considerations for deep learning in NLP. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 3645-3650.

22. Rolf, E., Proctor, J., Rodolfi, T., Carleton, T., & Greicius, T. (2021). The unequal distribution of satellite-based environmental data products. Environmental Research Letters, 16(9), 094035.

23. Xu, Y., Zheng, Y., & Yu, J. (2022). Multi-modal medical image fusion using attention-guided deep networks: A review and taxonomy. Information Fusion, 83, 34-52.

24. Huang, Y., Liu, Y., Su, Y., & Zhang, L. (2022). Multi-modal fusion for autonomous driving: A survey of sensor integration and attention mechanisms. IEEE Transactions on Intelligent Transportation Systems, 23(10), 16974-16991.

25. Kampffmeyer, M., Salberg, A. B., & Jenssen, R. (2016). Semantic segmentation of small objects and modeling of uncertainty in urban remote sensing images using deep convolutional neural networks. IEEE Geoscience and Remote Sensing Letters, 13(7), 932-936.

26. Cong, Y., Khanna, S., Mok, S., & Zhu, X. X. (2023). Foundation models for Earth observation: A comprehensive survey and outlook. arXiv preprint arXiv:2311.04967.

27. Rasti, B., Ghamisi, P., & Gloaguen, R. (2020). Physics-aware deep learning for hyperspectral image analysis: A review of models, datasets, and future directions. IEEE Geoscience and Remote Sensing Magazine, 8(4), 86-112.

Downloads

Published

2026-05-27

How to Cite

Bruce Salonen, Ananya A. Chandra, & Malcolm Neal. (2026). Attention-Based Multisource Remote Sensing Fusion Using Hyperspectral and LiDAR Observations. International Journal of Artificial Intelligence Research, 1(2). Retrieved from https://isipress.org/index.php/IJAIR/article/view/202