Accelerating Real-Time Intent Discovery in Digital Advertising via High-Throughput Systems Integrating Financial-Grade Time Series Forecasting and Large Language Models
Abstract
The efficacy of modern digital advertising infrastructures is increasingly dependent on the ability to perform sub-millisecond intent discovery within highly volatile auction environments. While traditional programmatic architectures rely on historical behavioral heuristics, they often fail to capture the transient, context-dependent shifts in consumer intent that drive conversion. This paper proposes a novel system architecture that integrates financial-grade time series forecasting with large language models to facilitate real-time intent discovery. By treating user interaction streams as high-frequency financial assets, we apply temporal learning pipelines designed for market microstructure to the advertising bidding process. This integration allows for the synthesis of quantitative engagement dynamics with the deep semantic reasoning of transformer-based architectures. Our research focuses on the system-level trade-offs between inferential depth and execution latency, emphasizing the necessity of hardware-aware distributed orchestration. We explore the structural requirements for a high-throughput pipeline that can ingest billions of events daily while maintaining causal consistency across geographically disparate edge nodes. Furthermore, the paper addresses critical socio-technical dimensions, including the governance of autonomous bidding agents, the sustainability of massive-scale transformer inference, and the ethical imperatives of algorithmic fairness in digital commerce. By aligning the precision of financial engineering with the contextual awareness of natural language processing, this framework provides a robust blueprint for the next generation of digital advertising infrastructures, ensuring that intent discovery is both statistically accurate and semantically grounded.
References
1.Abadi, M., Chu, A., Goodfellow, I., McMahan, H. B., Mironov, I., Talwar, K., & Zhang, L. (2016). Deep learning with differential privacy. Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, 308–318.
2.Acemoglu, D., & Restrepo, P. (2019). Automation and new tasks: How technology displaces and creates labor. Journal of Economic Perspectives, 33(2), 3–30.
3.Baltussen, G., van Vliet, P., & van Vliet, S. (2021). The cross-section of stock returns before 1926 (and beyond). Journal of Financial Economics, 141(3), 1146–1163.
4.Bommasani, R., et al. (2021). On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258.
5.Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., ... & Amodei, D. (2020). Language models are few-shot learners. Advances in Neural Information Processing Systems, 33, 1877–1901.
6.Cartea, A., Jaimungal, S., & Penalva, J. (2015). Algorithmic and High-Frequency Trading. Cambridge University Press.
7.Chen, L., & Zheng, Z. (2023). LLM-augmented financial analysis: Challenges and opportunities. Journal of Financial Data Science, 5(4), 12–28.
8.Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113.
9.Dwork, C. (2008). Differential privacy: A survey of results. International Conference on Theory and Applications of Models of Computation, 1–19.
10.Engle, R. F. (1982). Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica, 987–1007.
11.Ghoshal, B., & Tucker, A. (2022). Scalable inference for deep learning in finance. Quantitative Finance, 22(10), 1845–1860.
12.Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.
13.Goyal, N., et al. (2023). High-throughput inference for large language models: A systems perspective. ACM SIGOPS Operating Systems Review, 57(1), 45–56.
14.Hendershott, T., Jones, C. M., & Menkveld, A. J. (2011). Does algorithmic trading improve liquidity? The Journal of Finance, 66(1), 1–33.
15.Kaplan, J., McCandlish, S., Henighan, T., Brown, T. B., Chess, B., Child, R., ... & Amodei, D. (2020). Scaling laws for neural language models. arXiv preprint arXiv:2001.08361.
16.Kirilenko, A. S., Kyle, A. S., Samadi, M., & Tuzun, T. (2017). The Flash Crash: High-frequency trading in an electronic market. The Journal of Finance, 72(3), 967–998.
17.Lo, A. W. (2017). Adaptive Markets: Financial Evolution at the Speed of Thought. Princeton University Press.
18.Liu, T. (2026). Leakage-Safe Benchmark Design for Market-Stress Early Warning: An Economically Credible Evaluation.
19.Lopez de Prado, M. (2018). Advances in Financial Machine Learning. Wiley.
20.Narayanan, D., Phanishayee, A., Shi, K., Chen, X., & Zaharia, M. (2019). PipeDream: Generalized pipeline parallelism for DNN training. Proceedings of the 27th ACM Symposium on Operating Systems Principles.
21.O’Neil, C. (2016). Weapons of Math Destruction: How Big Data Increases Inequality and Threatens Democracy. Crown.
22.Pasquale, F. (2015). The Black Box Society: The Secret Algorithms That Control Money and Information. Harvard University Press.
23.Rajbhandari, S., Rasley, J., Ruwase, O., & He, Y. (2020). ZeRO: Memory optimizations toward training trillion parameter models. SC20: International Conference for High Performance Computing, Networking, Storage and Analysis.
24.Shalf, J. (2020). The future of computing beyond Moore’s Law. Philosophical Transactions of the Royal Society A, 378(2166).
25.Shiller, R. J. (2019). Narrative Economics: How Stories Go Viral and Drive Major Economic Events. Princeton University Press.
26.Stoica, I., et al. (2017). Ray: A distributed framework for emerging AI applications. 13th USENIX Symposium on Operating Systems Design and Implementation.
27.Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.
28.Varian, H. R. (2007). Position auctions. International Journal of Industrial Organization, 25(6), 1163–1178.
29.Wu, S., et al. (2023). BloombergGPT: A large language model for finance. arXiv preprint arXiv:2303.17564.
30.Zaharia, M., et al. (2012). Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing. 9th USENIX Symposium on Networked Systems Design and Implementation.
31.Zhang, L., et al. (2021). Deep reinforcement learning for automated stock trading: An ensemble strategy. SSRN Electronic Journal.
32.Zhou, Y., et al. (2022). Mixture-of-experts with exponential selection. arXiv preprint arXiv:2202.08906.
33.Mo, F., Haddadi, H., Katiyar, K., Ansari, R., & Chuah, C. N. (2021). PPFL: Privacy-preserving federated learning with trusted execution environments. Proceedings of the 19th Annual International Conference on Mobile Systems, Applications, and Services, 94–108.
34.Wang, J., et al. (2021). A field guide to federated optimization. arXiv preprint arXiv:2107.06917.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 International Journal of Artificial Intelligence Research

This work is licensed under a Creative Commons Attribution 4.0 International License.
This article is published under the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.



