Leveraging NLP and Retrieval-Augmented Generation (RAG) Models for Automated Business Intelligence Query Resolution

Authors

  • Linda Aluso Department of Computer Information Systems, University of Louisville, KY, USA Author
  • Joy Onma Enyejo Department of Business Management, Nasarawa State University Keffi, Nasarawa State, Nigeria Author

DOI:

https://doi.org/10.32628/IJSRSET242439

Keywords:

Retrieval-Augmented Generation, Natural Language Processing, Automated Business Intelligence, Query Resolution, Conversational Analytics Systems, Adaptive Contextual Retrieval-Augmented Intelligence Engine

Abstract

The rapid expansion of enterprise data ecosystems has intensified the demand for intelligent Business Intelligence (BI) systems capable of translating natural language queries into accurate analytical insights without requiring technical expertise. Traditional BI query interfaces rely heavily on structured query languages and predefined dashboards, creating bottlenecks in decision-making workflows and limiting accessibility for non-technical stakeholders. This paper proposes a novel framework titled Adaptive Contextual Retrieval-Augmented Intelligence Engine (ACRIE), which integrates advanced Natural Language Processing (NLP) with Retrieval-Augmented Generation (RAG) to automate business intelligence query resolution with improved semantic understanding, contextual grounding, and analytical accuracy. The proposed model combines transformer-based semantic parsing using Domain-Tuned BERT embeddings, hybrid vector–symbolic retrieval, and a dynamically optimized RAG pipeline that retrieves enterprise knowledge artifacts prior to response synthesis. Unlike conventional RAG implementations that rely solely on similarity search, ACRIE introduces a Context-Aware Query Reformulation Algorithm (CAQRA) and a Multi-Stage Evidence Ranking Mechanism (MSERM) to minimize hallucinations and enhance numerical reasoning during analytical query generation. Performance is evaluated against baseline systems including Text-to-SQL transformers, standard RAG architectures, and rule-based BI assistants across benchmark enterprise datasets. Experimental comparisons demonstrate measurable improvements in query resolution accuracy, latency reduction, and analytical consistency. The proposed framework achieves higher semantic alignment scores and improved retrieval precision while maintaining scalable inference performance suitable for real-time analytics environments. Graph-based evaluations and comparative performance visualizations illustrate gains in response correctness, contextual relevance, and decision-support reliability. The findings establish that integrating adaptive retrieval intelligence with domain-aware NLP significantly advances automated BI systems, enabling conversational analytics that bridge the gap between enterprise data warehouses and executive decision-making processes. The study contributes a scalable architecture and algorithmic innovation capable of redefining next-generation intelligent analytics platforms.

Downloads

Download data is not yet available.

References

Aluso, L. (2021). Forecasting marketing ROI through cross-platform data integration between HubSpot CRM and Power BI. International Journal of Scientific Research in Science, Engineering and Technology, 8(6), 356–378. https://doi.org/10.32628/IJSRSET214420

Aluso, L., & Enyejo, J. O. (2023). Integrating ETL workflows with LLM-augmented data mapping for automated business intelligence systems. International Journal of Scientific Research and Modern Technology, 2(11), 76–89. https://doi.org/10.38124/ijsrmt.v2i11.1078

Aluso, L., Enyejo, J. O., Amebleh, J., & Balogun, S. A. (2024). A comparative analysis of SQL-based and cloud-native data warehousing architectures for real-time financial reporting. International Journal of Scientific Research and Modern Technology, 3(12), 78–90. https://doi.org/10.38124/ijsrmt.v3i12.1179

Andreas, J., Bufe, J., Burkett, D., Chen, C., Clausman, J., Crawford, J., . & Zotov, A. (2020). Task-oriented dialogue as dataflow synthesis. Transactions of the Association for Computational Linguistics, 8, 556-571.

Animasaun, J. B., Ijiga, O. M., Ayoola, V. B., & Enyejo, L. A. (2024). Impact of solvent polarity on volatile and non-volatile cannabinoid recovery: A multivariate GC-MS/LC-MS extraction optimization study. International Journal of Scientific Research and Modern Technology.

Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2017). Enriching word vectors with subword information. Transactions of the association for computational linguistics, 5, 135-146.

Buch, S., Fei-Fei, L., & Goodman, N. D. (2021). Neural event semantics for grounded language understanding. Transactions of the Association for Computational Linguistics, 9, 875-890.

Elias, M. (2012). Enhancing User Interaction with Business Intelligence Dashboards (Doctoral dissertation, Ecole Centrale Paris).

Enyejo, J. O., Adeyemi, A. F., Olola, T. M., Igba, E., & Obani, O. Q. (2024). Resilience in supply chains: How technology is helping USA companies navigate disruptions. Magna Scientia Advanced Research and Reviews, 11(02), 261–277. https://doi.org/10.30574/msarr.2024.11.2.0129

Guan, Z., Hou, F., Li, B., Phang, C. W., & Chong, A. Y. L. (2022). What influences the purchase of virtual gifts in live streaming in China? A cultural context‐sensitive model. Information Systems Journal, 32(3), 653-689.

Idika, C. N., James, U. U., Ijiga, O. M., & Enyejo, L. A. (2023). Digital twin-enabled vulnerability assessment with zero trust policy enforcement in smart manufacturing cyber-physical systems. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 9(6). https://doi.org/10.32628/CSEIT23906189

Kpogli, S. A., Onwuzurike, M. A., & Enyejo, J. O. (2024). Integrating artificial intelligence and learning sciences to reduce cognitive load and achievement gaps in data-driven K-12 instructional systems. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 10(6), 2569–2589. https://doi.org/10.32628/CSEIT25113575

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W.-t., Rocktäschel, T., Riedel, S., & Kiela, D. (2020). Retrieval-augmented generation for knowledge-intensive NLP tasks. Advances in Neural Information Processing Systems, 33, 9459–9474.

Muntala, P. S. R. P. (2022). Natural Language Querying in Oracle Fusion Analytics: A Step toward Conversational BI. International Journal of Emerging Trends in Computer Science and Information Technology, 3(3), 81-89.

Onwuzurike, M. A., & Kpogli, S. A. (2022). Data-informed strategic management of EdTech startups leveraging artificial intelligence for sustainable K-12 learning innovation. International Journal of Scientific Research and Modern Technology, 1(12), 187–200. https://doi.org/10.38124/ijsrmt.v1i12.1117

Picado, J., Ataei, P., Termehchy, A., & Fern, A. (2016). Schema independent and scalable relational learning by castor. Proceedings of the VLDB Endowment, 9(13), 1589-1592.

Schneider, C., Weinmann, M., & Vom Brocke, J. (2018). Digital nudging: guiding online user choices through interface design. Communications of the ACM, 61(7), 67-73.

Xue, H., Jiang, W., Miao, C., Ma, F., Wang, S., Yuan, Y., . & Su, L. (2020). DeepMV: Multi-view deep learning for device-free human activity recognition. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(1), 1-26.

Yang, A., Wang, Q., Liu, J., Liu, K., Lyu, Y., Wu, H., . & Li, S. (2019). Enhancing pre-trained language representations with rich knowledge for machine reading comprehension. In Proceedings of the 57th annual meeting of the association for computational linguistics (pp. 2346-2357).

Yu, T., Zhang, R., Yang, K., Yasunaga, M., Wang, D., Li, Z., . & Radev, D. (2018). Spider: A large-scale human-labeled dataset for complex and cross-domain semantic parsing and text-to-sql task. In Proceedings of the 2018 conference on empirical methods in natural language processing (pp. 3911-3921).

Downloads

Published

18-08-2024

Issue

Section

Research Articles

How to Cite

[1]
Linda Aluso and Joy Onma Enyejo, “Leveraging NLP and Retrieval-Augmented Generation (RAG) Models for Automated Business Intelligence Query Resolution”, Int J Sci Res Sci Eng Technol, vol. 11, no. 4, pp. 534–557, Aug. 2024, doi: 10.32628/IJSRSET242439.