Query Optimizer for the ETL Process in Data Warehouses

Authors(2) :-Bhadresh Pandya, S M Shah

ETL (Extraction-Transformation-Loading) process is responsible for extracting data from several sources, cleansing, transforming, integrating and loading into a data warehouse. Extraction process accesses large amount of data by executing several complex queries in source databases. These queries are repetitive and executed at regular interval to refresh the data warehouse. Extraction of data from source must be completed in a certain time window; hence it is necessary to optimize its execution time. In this paper, we delve into the optimization of queries by recommending indices which reduces cost of the queries and improves performance of the queries.

Authors and Affiliations

Bhadresh Pandya
Professor, Department of Computer Science, Kadi Sarva Vishwavidyalaya, Gandhinagar , Gujarat, India
S M Shah
Director, S. V Institute of Computer Studies, Kadi, Gujarat, India

Extraction Transformation Loading, Data Warehouses, Query Optimizer, Business Intelligence, Execution Plan of Query, Database Tuning, Query Tuning, Performance Tuning

  1. AlkisSimitsis, PanosVassiliadis, TimosSellis, Optimizing ETL Processes in Data Warehouses, In Proc. ICDE, pages 564–575, 2005.
  2. E. Malinowski, E. Zima´nyi, Hierarchies in a multidimensional model: From conceptual modeling to logical representation, Data & Knowledge Engineering, 2005 Elsevier
  3. Josep Aguilar-Saborit, Victor Munte´s-Mulero, Calisto ZuzarteJosep-L. Larriba-Pey, Star join revisited: Performance internals for cluster architectures, Data & Knowledge Engineering, 2007 Elsevier
  4. Michel Schneider, Integrated vision of federated data warehouses, Data Integration and the Semantic Web, 2006
  5. Songting Chen, Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce, 36th International Conference on Very Large Data Bases, September 13-17, 2010, Singapore.
  6. Umeshwar Dayal, Malu Castellanos, Alkis Simitsis, Kevin Wilkinson, Data Integration Flows for Business Intelligence, EDBT, 2009
  7. XuanThi Dung •WennyRahayu • David Taniar, A high performance integrated web data warehousing, Cluster Computing, 2007 - Springer
  8. Benoit Dageville, Dinesh Das, Karl Dias, Khaled Yagoub, Mohamed Zait, Mohamed Ziauddin, Automatic SQL Tuning in Oralce 10g,  VLDB Conference, Canada 2004
  9. M. GOLFARELLI, S. RIZZI, E. SALTARELLI, Index selection techniques in data warehouse systems, In Proc. DMDW, 2002
  10. Kurt Stockinger, Kesheng Wu, Bitmap Indices for Data Warehouses, In Data Warehouses and OLAP. 2007. IRM Press. London
  11. Stéphane Azefack, Kamel Aouiche, Jérôme Darmont, Dynamic index selection in data warehouses, In 4th International Conference on Innovations in Information Technology, 2007, Dubai
  13. Kai-Uwe Sattler, Eike Schallehn, Ingolf Geist, Autonomous Query-driven Index Tuning, in International Database Engineering & Applications Symposium, Portugal, 2004

Publication Details

Published in : Volume 1 | Issue 3 | May-June 2015
Date of Publication : 2015-06-25
License:  This work is licensed under a Creative Commons Attribution 4.0 International License.
Page(s) : 329-333
Manuscript Number : IJSRSET151375
Publisher : Technoscience Academy

Print ISSN : 2395-1990, Online ISSN : 2394-4099

Cite This Article :

Bhadresh Pandya, S M Shah, " Query Optimizer for the ETL Process in Data Warehouses, International Journal of Scientific Research in Science, Engineering and Technology(IJSRSET), Print ISSN : 2395-1990, Online ISSN : 2394-4099, Volume 1, Issue 3, pp.329-333, May-June-2015.
Journal URL : http://ijsrset.com/IJSRSET151375

Article Preview

Follow Us

Contact Us