Current position: Home >> Scientific Research >> Paper Publications

AN APPROACH FOR DESIGNING, MODELING AND REALIZING ETL PROCESSES BASED ON UNIFIED VIEWS MODEL

Release Time:2019-03-09  Hits:

Indexed by: Journal Article

Date of Publication: 2011-06-01

Journal: INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING

Included Journals: EI、SCIE

Volume: 21

Issue: 4

Page Number: 543-570

ISSN: 0218-1940

Key Words: ETL processes; data warehouses; unified views model; UML profile; conceptual modeling

Abstract: Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleaning, customization and insertion into Data Warehouses (DWs). Complexity, usability and maintainability are the primary problems concerning ETL processes. To deal with these problems, in this paper we provide a dynamic approach for designing, modeling and realizing ETL processes. We propose a new architecture based on Unified Views Model (UVM) for ETL processes, in which Unified view layer is added between source data level and DWs level. The unified views model serves as the means to conform the structure and semantics of the source data to the ones of the data warehouses, and help designers understand and analyze the meaning, relationships and lineage of information. In order to guarantee the transparency access and the usability, two mapping methods are adopted between Unified view level and source data level as well as between DWs level and Unified view level. Based on this architecture, the method of constructing UVM and ETL operations among three levels is given. Then, we describe how to build the conceptual modeling for ETL processes based on UVM by using an extension of the Unified Modeling Language (UML). Finally, we present an ETL tool based on UVM (UVETL) with the goal of facilitating the design, modeling and realization of ETL processes, and give a case study to exemplify the benefits of our proposal.

Prev One:多层集成动态生产调度系统

Next One:A hybrid particle swarm optimization algorithm for job-shop scheduling problem