• Title/Summary/Keyword: star schema

Design of Snowflake schema concept using Drill-across Operator (Drill-across연산자를 이용한 Snowflake schema 개념 설계)

  • 김경주;오근탁;이윤배
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2004.05b / pp.354-357 / 2004
  • A data warehouse is a subject-oriented, integrated, non-volatile collection of data used in OLAP (On-Line Analytical Processing) to extract information for decision making. Many studies have been devoted to multidimensional modeling with OLAP operators and the star schema. In this paper, the conceptual design of the snowflake schema is extended to object-oriented conceptual relations by using the drill-across operator. Through this design, object-oriented relation schemas that were previously not applicable have been improved.

Performance Comparison of Column-Oriented and Row-Oriented Database Systems for Star Schema Join Processing (스타 스키마 조인 처리에 대한 세로-지향 데이터베이스 시스템과 가로-지향 데이터베이스 시스템의 성능 비교)

  • Oh, Byung-Jung;Ahn, Soo-Min;Kim, Kyung-Chang
    • Journal of the Korea Society of Computer and Information / v.16 no.8 / pp.29-38 / 2011
  • Unlike traditional row-oriented database systems, a column-oriented database system stores data in column order rather than row order. Recent research results have revealed the effectiveness of column-oriented databases for applications such as data warehouses and decision support systems that access large volumes of data in a read-only manner. In this paper, we investigate join strategies for column-oriented databases and demonstrate the effectiveness of column-oriented databases in data warehouse systems. For an unbiased comparison, the two database systems are analyzed using the star schema benchmark, and a performance analysis of a star schema join query is carried out. We experimented with well-known join algorithms and considered early materialization and late materialization join strategies for column-oriented databases. The performance results confirm that star schema join queries perform better in terms of disk I/O cost in column-oriented databases than in row-oriented databases. In addition, the late materialization strategy showed a larger performance gain than the early materialization strategy in column-oriented databases.
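
The difference between the two materialization strategies named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy tables, the column names (`date_key`, `revenue`, `discount`, `year`) and both function names are assumptions made up for illustration, and plain Python lists stand in for on-disk columns.

```python
# Toy column store: each table maps a column name to a list of values.
# All table and column names here are hypothetical.
fact = {
    "date_key": [1, 2, 1, 3, 2],
    "revenue": [10, 20, 30, 40, 50],
    "discount": [0, 1, 0, 2, 1],
}
dim_date = {
    "date_key": [1, 2, 3],
    "year": [1997, 1998, 1997],
}


def early_materialization(fact, dim, year):
    """Stitch every fact column into full rows first, then join and filter."""
    rows = list(zip(fact["date_key"], fact["revenue"], fact["discount"]))
    year_of = dict(zip(dim["date_key"], dim["year"]))
    return sum(rev for key, rev, _disc in rows if year_of[key] == year)


def late_materialization(fact, dim, year):
    """Join on the key column alone, then fetch revenue only at matching positions."""
    keys = {k for k, y in zip(dim["date_key"], dim["year"]) if y == year}
    positions = [i for i, k in enumerate(fact["date_key"]) if k in keys]
    revenue = fact["revenue"]
    return sum(revenue[i] for i in positions)


print(early_materialization(fact, dim_date, 1997))
print(late_materialization(fact, dim_date, 1997))
```

Both functions return the same total. The late variant never reads the `discount` column and touches `revenue` only at qualifying positions, which is the kind of saving in column reads that late materialization aims for.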

Self Maintainable Data Warehouse Views for Multiple Data Sources (다중 데이터 원천을 가지는 데이터웨어하우스 뷰의 자율갱신)

  • Lee, Woo-Key
    • Asia pacific journal of information systems / v.14 no.3 / pp.169-187 / 2004
  • Self-maintainability of data warehouse (DW) views is the ability to maintain the DW views without requiring access to (i) any underlying databases or (ii) any information beyond the DW views and the delta of the databases. With our proposed method, DW views can be updated by using only the old views and the differential files such as different files, referential integrity differential files, linked differential files, and backward-linked differential files that keep the truly relevant tuples in the delta. This method avoids accessing the underlying databases, in that it achieves self-maintainability even when preparing auxiliary information. We show that our method is applicable to DW views that contain joins over relations in a star schema, a snowflake schema, or a galaxy schema.

XML2Star Algorithm Creating Star Schema from Source Data in XML (XML 소스 데이터로부터 스타 스키마를 생성하기 위한 XML2Star 알고리즘)

  • 최은하;김진호
    • Proceedings of the Korean Information Science Society Conference / 2002.10c / pp.190-192 / 2002
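  • A data warehouse is a collection of data extracted from an enterprise's operational databases to support enterprise decision making, and it is used for OLAP analysis. To enable varied analysis, OLAP represents this data in a multidimensional data model and uses it for complex query processing and multidimensional data analysis. The star schema is widely used to represent OLAP's multidimensional data in a relational database. Until now, data warehouses have generally designed and built star schemas from source data designed with ER diagrams. However, with the recent rapid growth of the Internet, document transmission and information exchange over the Internet using XML, the standard for next-generation web documents, have become widespread, and as multidimensional analysis of XML documents is increasingly required, data warehouses need to design and store star schemas derived from XML documents. In this paper, we therefore developed the XML2Star algorithm, which generates an attribute tree from an XML DTD to design a star schema, then extracts star schema instances from XML documents conforming to that DTD and stores them in a relational database. This enables enterprises and users to perform multidimensional analysis in OLAP using XML-based star schemas.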

MLPPI Wizard: An Automated Multi-level Partitioning Tool on Analytical Workloads

  • Suh, Young-Kyoon;Crolotte, Alain;Kostamaa, Pekka
    • KSII Transactions on Internet and Information Systems (TIIS) / v.12 no.4 / pp.1693-1713 / 2018
  • An important technique used by database administrators (DBAs) to improve performance in decision-support workloads associated with a star schema is multi-level partitioning. Queries then benefit from performance improvements via partition elimination, due to constraints on queries expressed on the dimension tables. As the task of multi-level partitioning can be overwhelming for a DBA, we propose a wizard that facilitates the task by calculating a partitioning scheme for a particular workload. The system resides completely on a client and interacts with the cost estimation subsystem of the query optimizer via an API over the network, thereby eliminating any need to make changes to the optimizer. In addition, since only cost estimates are needed, the wizard overhead is very low. By using a greedy algorithm for search space enumeration over the query predicates in the workload, the wizard is efficient, with worst-case polynomial complexity. The proposed technology can be applied to any clustering or partitioning scheme in any database management system that provides an interface to the query optimizer. Applied to the Teradata database, the technology provides recommendations that outperform a human expert's solution as measured by the total execution time of the workload. We also demonstrate the scalability of our approach when the fact table (and workload) size increases.
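
Partition elimination under multi-level partitioning, as mentioned in this abstract, can be sketched roughly as follows. This is a toy model, not the wizard or any Teradata API: the two partitioning levels (month and region), the constants `MONTHS` and `REGIONS`, and both function names are assumptions chosen for illustration.

```python
from itertools import product

# Hypothetical two-level partitioning of a fact table:
# level 1 by month (1..12), level 2 by region id (1..4).
MONTHS = range(1, 13)
REGIONS = range(1, 5)


def all_partitions():
    """Every combined (month, region) partition of the fact table."""
    return list(product(MONTHS, REGIONS))


def partitions_to_scan(month_pred=None, region_pred=None):
    """Keep only the combined partitions that can satisfy the query predicates.

    A predicate of None means the query places no constraint on that level.
    """
    return [
        (m, r)
        for m, r in product(MONTHS, REGIONS)
        if (month_pred is None or month_pred(m))
        and (region_pred is None or region_pred(r))
    ]


# A query constrained to the first quarter and to region 2.
survivors = partitions_to_scan(lambda m: m <= 3, lambda r: r == 2)
print(len(all_partitions()), len(survivors))
```

With constraints on both levels, only 3 of the 48 combined partitions need to be scanned. A query constrained on a single level still eliminates partitions, only fewer of them.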

Proposal And Application of An OLAP View For the Definition of the Users Requirements (사용자 요구사항 정의를 위한 OLAP View의 제안 및 활용)

  • Choi, Sung-Kyo;Park, Jong-Mo
    • The KIPS Transactions:PartD / v.11D no.4 / pp.967-974 / 2004
  • An OLAP system is a decision support tool with which a user can interactively analyze information from various aspects. However, the traditional way of constructing an OLAP system is inefficient: frequent changes in users' requirements increase the time and cost. In this paper, we analyze the existing construction procedure of OLAP systems and propose a prototyping OLAP View based on the star schema, which can help and guide the designer in efficiently supporting users' requirements. Through an implementation for the S shopping mall using the OLAP View, we show that our proposal is able to support an OLAP construction strategy and provide an accurate understanding of data resources. With an OLAP View, we can reduce the time and cost required to implement an OLAP system.

Generic Multidimensional Model of Complex Data: Design and Implementation

  • Khrouf, Kais;Turki, Hela
    • International Journal of Computer Science & Network Security / v.21 no.12spc / pp.643-647 / 2021
  • Analyzing large volumes of data to deduce knowledge and new information is a challenge. Data can be heterogeneous and complex: semi-structured data (for example, XML), data from social networks (for example, tweets), and factual data (for example, the spread of Covid-19). In this paper, we propose a generic multidimensional model for analyzing complex data according to several dimensions.

A Design and Practical Use of Spatial Data Warehouse for Spatial Decision Making (공간적 의사결정을 위한 공간 데이터 웨어하우스 설계 및 활용)

  • Park Ji-Man;Hwang Chul-sue
    • Spatial Information Research / v.13 no.3 s.34 / pp.239-252 / 2005
  • The major reason that spatial data warehousing has attracted a great deal of attention in business GIS in recent years is the wide availability of huge amounts of spatial data and the imminent need for turning such data into useful geographic information. Therefore, this research focuses on designing and implementing a pilot system for spatial decision making. The purpose of the system is to predict target marketing areas by discriminating among customers using both the transaction quantity and the number of customers using credit cards in a department store. Moreover, the pilot system provides OLAP tools for interactive analysis of multidimensional data at various geographic granularities, which facilitates effective spatial data mining. Focusing on the analysis methodology, the case study aims to use GIS and clustering for knowledge discovery. In particular, the importance of this study lies in its use of the capabilities of the snowflake schema model within a GIS framework.

Translation of Star Schema into Entity-Relationship Diagrams for Data Warehouse Conceptual Design (데이터 웨어하우스의 개념적 설계를 위한 스타 스키마에서 ER 도형으로의 변환 기법)

  • 최은하;김진호;옥수호
    • Proceedings of the Korean Information Science Society Conference / 2002.04b / pp.142-144 / 2002
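  • A data warehouse is a collection of data extracted from an enterprise's operational databases to support enterprise decision making, and it is used for OLAP analysis. OLAP represents this data as multidimensional data so that it can be analyzed in various ways, and uses it for complex analytical query processing and multidimensional data analysis. The star schema is widely used to represent OLAP's multidimensional data in a relational database. In addition, because multidimensional data and data warehouses represent the data of an entire enterprise, which is vast in volume, designing them is very complex and requires considerable effort. A systematic design methodology is therefore needed. Operational databases, which are the source data for a data warehouse, are currently widely designed at the conceptual level using ER diagrams. This paper therefore presents a conceptual methodology for designing a data warehouse from operational databases designed with ER diagrams. To that end, it presents methods for representing and translating the various types of star schema usable for OLAP analysis as ER diagrams. This provides methods and guidelines for conveniently designing, at the conceptual level with ER diagrams, the data warehouse that must be maintained to obtain the desired multidimensional data, and it also makes the meaning of each type of star schema easy to convey conceptually.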

Development of the Design Methodology for Large-scale Data Warehouse based on MongoDB

  • Lee, Junho;Joo, Kyungsoo
    • Journal of the Korea Society of Computer and Information / v.23 no.3 / pp.49-54 / 2018
  • A data warehouse is a system that collectively manages and integrates the data of a company and provides the basis for decision making on management strategy. Nowadays, analysis data volumes are reaching a critical size that challenges traditional data warehousing approaches. Currently implemented solutions are mainly based on relational databases, which are no longer suited to these data volumes. NoSQL solutions allow us to consider new approaches to data warehousing, especially from the multidimensional data management point of view. In this paper, we extend the data warehouse design methodology based on relational databases using the star schema, and develop a consistent design methodology, from information requirement analysis to data warehouse construction, for large-scale data warehouses based on MongoDB, a NoSQL database.