Data Modeling

This stage is performed entirely in the IDE tool.

The IDE tool uses the following ITBA DWH definitions:

  • Target Entity is a Fact or Dimension.

    The entity name and column names are defined according to the Star-Schema Architecture.

  • Integration Entity is the source table representation.

    The entity name, column names, and data types are defined according to the source table design.

  • Source Entity is the actual and accurate representation of an entity to be extracted from the source.

    It describes what data is extracted from the source and how. It is generated automatically from the integration entity design. However, because the automatic generation targets the DB extractor only, the source entity may need manual changes when a different extractor is used. You can choose one of three extractor types in the source entity's properties: Direct (DB extractor), Fixed, and Rule (API extractor or Web Service).

  • Transformation Entity is the logical representation of all the transformations that are performed by the DWH during the data processing. It’s the accurate representation of the Target Entity.

  • Column Mapping is the mapping of source columns to target columns.
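
The relationships among these definitions can be sketched in code. The following Python sketch is illustrative only: it is not the IDE tool's API, and every class, function, table, and column name in it (for example `generate_source_entity`, `INCIDENT`, `INCIDENT_FACT`) is a hypothetical name chosen for the example. It assumes that automatic generation simply copies the integration entity's columns and defaults to the Direct extractor.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List


class ExtractorType(Enum):
    """The three extractor types listed for the source entity."""
    DIRECT = "Direct"  # DB extractor
    FIXED = "Fixed"
    RULE = "Rule"      # API extractor or Web Service


@dataclass
class Column:
    name: str
    data_type: str


@dataclass
class IntegrationEntity:
    """Mirrors the source table: same column names and data types."""
    name: str
    columns: List[Column]


@dataclass
class SourceEntity:
    """Describes what is actually extracted, and by which extractor."""
    name: str
    columns: List[Column]
    extractor: ExtractorType = ExtractorType.DIRECT


@dataclass
class TargetEntity:
    """A Fact or Dimension in the star schema."""
    name: str
    kind: str  # "Fact" or "Dimension"
    columns: List[str]


def generate_source_entity(integration: IntegrationEntity) -> SourceEntity:
    # Assumption: generation copies the integration design and targets
    # the DB (Direct) extractor, as the text describes.
    copied = [Column(c.name, c.data_type) for c in integration.columns]
    return SourceEntity(name=integration.name, columns=copied)


def unmapped_targets(target: TargetEntity, mapping: Dict[str, str]) -> List[str]:
    # Target columns that no source column is mapped to yet.
    mapped = set(mapping.values())
    return [c for c in target.columns if c not in mapped]


# Hypothetical example data.
integration = IntegrationEntity(
    "INCIDENT", [Column("ID", "NUMBER"), Column("OPEN_TIME", "DATE")]
)
source = generate_source_entity(integration)
source.extractor = ExtractorType.RULE  # manual change for a non-DB extractor

target = TargetEntity("INCIDENT_FACT", "Fact", ["INCIDENT_ID", "OPENED_AT"])
column_mapping = {"ID": "INCIDENT_ID"}  # source column -> target column

print(unmapped_targets(target, column_mapping))
```

Keeping the column mapping as a plain source-to-target dictionary makes it easy to check which target columns still lack a source, which is the kind of gap a modeler would want to see before a final version is reached.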

The modeling approach:

The Solution Modeling Highlights:

  • Different functions collaborate on the same models simultaneously until a final version is reached.
  • Effectively serves as a high-level DB Studio and ETL tool, so there is no need to touch the DB directly.
  • 95% of the solution is defined by this high-level modeling.