Archive for August, 2008

Complex Transformations in the data warehouse

At this point, we have a populated set of temporary tables that match the destination tables in the data warehouse. These tables hold the source data, but none of the complex transformations that need to take place have been applied yet. The data may not be consistent with itself or with other data in […]

The need for Detailed Information

This is the area of the data warehouse that stores all the detailed information in the starflake schema. In many cases, the detailed information is not held online the whole time; instead, it is aggregated to the next level of detail and then offloaded to tape archive. On a rolling basis, detailed […]

Making sure data is consistent

This is when we ensure that any data being loaded does not contradict the information already within the data warehouse. In some cases, the changes could be valid - for example, updating information about the product hierarchy - but the changes need to be controlled carefully, so as not to render meaningless any of the […]

Creating Indexes And Views

At this point, the warehouse manager has to create indexes against the information in the fact or dimension tables. One would expect the index creation time to be significant, even if we only need to create an index against a single fact table partition. Because of this, most relational technologies have facilities to create indexes in […]