If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.

 
Go Back  dBforums > General > Database Concepts & Design > Can a Data Warehouse Have a Normalized Design?

Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1 (permalink)  
Old 09-14-06, 12:24
DBADave DBADave is offline
Registered User
 
Join Date: Jan 2003
Posts: 95
Can a Data Warehouse Have a Normalized Design?

I only have a basic understanding of data warehouse design. I've always thought a data warehouse is primarily a denormalized model, but recently we have been speaking with a data warehouse vendor who says their product is normalized and is not a Star or Snowflake schema. This sounds more like a data store. Can a data warehouse have a primarily normalized model?

Thanks, Dave
Reply With Quote
  #2 (permalink)  
Old 09-14-06, 17:27
loquin loquin is offline
Super Moderator
 
Join Date: Jun 2004
Location: Arizona, USA
Posts: 1,797
Sure, a data warehouse can have a normalized design, but, in a data warehouse, that's probably NOT a good thing.
__________________
Lou
使大吃一惊
"Lisa, in this house, we obey the laws of thermodynamics!" - Homer Simpson
"I have my standards. They may be low, but I have them!" - Bette Middler
"It's a book about a Spanish guy named Manual. You should read it." - Dilbert

Reply With Quote
  #3 (permalink)  
Old 09-14-06, 23:17
blindman blindman is offline
World Class Flame Warrior
 
Join Date: Jun 2003
Location: Ohio
Posts: 11,726
Actually, I would RECOMMEND a normalized schema for a data warehouse, with additional tables for commonly aggregated data. I would use a Star or Snowflake schema for datamarts that are spun off of the data warehouse.
It sounds like your vendor is an adherant of Inmon's philosophy of data warehousing, rather than Kimball's "one-size star schema fits all" approach.
__________________
If it's not practically useful, then it's practically useless.

blindman
www.chess.com: "sqlblindman"
Reply With Quote
  #4 (permalink)  
Old 09-14-06, 23:22
DBADave DBADave is offline
Registered User
 
Join Date: Jan 2003
Posts: 95
I read a few articles this morning on the Internet about Enterprise Data Warehouses and how the designs are typically normalized vs. a Data Warehouse, where the model is denormalized. It looks like we are wanting an Enterprise Data Warehouse.

Thanks, Dave
Reply With Quote
  #5 (permalink)  
Old 09-15-06, 11:15
blindman blindman is offline
World Class Flame Warrior
 
Join Date: Jun 2003
Location: Ohio
Posts: 11,726
It is very easy to create non-normalized data from normalized data.
It is very difficult to create normalized data from non-normalized data.
Lesson: Store your data in a normalized schema. Create additional non-normalized and pre-aggregated tables if processing speed becomes an issue.
__________________
If it's not practically useful, then it's practically useless.

blindman
www.chess.com: "sqlblindman"
Reply With Quote
  #6 (permalink)  
Old 09-15-06, 12:18
certus certus is offline
Registered User
 
Join Date: Dec 2003
Location: Canada
Posts: 710
There is another angle to it.

A dimensional (denormalized) data warehouse is the product of a series of translations.

1. Data Sources are analyzed.
2. A normalized model called the Operational Data Store is created to represent the enterprise.
3. A Dimensional Data Store is created based on the normalized Operational Data Store.
4. ETL routines are written to move data from the Source Systems to the Operational Data Store
5. ETL routines are written to move data from the Operational Data Store to the Dimensional Data Store
6. Data is then extracted transformed and loaded from the Data Sources into the Operational Data Store
7. Data is then extracted transformed and loaded from the Operational Data Store into the Dimensional Data Store
8. Analytical tools are used against the Dimensional Data Store for reporting
9. The two step ETL process (6, 7 & 8) is repeated on a schedule as frequently as every day

This is the Kimball method.

Similar processes are used to create subsets that function as Data Marts.

I can't comment on Inmon's strategy except that he is in favor of normalized models (Operational Data Stores or even properly normalized Source Systems) being used without modification as processing power increases and makes the performance barriers of multiple joins go down. Kimball's rebuttal is that dimensional model is easier to grasp by the end user than a complex normalized data model.
__________________
visit: relationary

Last edited by certus; 09-15-06 at 13:47.
Reply With Quote
  #7 (permalink)  
Old 09-15-06, 13:09
DBADave DBADave is offline
Registered User
 
Join Date: Jan 2003
Posts: 95
This is very helpful. Thanks
Reply With Quote
  #8 (permalink)  
Old 09-15-06, 17:44
blindman blindman is offline
World Class Flame Warrior
 
Join Date: Jun 2003
Location: Ohio
Posts: 11,726
Kimball doesn't do data warehouses. He sells datamarts and calls them datawarehouses.
Actually, he sells seminars.
And telling people that they can create a datawarehouse just by using a simple star schema is a great way to get people who don't know jack about data warehousing to pay to attend your seminar.
__________________
If it's not practically useful, then it's practically useless.

blindman
www.chess.com: "sqlblindman"
Reply With Quote
  #9 (permalink)  
Old 09-17-06, 00:07
certus certus is offline
Registered User
 
Join Date: Dec 2003
Location: Canada
Posts: 710
I don't agree, blindman. I've worked on extremely large star schemas. They worked and they definitely weren't datamarts.
__________________
visit: relationary
Reply With Quote
  #10 (permalink)  
Old 09-17-06, 09:35
blindman blindman is offline
World Class Flame Warrior
 
Join Date: Jun 2003
Location: Ohio
Posts: 11,726
There is simply a limit to the complexity of relationships that can be established with a fixed star schema. Any logical design that can be boiled down to a single fact table and associated dimensions is a datamart, not a datawarehouse. How large it is, as in "number of records", is irrelevant. What matters is the number of tables, the complexity of the relationships, and the variety of data from different areas of the business enterprise that it is able to store.
__________________
If it's not practically useful, then it's practically useless.

blindman
www.chess.com: "sqlblindman"
Reply With Quote
  #11 (permalink)  
Old 09-17-06, 23:17
certus certus is offline
Registered User
 
Join Date: Dec 2003
Location: Canada
Posts: 710
Kimball does not limit the number of fact tables to one. You can have as many fact tables and dimensions as you need. A dimension can bridge more than one fact table.
__________________
visit: relationary
Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On