Dimensional Modeling in Depth (PDF version)
Why Attend
Excellence in dimensional modeling remains the keystone of a well-designed data warehouse presentation area, regardless of whether you’ve adopted a Kimball, Corporate Information Factory (CIF), or hybrid architecture. This course gives you the opportunity to learn directly from the industry’s dimensional modeling thought leaders.
The word “Kimball” is synonymous with dimensional modeling. Ralph has established an extensive portfolio of dimensional techniques and vocabulary, including conformed dimensions, slowly changing dimensions, junk dimensions, mini-dimensions, bridge tables, periodic and accumulating snapshot fact tables, and the list goes on.
In this course, you will learn practical dimensional modeling techniques covering basic to advanced issues and patterns. Concepts are taught through a combination of lectures, class exercises, small group workshops, and individual problem solving illustrated by real-world industry scenarios. Our goal is for students to gain an in-depth understanding of dimensional modeling so they can confidently apply the techniques in their workplace following the training.
Who Should Attend
This course is designed for data warehouse architects, data modelers, database administrators, business analysts, and ETL or BI application developers and system designers. It’s appropriate for anyone interested in an A to Z coverage of dimensional modeling.
Instructors
Margy Ross and Ralph Kimball, co-authors of The Data Warehouse Toolkit, 3rd Edition
Course Overview
Day 1
- Dimensional Modeling Fundamentals
- Retail Sales Case Study
- Order Management Design Workshop
Day 2
- Inventory Case Study
- Billing Design Review Exercise
- Slowly Changing Dimensions
- Credit Card Design Workshop
- Insurance Case Study
Day 3
- Financial Applications, General Ledger, and Budgeting Value Chain
- Retail Bank Account Tracking Workshop
- ETL Backroom Dimensional Designs
Day 4
- Customer Behavior Patterns and Modeling Challenges
- Real Time Tracking
Course Details
Day 1
Dimensional Modeling Fundamentals
- Publishing responsibilities of DW/BI professionals
- Role of dimensional modeling in the Kimball, Corporate Information Factory (CIF), and hybrid architectures
- Fact and dimension table characteristics
- Surrogate key for dimensions
- Fact table granularity
- Degenerate dimensions
- Benefits of dimensional modeling
- 4-step design process
Retail Sales Case Study
- Transaction fact tables
- Denormalized dimension table hierarchies
- Dealing with nulls
- Dimension role-playing
- Date and time-of-day dimension considerations
- Centipede fact tables with too many dimensions
- Star versus snowflake schemas
- Factless fact tables
Order Management Design Workshop
- Complications with operational header/line data
- Allocated facts at different levels of detail
- Abstract, generic dimensions
- Freeform text comments
- Junk dimensions for miscellaneous transaction indicators
- Multiple currencies
Day 2
Inventory Case Study
- Implications of business processes on data architecture
- Semi-additive facts
- Three types of fact tables – transaction, periodic snapshot and accumulating snapshot
- Conformed dimensions – identical and shrunken roll-ups
- Enterprise Data Warehouse Bus Architecture and matrix for master data and integration
- Drilling across fact tables
- Consolidated cross-process fact tables
- Individual exercise: Translate business requirements into DW bus matrix
Billing Design Review Exercise
- Common design flaws and mistakes to avoid
- Checklist for conducting design reviews
Slowly Changing Dimensions
- Basic Type 1, 2 and 3 techniques
- Advanced techniques to deliver current and point-in-time attribute values
- Mini-dimensions for large, rapidly changing dimensions
- Multiple mini-dimensions and outriggers
Credit Card Design Workshop
- Complementary transaction and periodic snapshot schemas
- Design considerations for one dimension versus two dimensions
- Bridge tables for many-valued dimension attributes
- Fact table normalization
Insurance Case Study
- Review of design patterns and techniques
- Development of bus matrix from extended case study
- Complex, unpredictable accumulating snapshots
- Detailed implementation bus matrix
Dimensional Modeling Process
- Process flow, tasks and deliverables
Day 3
Financial Applications – Profit Equation
- Allocating costs to the same grain as revenue
- Profit margin point analysis and value banding
Financial Applications – General Ledger
- Tracking instantaneous balances
- Multiple time zones
- Drilling down in the general ledger to a document
Financial Applications – Budgeting Value Chain
- Budgets, commitments and expenditures
- Bridge tables for variable-depth ragged hierarchies
- Shared ownership and time-varying ragged hierarchies
- Pathstring alternative for ragged hierarchies
- Tracking the “age of the book”
- Calculating the “policy loss triangle”
Retail Bank Account Tracking Workshop
- Multiple account types with hundreds of potential attributes and facts
- Many-to-many account to customer map and weighted versus “impact” reports
- Tagging accounts as “about to go bankrupt”
- Super-types and sub-types
Automobile Options Exercise
- Column versus row trade-offs based on usability and scalability
Compliance-Enabled Data Warehouses
- Eliminating Type 1 and Type 3 updates
ETL Back Room Dimensional Designs
- Tracking data quality with error event fact table
- Column, structure, and business rule tests for data quality
- Reporting data quality with audit dimension
Day 4
Customer Relationship Management Payoffs Discussion
- Business users’ expectations and bottom line impact?
- Data sources needed? Common quality/integration problems?
Complex Customer Behavior Case Studies
- Building study groups
- Sequential time dependent study groups
- Applying study groups to marketing panels and medical outcomes
Building Custom User Interfaces
- Examples: car option selection, value band definition, study group creation
Customer Dimension Modeling Challenges
- Sparse but wide demographics attributes
- Finding detailed customer profile at random times in the past
- Tricky time span queries
- Simultaneous facts and dimensions
- Relationship between prospects and customers
Real Time Customer Tracking
- Hot partitions
- Handling unresolved customer identities in real time
Modeling Sequential Behavior
- Step dimension for describing sequential behavior
- RFID and web page challenges
- Modeling product purchase sequences
Big Data Analytic Use Cases
- Competing DBMS and Hadoop architectures
- Attaching dimensions to big data
- Drilling across conventional and big data sources
Final Customer-Centric Topics
- “Text” facts for customer cluster identification
- Structured questionnaires


