Data Modeling for Analytics: Optimize for Queries, Not Transactions
Explains why transactional data models are inefficient for analytics and how to design denormalized, query-optimized models for better performance.
Explains why transactional data models are inefficient for analytics and how to design denormalized, query-optimized models for better performance.
Explains what a semantic layer is, its components, and how it provides consistent business definitions for data queries and AI agents.
Seven critical mistakes that can derail semantic layer projects in data engineering, with practical advice on how to avoid them.
Explains Headless BI and how a universal semantic layer centralizes metric definitions to replace tool-specific models, enabling consistent analytics.
A comprehensive guide to data modeling, explaining its meaning, three abstraction levels, techniques, and importance for modern data systems.
Explains the three levels of data modeling (conceptual, logical, physical) and their importance in database design.
Seven common data modeling mistakes that cause reporting errors and slow analytics, with practical solutions to avoid them.
Explains the difference between a metrics layer and a semantic layer in data architecture, clarifying their distinct roles and relationship.
A step-by-step guide to building a robust semantic layer for consistent data metrics, covering architecture, stakeholder alignment, and implementation.
Explores how data modeling principles adapt for modern lakehouse architectures using open formats like Apache Iceberg and the Medallion pattern.
Explains database denormalization: when to flatten data for faster analytics queries and when to avoid it.
A software architect's humorous account of a client derailing a professional Power BI dashboard project with a chaotic, self-built data model.
A monthly roundup of curated links and articles covering data engineering, Kafka, stream processing, and AI, with top picks highlighted.
An introduction to data modeling concepts, covering OLTP vs OLAP systems, normalization, and common schema designs for data engineering.
A technical exploration of using C# records and collections for immutable data models, covering benefits and practical implementation details.
Explains the CQRS pattern, its benefits for scaling read/write operations independently, and when to use it in software architecture.
A recap of AWS re:Invent 2024 Day 3, covering sessions on advanced DynamoDB data modeling and key AI/Data announcements from Amazon Bedrock.
A critique of modern bug trackers, proposing a 'separation of concerns' principle to better distinguish factual bug records from planning data.
Explores the differences between event and entity data modeling, when to use each approach, and practical design considerations for structuring data effectively.
Explores Data-Oriented Programming principles in Java, focusing on modeling data with records and sealed types.